Retrosheet parser update
20 November 2009 1:46 pm
I updated my Retrosheet parser / database creator thingy to work with Chadwick 0.5.2, which introduced six new extra fields. The change also includes a new way of running the CSV files into MySQL- it’s slower, but more complete, as it uses the actual headers from the Chadwick export rather than loading the entire file at once.
Parsing the files from 1953 – 2008 took a little over two hours on my rather macho Linux box. 8,594,270 events.
Also: in awesome news, Retrosheet has completed the game files for 2009 and they should be available this weekend. I can’t say enough good things about the guys over there. They do incredible work and our lives – all of us baseball obsessives – are far better for it. ¶
Leave a comment