My investigations led to fixing an error in the publication of King County restaurant data.
I spoke with Hilary Karasz, and her tech guy, Mark, fixed the error in the publication of their data. They are once again publishing violation information on their site. It also appears that they have gone back and re-published the records for the time period where violation data was missing. I think the fix on our end now is to rescrape back to November 1 or so.
Some links for fact-checking:
* Here’s the TAQUERIA EL MAGUEY record that sparked our inquiry: http://www.decadeonline.com/insp.phtml?agency=skc&forceresults=1&record_id=PR0077496
* It now has violation data in there, which wasn’t there before, as reflected by our record of the inspection: http://seattle.everyblock.com/restaurant-inspections/by-date/2008/12/31/588057/
* They fixed the system, and our scraper picked up the fix, as seen in this link from an inspection this week: http://seattle.everyblock.com/restaurant-inspections/by-date/2009/1/5/592332/
More backup info, below, but if we point our taser to the past, we should heal ourselves.
Yay!
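For the rescrape back to November 1, the window of dates to revisit could be enumerated roughly as follows. This is only a sketch under stated assumptions: the year 2008 is inferred from the inspection dates in the links above, the end date is simply the most recent inspection linked there, and `dates_to_rescrape` is a hypothetical helper rather than part of our actual scraper.

```python
from datetime import date, timedelta


def dates_to_rescrape(start, end):
    """Return every calendar date from start to end, inclusive."""
    if start > end:
        return []
    span = (end - start).days
    return [start + timedelta(days=offset) for offset in range(span + 1)]


# Assumed window: November 1, 2008 through the January 5, 2009 inspection.
RESCRAPE_START = date(2008, 11, 1)
RESCRAPE_END = date(2009, 1, 5)

if __name__ == "__main__":
    for day in dates_to_rescrape(RESCRAPE_START, RESCRAPE_END):
        # A real run would re-fetch the King County records for this date
        # here; printing stands in for that (hypothetical) scraper call.
        print(day.isoformat())
```

Going day by day keeps the rescrape easy to resume if it is interrupted partway through the window.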
- Charlotte Restaurant inspections (http://charlotte.everyblock.com/restaurant-inspections/): we obtain this information on a nightly basis via a direct data feed
- Chicago Restaurant inspections (http://chicago.everyblock.com/restaurant-inspections/): we obtain this data via a direct query to the City’s Web-based system (http://webapps.cityofchicago.org/health/inspection.jsp)
- New York City Restaurant inspections (http://nyc.everyblock.com/restaurant-inspections/): we obtain this data via a direct query to the City’s Web-based system (http://home2.nyc.gov/html/doh/html/rii/index.shtml)
- Philadelphia Restaurant inspections (http://philly.everyblock.com/restaurant-inspections/)
- San Francisco Restaurant inspections (http://sf.everyblock.com/restaurant-inspections/): we obtain this data via a database dump placed on an FTP site on a regular basis by the San Francisco Department of Health
As you can see, we can work with whatever format you have and pretty much any retrieval method that makes sense for you. If it is better for you that we obtain the records from the public system, that works for us. Let me know what you think. We’re looking forward to working with you!
– Daniel X. O’Neil