Michal Migurski's notebook, listening post, and soapbox. Subscribe to this blog. Check out the rest of my site as well.

Aug 22, 2004 2:56am

temporary troubles with In The News

I discovered today that Google had made some small tweaks to the HTML structure of their news page. It caused an interruption in my ability to scrape their site between the afternoon of August 20th and the morning of August 21st. Hopefully this won't be noticeable by tomorrow as my fix works its way through various levels of caching, but it does highlight the difficulties of working with capricious methods like page scraping. I'm currently looking at the feasibility of doing a similar visualization investigation with Open Secrets, and troubles with unexpected modifications to data formats underscore the usefulness of web services with stable API's. "Small pieces, loosely joined" —Flickr gets it, even Google gets it with their normal search service. Here's hoping that Google News moves out of beta soon.
October 2021
Su M Tu W Th F Sa

Recent Entries

  1. Mapping Remote Roads with OpenStreetMap, RapiD, and QGIS
  2. How It’s Made: A PlanScore Predictive Model for Partisan Elections
  3. Micromobility Data Policies: A Survey of City Needs
  4. Open Precinct Data
  5. Scoring Pennsylvania
  6. Coming To A Street Near You: Help Remix Create a New Tool for Street Designers
  7. planscore: a project to score gerrymandered district plans
  8. blog all dog-eared pages: human transit
  9. the levity of serverlessness
  10. three open data projects: openstreetmap, openaddresses, and who’s on first
  11. building up redistricting data for North Carolina
  12. district plans by the hundredweight
  13. baby steps towards measuring the efficiency gap
  14. things I’ve recently learned about legislative redistricting
  15. oh no
  16. landsat satellite imagery is easy to use
  17. openstreetmap: robots, crisis, and craft mappers
  18. quoted in the news
  19. dockering address data
  20. blog all dog-eared pages: the best and the brightest