This is the sort of thing that makes me like Google again. Google just announced work on the open source OCRopus project, a document analysis and OCR (Optical Character Recognition) system:The goal of ...