Google now indexed Text from Images – Uses OCR
November 1st, 2008

Google leads the search business, and every passing they they convince me that no one is even getting close to them. Just recently they announced that they can now crawl and understand text in Flash animations, they now have something even better! Apparently now Google uses a OCR technology to read scanned documents / images (within PDF files).
This Optical Character Recognition (OCR) technology lets us convert a picture (of a thousand words) into a thousand words — words that can be searched and indexed, so that these valuable documents are more easily found. This is a small but important step forward in our mission of making all the world’s information accessible and useful. [Google Blog]
Categories: Concept / Educative, First look, Tech Industry News | Tags: Google, google ocr, google search, index images, ocr, pdf, read images, search | 1 Comment








