What is OCR?

If you’re just joining us, Optical Character Recognition is an automated system that translates an image of text into encoded selectable text.  Google uses OCR to scan your pictures and PDF files, it then turns the scan into an editable Google Doc format.  Over the past 2 years, Google has been using human input from reCAPTCHA puzzles to increase their success at identifying complex words.

What Languages were added?

Along with the additional languages, Google also improved OCR quality for the 5 previously implemented languages: English, Italian, German, Spanish, and French.  The 29 new languages that have been added are the following: When uploading images or PDF files to Google Docs, be sure to Select the language that the text in your file is written in!  To do so, put your file in queue to be uploaded, then Check the box for Convert text from PDF or images files to Google Docs documents.  A Document Language drop-down menu will appear, there you can Select your language.

Have you tried out Google’s OCR technology for scanning old family journals, books, or whatever else you have laying around the house?  You can also try it out on your iPhone or Android phone if you have the Google Goggles app! Comment Name * Email *

Δ  Save my name and email and send me emails as new comments are made to this post.

Google Adds OCR Support for 34 Languages - 51Google Adds OCR Support for 34 Languages - 45Google Adds OCR Support for 34 Languages - 74