Questions tagged [ocr]

Optical character recognition - mechanical or electronic translation of scanned images of handwritten, typewritten or printed text into machine-encoded text.

It is widely used to convert books and documents into electronic files, to computerize a record-keeping system in an office, or to publish the text on a website. OCR makes it possible to edit the text, search for a word or phrase, store it more compactly, display or print a copy free of scanning artifacts, and apply techniques such as machine translation, text-to-speech and text mining to it. OCR is a field of research in pattern recognition, artificial intelligence and computer vision.

OCR systems require calibration to read a specific font; early versions needed to be programmed with images of each character, and worked on one font at a time. "Intelligent" systems with a high degree of recognition accuracy for most fonts are now common. Some systems are capable of reproducing formatted output that closely approximates the original scanned page including images, columns and other non-textual components.

15 questions
45
votes
4 answers

Google Translate an image, not on your phone

The Google Translate application now supports in-image OCR and translation of text: https://support.google.com/translate/answer/6142483?hl=en but this functionality does not seem to be accessible using a PC and visiting translate.google.com. How can…
einpoklum
  • 1,305
  • 2
  • 14
  • 28
11
votes
4 answers

How to select text in a scanned document in Google Drive PDF viewer

In my PDF documents in Google Drive, I used to be able to select text of a scanned document. I've always scanned at 300dpi. Since at least a year I can't select text in scanned documents anymore in Google Drive's PDF viewer. I've tried selecting…
7
votes
1 answer

OCR in new Google Drive

The OCR option for images and PDF files seems to be missing in the new Google Drive (the check box in the upload configuration is missing). Is this option completely gone or did it just move elsewhere?
Šimon Tóth
  • 267
  • 1
  • 7
5
votes
1 answer

How do I get all the plain text from a Google Books full-view book?

Google Books has a lot of old books that are available for "Full View". And there is an option to see a certain amount of plain text: when you're viewing the page images of an old book, if you click on the gear icon in the upper right and click on…
76987
  • 181
  • 2
  • 4
3
votes
3 answers

Image to text online converter

Can anyone recommend a website (free tool) where I can enter an image (it contains some text) and get back text.
igor
  • 351
  • 4
  • 14
3
votes
2 answers

How do I make Google Drive perform OCR on a PDF I upload?

I have a PDF which is a scan of a few pages of a book. I want to be able to search inside this PDF for specific terms. I know OCR can be performed on files in Google Drive. However, I don't seem to be able to initiate this manually, and the PDF I…
user170838
  • 31
  • 1
  • 2
3
votes
1 answer

Converting all PNG images in Google Drive to text (OCR)

I found Google Drive the best OCR for Persian texts. The problem is Google Drive doesn't convert files larger than 2MB so I can't use big PDFs. So I extracted all pages from PDF into PNG images. Now, how can I tell Google Drive to convert all of my…
AVEbrahimi
  • 133
  • 1
  • 6
2
votes
0 answers

Google Translate OCR supported alphabets

Google Translate supports input from camera (OCR), but which alphabets do they support? They support Latin for sure, but what about others (Arabic, Cyrillic, Thai, Chinese, etc.)?
valodzka
  • 173
  • 5
2
votes
1 answer

How to download a scanned .pdf OCRed by Google?

I have an email with a scanned .pdf attached in my Gmail account. When I clicked "View", I see that Google has OCRed it. When I click "Download", the PDF is the original one, i.e. without being OCRed. How can I download a .pdf file with content…
Tim
  • 3,439
  • 13
  • 39
  • 45
1
vote
0 answers

Will Google Drive perform OCR on existing (badly) OCRed PDFs?

I use ScanSnap's software to perform OCR on scanned PDFs, then realized the OCR contains a lot of errors and I couldn't find a lot of information in search. So I tried some scans without OCR, uploaded to Google Drive and it recognized the text…
ytk
  • 151
  • 5
1
vote
0 answers

Google Drive not indexing searchable PDF

I have a fully searchable PDF. The PDF was scanned and then converted to text by Fujitsu's ScanSnap Manager #iX500 (W). For those who want all the details, here's the info from Document Properties: PDF Producer (Acrobat 11.0.23 Paper Capture…
Bobo
  • 19
  • 3
0
votes
1 answer

Does GDrive search use Optical Character Recognition?

Evernote can find and identify text inside images that are uploaded. It even reads handwriting. Does GDrive do any of this? Note: I'm not asking about the GDrive OCR feature that converts images with text into text documents.
Homer
  • 539
  • 1
  • 7
  • 15
0
votes
1 answer

Does Google Drive OCR images for the search internal function in Google Drive?

I was just searching my Google Drive account for a document containing the word 'internet' and the top hit was a screenshot i took of instructions of how to configure the DNS settings to use for a proxy VPN service - which is littered with the word…
sam
  • 7,107
  • 42
  • 100
  • 140
0
votes
0 answers

Google Translate Issue

I have issue that when I use google translate in mobile phone it is able to use import picture and convert it to the text, but the using Computer by Chrome or Firefox browser it cannot use it like mobile phone. How to get the Khmer text from picture…
ShirleyC Wilson
0
votes
1 answer

How to convert all images to text in a Word document?

These images have text in them and I can't find a tool that does this easily, preferably without any installations.
user8759
  • 117
  • 1