Questions tagged [ocr]

Optical Character Recognition is mechanical or electronic translation of images of handwritten, typewritten or printed text into text data (a sequence of codes used to represent characters in the computer).

36 questions
10
votes
2 answers

Filtering kinda "half-toned" image for OCR processing

I have a scanned PDF material to which I want to add hidden text layer, so I could index the document. I used ghostscript black and white tiff output device (tiffg4) to extract pages as tiff images, and here is example of what they look…
zetah
  • 573
  • 3
  • 12
8
votes
1 answer

Suggested Preprocessing methods for OCR on Circular Images

Hello this is my sample image I am going to do real time character detection on images like that. I've tried SURF, SIFT, MSER and template matching on original image without any preprocessing. I can detect characters but when the image changes this…
6
votes
1 answer

Nutrition facts label OCR

My team is working on a program that can read nutrition facts labels using OCR, much like NutriScanner (though by its reviews it doesn't seem like it works particularly well). I've seen other questions noting that it'd be a good idea to straighten…
Ian Kim
  • 63
  • 1
  • 3
5
votes
1 answer

Text recognition for Android app

I´d like to use some OCR algorithm (Image to Text) into an Android app (Java). I have previously used several algorithms but for a PC (Matlab) app. Which library or algorithm should I use? Computational cost and response time are important issues.
4
votes
2 answers

Document image vanishing point detection and evaluation

It is a natural idea to substitute our scanner with mobile phone, and perform OCR with the phone at the same time. If this can come true, two problems must be tackled: one is related to document image binarization, and the other is related to…
4
votes
1 answer

Sharpen Defocused Image (Deconvolution / Image Restoration)

Using OCR, I want to extract text from product packages using Google Glass. However, because of the fixed focus of the camera the package pictures are blurred. Is there a way to sharpen the image? Currently I use unsharp masking to enlarge the…
4
votes
1 answer

Feature Detection on a Circular Metal Surface for Character Recognition

Hello I'm trying to recognize characters on the circular metal surface. Without any preprocessing I could match cropped features with original image. I've actually cropped 2 characters together to increase interested points on a template.…
Anar Bayramov
  • 247
  • 1
  • 4
4
votes
2 answers

Licence plate detection with different backgrounds

I just studied the Algorithm of ANPR (Automatic Number Plate Recognition) from the chp 5 of book Packtpub.Mastering.OpenCV.with.Practical.Computer.Vision.Projects in which they detect the vehicle licence plate , but their are some restriction in…
ARG
  • 591
  • 1
  • 4
  • 13
3
votes
1 answer

Smoothing "font characters" outlines in bitmap image

Adobe Acrobat has this as a feature called ClearScan. While doing OCR it allows image enhancement, by smoothing irregularities on font outlines. For example, this source: turns to this: Are there other tools with such feature, or public algorithms…
zetah
  • 573
  • 3
  • 12
3
votes
1 answer

Recognizing hand-written archaeological signs

I have 75 images of handwritten signs from which I extracted 7 Hu moments and solidity features. How can I find similarities among them to train a classifier and predict the value? I thought SVM was a good choice, but I don't have a target vector…
postgres
  • 233
  • 2
  • 8
3
votes
2 answers

Line tracing folowing a path of an almost connected components

say we have a binary image like the one in below i would like to extract each black line, even if it's not almost connected i have marked here in red, example of lines that correspond to those to be extracted **the purpose here is to extract lines a…
3
votes
1 answer

Match (OCR?) a digital image of a product label to similar

Excuse my ignorance... Given a digital image of a label I'd like to Requirement: Match it to the closest pre-existing image in a datastore Optional: OCR it, if possible (only necessary if step 1 fails) I'm a programmer but not an engineer. I have…
Meltemi
  • 131
  • 3
2
votes
1 answer

How to Clean Black and White Images Which Are Results of Edge Detection

I want to extract text from some images with messy backgrounds. First step was switching to gray scale, applying an edge detection algorithm, and then cleaning up a little bit. Here's an example of the result: I want to use the tesseract OCR to…
Robin
  • 131
  • 4
2
votes
1 answer

What is the algorithm behind OCR?

I'm curious about the algorithm and basic principles behind Optical Character Recognition. Can anyone explain: The most basic OCR algorithm that can be fairly easily implemented from scratch? Or the common denominator between different OCR…
Geerten
  • 1,361
  • 1
  • 9
  • 17
2
votes
2 answers

Compare feature vectors of an image in MATLAB

I am working on OCR. I have extracted features of 26 alphabets, and I want to compare them with an experimental character feature. I have tried using Euclidean distance and correlation functions, but I am not getting good efficiency. What are some…
SKK
  • 21
  • 2
1
2 3