Sunday, January 30, 2011

Document Recognition and Retrieval. Electronic Imaging Conference. 26-Jan-2011

Capture: Image to Archive (conference)

Scene Analysis "Functional Role Labeling"

Image Template -> template management

Trainable pattern classifiers. Features + Classifiers.
Features
--------
Haar
runlength
Fourier
word counts

Classifiers
-----------
Decision tree
nearest neighbor
SVM (Support Vector Machine)
generative probability
density

"Learning Image Anchor Templates for Document Classification and Data Extraction"
Sarkar. http://www.icpr2010.org/pdfs/icpr2010_ThAT7.5.pdf

Constellation Model
http://en.wikipedia.org/wiki/Constellation_model

Information extraction by finding repeated structure
Evgeniy Bart, Prateek Sarkar
http://dx.doi.org/10.1145/1815330.1815353

Best First Leaf Search (from aforementioned paper)

NIST tax form data sets
http://www.nist.gov/srd/nistsd2.cfm



"Introduction of Statistical Information in a Syntactic Analyzer for Document Image Recognition"

Sayre's Paradox
http://dx.doi.org/10.1016/0031-3203(73)90044-7

Hidden Markov Model
http://en.wikipedia.org/wiki/Hidden_Markov_model



"MRF Model w/ Parameter Optimization by CRF for online recognition of handwritten Japanese characters"

MRF - Markov Random Field
http://en.wikipedia.org/wiki/Markov_random_field

CRF - Conditional Random Field
http://en.wikipedia.org/wiki/Conditional_random_field

Extract feature points using Ramner method
U. Ramer “An Iterative Procedure for the Polygonal Approximation of Plan Closed Curves” Computer Graphics and Image Processing, vol.1, pp244-256, 1972.
http://dx.doi.org/10.1016/S0146-664X(72)80017-0

Stochastic Gradient Descent
http://en.wikipedia.org/wiki/Stochastic_gradient_descent

Viterbi algorithm
http://en.wikipedia.org/wiki/Viterbi_algorithm

Baum-Welch algorithm
http://en.wikipedia.org/wiki/Baum%E2%80%93Welch_algorithm

Elastic matching
http://en.wikipedia.org/wiki/Elastic_Matching



"Improving an HMM based offline handwriting recognition system using MME-PSO optimization"

MME -

PSO - Particle Swarm Optimization
http://en.wikipedia.org/wiki/Particle_swarm_optimization

MD-LSTM
http://en.wikipedia.org/wiki/Long_short_term_memory

HTK - toolkit for building HMMs (Cambridge)
http://htk.eng.cam.ac.uk/



"Segmenting text from outdoor images taken by mobile phones using color features"

Preprocessing:
RGB -> HSI
histogram equalization Intensity channel
HSI -> RGB

Image binarization

Noise removal

Image Segmentation.
http://people.cs.uchicago.edu/~pff/

"Font and Background Color Independent Text Binarization"
Kasar edge cue based algorithm
http://www.imlab.jp/cbdar2007/proceedings/papers/O1-1.pdf

Levenshtein Distance
http://en.wikipedia.org/wiki/Levenshtein_distance

Bag of Words
http://en.wikipedia.org/wiki/Bag_of_words_model_in_computer_vision

Local Adaptive Binarization



"Perceptive Method for Handwritten Text Segmentation"

Kalman Filtering
http://en.wikipedia.org/wiki/Kalman_filter

Delaunay graph for distance computation
http://en.wikipedia.org/wiki/Delaunay_triangulation

DMOSp



"A masked based enhancement method for historical documents"

Filtering (noise reduction)
- Wiener
http://en.wikipedia.org/wiki/Wiener_filter
- Median
http://en.wikipedia.org/wiki/Median_filter

Markov Random Fields
http://en.wikipedia.org/wiki/Markov_random_field

Local Binarization [Gatos 2006]
"Adaptive degraded document image binarization"
http://dx.doi.org/10.1016/j.patcog.2005.09.010

OCR - Tesseract
http://en.wikipedia.org/wiki/Tesseract_%28software%29

No comments:

Post a Comment

Note: Only a member of this blog may post a comment.