Capture: Image to Archive (conference)
Scene Analysis "Functional Role Labeling"
Image Template -> template management
Trainable pattern classifiers. Features + Classifiers.
Features
--------
Haar
runlength
Fourier
word counts
Classifiers
-----------
Decision tree
nearest neighbor
SVM (Support Vector Machine)
generative probability
density
"Learning Image Anchor Templates for Document Classification and Data Extraction"
Sarkar. http://www.icpr2010.org/pdfs/icpr2010_ThAT7.5.pdf
Constellation Model
http://en.wikipedia.org/wiki/Constellation_model
Information extraction by finding repeated structure
Evgeniy Bart, Prateek Sarkar
http://dx.doi.org/10.1145/1815330.1815353
Best First Leaf Search (from aforementioned paper)
NIST tax form data sets
http://www.nist.gov/srd/nistsd2.cfm
"Introduction of Statistical Information in a Syntactic Analyzer for Document Image Recognition"
Sayre's Paradox
http://dx.doi.org/10.1016/0031-3203(73)90044-7
Hidden Markov Model
http://en.wikipedia.org/wiki/Hidden_Markov_model
"MRF Model w/ Parameter Optimization by CRF for online recognition of handwritten Japanese characters"
MRF - Markov Random Field
http://en.wikipedia.org/wiki/Markov_random_field
CRF - Conditional Random Field
http://en.wikipedia.org/wiki/Conditional_random_field
Extract feature points using Ramner method
U. Ramer “An Iterative Procedure for the Polygonal Approximation of Plan Closed Curves” Computer Graphics and Image Processing, vol.1, pp244-256, 1972.
http://dx.doi.org/10.1016/S0146-664X(72)80017-0
Stochastic Gradient Descent
http://en.wikipedia.org/wiki/Stochastic_gradient_descent
Viterbi algorithm
http://en.wikipedia.org/wiki/Viterbi_algorithm
Baum-Welch algorithm
http://en.wikipedia.org/wiki/Baum%E2%80%93Welch_algorithm
Elastic matching
http://en.wikipedia.org/wiki/Elastic_Matching
"Improving an HMM based offline handwriting recognition system using MME-PSO optimization"
MME -
PSO - Particle Swarm Optimization
http://en.wikipedia.org/wiki/Particle_swarm_optimization
MD-LSTM
http://en.wikipedia.org/wiki/Long_short_term_memory
HTK - toolkit for building HMMs (Cambridge)
http://htk.eng.cam.ac.uk/
"Segmenting text from outdoor images taken by mobile phones using color features"
Preprocessing:
RGB -> HSI
histogram equalization Intensity channel
HSI -> RGB
Image binarization
Noise removal
Image Segmentation.
http://people.cs.uchicago.edu/~pff/
"Font and Background Color Independent Text Binarization"
Kasar edge cue based algorithm
http://www.imlab.jp/cbdar2007/proceedings/papers/O1-1.pdf
Levenshtein Distance
http://en.wikipedia.org/wiki/Levenshtein_distance
Bag of Words
http://en.wikipedia.org/wiki/Bag_of_words_model_in_computer_vision
Local Adaptive Binarization
"Perceptive Method for Handwritten Text Segmentation"
Kalman Filtering
http://en.wikipedia.org/wiki/Kalman_filter
Delaunay graph for distance computation
http://en.wikipedia.org/wiki/Delaunay_triangulation
DMOSp
"A masked based enhancement method for historical documents"
Filtering (noise reduction)
- Wiener
http://en.wikipedia.org/wiki/Wiener_filter
- Median
http://en.wikipedia.org/wiki/Median_filter
Markov Random Fields
http://en.wikipedia.org/wiki/Markov_random_field
Local Binarization [Gatos 2006]
"Adaptive degraded document image binarization"
http://dx.doi.org/10.1016/j.patcog.2005.09.010
OCR - Tesseract
http://en.wikipedia.org/wiki/Tesseract_%28software%29
No comments:
Post a Comment
Note: Only a member of this blog may post a comment.