Amazon cover image
Image from Amazon.com

Machine Learning in Document Analysis and Recognition [electronic resource] / edited by Simone Marinai, Hiromichi Fujisawa.

By: Contributor(s): Material type: TextTextSeries: Studies in Computational Intelligence ; 90Publisher: Berlin, Heidelberg : Springer Berlin Heidelberg, 2008Description: XII, 434 p. 142 illus. online resourceContent type:
  • text
Media type:
  • computer
Carrier type:
  • online resource
ISBN:
  • 9783540762805
Subject(s): Additional physical formats: Printed edition:: No titleDDC classification:
  • 519 23
LOC classification:
  • TA329-348
  • TA640-643
Online resources:
Contents:
to Document Analysis and Recognition -- Structure Extraction in Printed Documents Using Neural Approaches -- Machine Learning for Reading Order Detection in Document Image Understanding -- Decision-Based Specification and Comparison of Table Recognition Algorithms -- Machine Learning for Digital Document Processing: from Layout Analysis to Metadata Extraction -- Classification and Learning Methods for Character Recognition: Advances and Remaining Problems -- Combining Classifiers with Informational Confidence -- Self-Organizing Maps for Clustering in Document Image Analysis -- Adaptive and Interactive Approaches to Document Analysis -- Cursive Character Segmentation Using Neural Network Techniques -- Multiple Hypotheses Document Analysis -- Learning Matching Score Dependencies for Classifier Combination -- Perturbation Models for Generating Synthetic Training Data in Handwriting Recognition -- Review of Classifier Combination Methods -- Machine Learning for Signature Verification -- Off-line Writer Identification and Verification Using Gaussian Mixture Models.
In: Springer eBooksSummary: The objective of Document Analysis and Recognition (DAR) is to recognize the text and graphicalcomponents of a document and to extract information. With ?rst papers dating back to the 1960’s, DAR is a mature but still gr- ing research?eld with consolidated and known techniques. Optical Character Recognition (OCR) engines are some of the most widely recognized pr- ucts of the research in this ?eld, while broader DAR techniques are nowadays studied and applied to other industrial and o?ce automation systems. In the machine learning community, one of the most widely known - search problems addressed in DAR is recognition of unconstrained handwr- ten characters which has been frequently used in the past as a benchmark for evaluating machine learning algorithms, especially supervised classi?ers. However, developing a DAR system is a complex engineering task that involves the integration of multiple techniques into an organic framework. A reader may feel that the use of machine learning algorithms is not approp- ate for other DAR tasks than character recognition. On the contrary, such algorithms have been massively used for nearly all the tasks in DAR. With large emphasis being devoted to character recognition and word recognition, other tasks such as pre-processing, layout analysis, character segmentation, and signature veri?cation have also bene?ted much from machine learning algorithms.
Tags from this library: No tags from this library for this title. Log in to add tags.
Star ratings
    Average rating: 0.0 (0 votes)
Holdings
Item type Current library Call number Status Date due Barcode
E-Book E-Book Central Library Available E-44756

to Document Analysis and Recognition -- Structure Extraction in Printed Documents Using Neural Approaches -- Machine Learning for Reading Order Detection in Document Image Understanding -- Decision-Based Specification and Comparison of Table Recognition Algorithms -- Machine Learning for Digital Document Processing: from Layout Analysis to Metadata Extraction -- Classification and Learning Methods for Character Recognition: Advances and Remaining Problems -- Combining Classifiers with Informational Confidence -- Self-Organizing Maps for Clustering in Document Image Analysis -- Adaptive and Interactive Approaches to Document Analysis -- Cursive Character Segmentation Using Neural Network Techniques -- Multiple Hypotheses Document Analysis -- Learning Matching Score Dependencies for Classifier Combination -- Perturbation Models for Generating Synthetic Training Data in Handwriting Recognition -- Review of Classifier Combination Methods -- Machine Learning for Signature Verification -- Off-line Writer Identification and Verification Using Gaussian Mixture Models.

The objective of Document Analysis and Recognition (DAR) is to recognize the text and graphicalcomponents of a document and to extract information. With ?rst papers dating back to the 1960’s, DAR is a mature but still gr- ing research?eld with consolidated and known techniques. Optical Character Recognition (OCR) engines are some of the most widely recognized pr- ucts of the research in this ?eld, while broader DAR techniques are nowadays studied and applied to other industrial and o?ce automation systems. In the machine learning community, one of the most widely known - search problems addressed in DAR is recognition of unconstrained handwr- ten characters which has been frequently used in the past as a benchmark for evaluating machine learning algorithms, especially supervised classi?ers. However, developing a DAR system is a complex engineering task that involves the integration of multiple techniques into an organic framework. A reader may feel that the use of machine learning algorithms is not approp- ate for other DAR tasks than character recognition. On the contrary, such algorithms have been massively used for nearly all the tasks in DAR. With large emphasis being devoted to character recognition and word recognition, other tasks such as pre-processing, layout analysis, character segmentation, and signature veri?cation have also bene?ted much from machine learning algorithms.

There are no comments on this title.

to post a comment.

Maintained by VTU Library