Application of statistical pattern recognition to document segmentation and labelling.

  • 0 Ratings
  • 0 Want to read
  • 0 Currently reading
  • 0 Have read
Application of statistical pattern recognitio ...
Kevin Laven
Not in Library

My Reading Lists:

Create a new list

Check-In

×Close
Add an optional check-in date. Check-in dates are used to track yearly reading goals.
Today

  • 0 Ratings
  • 0 Want to read
  • 0 Currently reading
  • 0 Have read

Buy this book

Last edited by WorkBot
January 24, 2010 | History

Application of statistical pattern recognition to document segmentation and labelling.

  • 0 Ratings
  • 0 Want to read
  • 0 Currently reading
  • 0 Have read

In the field of computer analysis of document images, the problems of physical and logical layout analysis have been approached through a variety of heuristic, rule-based, and grammar-based techniques. In this paper we investigate the effectiveness of statistical pattern recognition algorithms for solving these two problems. Using a new software environment for manual page image segmentation and labelling, a dataset containing 932 page images from academic journals has been created. Several physical layout analysis algorithms have been implemented, including a new algorithm based on a logistic regression classifier. Three statistical classifiers were applied to the logical layout analysis problem, with encouraging results. A new model for how ink is laid out on a page was used to develop a prototype combined segmentation and labeling system. Finally, several applications have been investigated, and rudimentary implementations demonstrated. Results indicate that statistical pattern recognition approaches to these problems will be very fruitful.

Publish Date
Language
English
Pages
145

Buy this book

Edition Availability
Cover of: Application of statistical pattern recognition to document segmentation and labelling.

Add another edition?

Book Details


Edition Notes

Source: Masters Abstracts International, Volume: 44-02, page: 0936.

Advisor: S. Roweis.

Thesis (M.Sc.)--University of Toronto, 2005.

Electronic version licensed for access by U. of T. users.

GERSTEIN MICROTEXT copy on microfiche (2 microfiches).

The Physical Object

Pagination
145 leaves.
Number of pages
145

ID Numbers

Open Library
OL19216571M
ISBN 10
0494071877

Community Reviews (0)

Feedback?
No community reviews have been submitted for this work.

Lists

This work does not appear on any lists.

History

Download catalog record: RDF / JSON
January 24, 2010 Edited by WorkBot add more information to works
December 11, 2009 Created by WorkBot add works page