An Integrated System for the Automatic Segmentation and Classification of Documents using Quad-Trees

L. Cinque, S. Levialdi, and A. Malizia (Italy)


Pattern Recognition, Image Processing and Segmentation, Document Analysis


The recognition of paper documents is fundamental to office automation and is becoming an increasingly powerful tool in fields where information is still held on paper. Document recognition follows data acquisition, from both journals and entire books, in order to transform them into digital objects. We present a new system for document recognition that follows Open Source methodologies and uses an XML description for document segmentation and classification, which turns out to be beneficial in terms of classification precision and general-purpose availability.

