User:Matthias Tagging Details

Matthias’s tags (120 of 305)

Pages Tagged 'OCR' (2)

  1. OCRopus (google open-source document-analysis character-recognition ocr ubuntu) Mar 15 07
     OCRopus is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities. The OCRopus engine is based on two research projects: a high-performance handwriting recognizer developed in the mid-1990s and deployed by the US Census Bureau, and novel high-performance layout analysis methods. OCRopus development is sponsored by Google.
  2. tesseract-ocr (open-source ocr google tesseract scanning programming character-recognition) Mar 15 07
     The Tesseract OCR engine was one of the top 3 engines in the 1995 UNLV Accuracy test. Since then it has had little work done on it, but it is probably one of the most accurate open source OCR engines available. The source code will read a binary, grey or color image and output text. A TIFF reader is built in that will read uncompressed TIFF images, or libtiff can be added to read compressed images.