OCRopus is an open-source project from an AI research group. Among other things, the group aims to develop advanced character-recognition technology:
"OCRopus is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities."