OCR
Aug 2, 2021
This NEH-funded project focuses on the development of modern Optical Character Recognition (OCR) and post-correction tools tailored for Indigenous Latin American Languages.
Antonios Anastasopoulos
Assistant Professor
I work on multilingual models, machine translation, speech recognition, and NLP for under-served languages.
Related
- Lexically-Aware Semi-Supervised Learning for OCR Post-Correction
- OCR Post-Correction for Endangered Language Texts
- Global Voices, Local Biases: Socio-Cultural Prejudices across Languages
- GlobalBench: A Benchmark for Global Progress in Natural Language Processing
- LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages