on LIGHTON
LightOn Expands Paradigm Platform with Arabic OCR Capabilities
In a strategic move, LightOn enhances its Paradigm Platform by introducing real-time web access and expanding its LightOnOCR-2 capabilities to Arabic, aided by fine-tuning methods. This development uses synthetic data generation to address the gap in OCR tools for underrepresented languages. The initiative focuses on complex datasets comprising 12,000 synthetic pages, highlighting diverse scenarios and script challenges associated with Arabic's unique cursive and right-to-left characteristics.
This effort seeks to streamline the automation processes for organizations handling Arabic documents, such as archives or legal papers, particularly in the Middle East. LightOn's solution supports both public and private sectors by meeting extensive community needs. Open-sourced under the Apache 2.0 license, LightOnOCR-2 aims to empower users, being a key part of LightOn's self-service LightOn Console.
R. H.
Copyright © 2026 FinanzWire, all reproduction and representation rights reserved.
Disclaimer: although drawn from the best sources, the information and analyzes disseminated by FinanzWire are provided for informational purposes only and in no way constitute an incentive to take a position on the financial markets.
Click here to consult the press release on which this article is based
See all LIGHTON news