The artificial intelligence company has developed an OCR tool that, according to published comparative tests, shows greater accuracy in processing documents with elements such as mathematical equations, tables, and multilingual content.
Mistral has announced the launch of Mistral OCR, an API designed for optical character recognition in digital documents. The tool is aimed at processing documents containing diverse elements such as text, images, tables, and mathematical equations.
According to data presented by the company, approximately 90% of organizational information is stored in document format. The new API processes both images and PDF files and extracts their content while maintaining the original structure, facilitating integration with retrieval-augmented generation (RAG) systems that work with multimodal documents.
The service has already been implemented as the default model for document understanding in Le Chat, Mistral's conversational platform. The company has published comparative test results where Mistral OCR achieves an overall performance of 94.89% compared to solutions such as Google Document AI (83.42%), Azure OCR (89.52%), and GPT-4o (89.77%).
Technical specifications indicate that the system can process up to 2,000 pages per minute on a single node. Another notable feature is the ability to use entire documents as instructions and generate outputs in structured formats such as JSON.
Developers mention various fields where this technology could be applied, such as digitizing scientific research, preserving historical documents, optimizing customer service, and converting technical and educational literature into formats that can be processed by artificial intelligence systems.
The API is available on Mistral's developer platform, called "la Plateforme." The company also offers free trials through Le Chat and contemplates on-premises installation options for organizations with special data privacy requirements.
Artificial intelligence assistant that combines conversational capabilities with specialized tools. Provides chat functions, code generation, data analysis and custom workflow creation. Designed for ...
Mistral AI develops portable language models with multilingual capabilities and high computational efficiency. The platform enables cloud or on-premises implementations, with customization options ...
17/02/2026
Meta and NVIDIA have announced a multi-year strategic partnership for the large-scale deployment of chips and networking in Meta's data centers, with ...
11/02/2026
Zoë Hitzig, who spent two years at OpenAI shaping AI models and safety policies, has resigned following the company's announcement to test ads on ...
05/02/2026
Kuaishou Technology has introduced Kling AI 3.0, which includes four new video and image generation models with significant improvements in visual ...
05/02/2026
OpenAI has introduced Frontier, a platform designed to enable businesses to build, deploy, and manage artificial intelligence agents that integrate ...