Mistral has announced the release of OCR 4, a document understanding model designed for enterprise and developer use. This new version brings expanded capabilities, including extraction of structured content with bounding boxes, typed block classification, and inline confidence scores for each region of a document. OCR 4 supports 170 languages across 10 language groups, outperforming previous iterations and other leading systems, particularly with rare and low-resource languages. It is engineered for both high-volume and interactive document workflows, with notable acceleration in processing speed and cost efficiency compared to prior versions and industry competitors.
We ran OCR 4 head-to-head against the field. Independent annotators blindly ranked 600+ real-world documents across 12+ languages, and preferred OCR 4 over every system tested, with win rates averaging 72%. pic.twitter.com/nGRXtVVQT7
— Mistral AI (@MistralAI) June 23, 2026
The model is available via API, Mistral Studio, Amazon SageMaker, Microsoft Foundry, and soon on Snowflake Parse Document. For organizations with strict data privacy or residency requirements, OCR 4 can be deployed as a single-container, self-hosted solution. Target customers include enterprises in legal, financial, healthcare, and technical domains that require reliable extraction from complex, multilingual document formats such as PDF, DOC, PPT, and OpenDocument.
Mistral’s approach with OCR 4 focuses on delivering precise, localized, and classified document data, enabling downstream use in RAG pipelines, compliance workflows, and enterprise search. Industry engineers have reported substantial reductions in cost and latency when switching to OCR 4, and early users are leveraging the model for structured field extraction, archive digitization, and technical document parsing.