// AI TOOLS · MISTRAL OCR 4

Mistral OCR 4

Document-intelligence OCR model that extracts structured, markdown-ready text from PDFs, slides, and images.

Last verified · 2026-06-23 · by Moe Ameen

What Mistral OCR 4 is

Mistral OCR 4 is an optical character recognition and document-understanding model from Mistral, the French AI lab. It was released on June 23, 2026 as `mistral-ocr-4-0`, and Mistral's `mistral-ocr-latest` alias now points to it. Where a basic OCR engine hands you a flat wall of text, OCR 4 reads a document and returns a structured, machine-readable version of it.

The structure is the point. OCR 4 returns bounding boxes that localize each element on the page, typed-block classification that labels titles, tables, equations, signatures and other elements, and confidence scores reported per page and per word. The extracted text is formatted as clean markdown, which makes it usable directly for semantic chunking, RAG pipelines, and agent workflows. Mistral also positions OCR 4 as an ingestion component of its open-source Search Toolkit, feeding citation-ready inputs into retrieval and search workflows.

It handles PDF, DOC, PPT, and OpenDocument formats along with images, and supports 170 languages across 10 language groups, with Mistral reporting gains on rare and low-resource languages where several competing systems degrade. On public benchmarks the company reports 85.20 on OlmOCRBench and 93.07 on OmniDocBench, and says independent annotators preferred OCR 4 over competing systems at an average 72% win rate across 600+ multilingual documents. Treat specific scores as a snapshot and verify against the official source.

One thing to be clear about: OCR 4 is an extraction model, not a content generator. It reads documents and gives you structured text. It does not write social posts, draft a blog, or make video or images. Turning what it extracts into published content is a separate job.

What you can make with it

  • Clean, markdown-structured text pulled from a scanned or native PDF
  • Tables extracted from reports, decks, and statements as structured blocks, not flattened lines
  • Typed blocks — titles, equations, signatures — labeled for downstream parsing
  • Searchable, citation-ready text from slide decks (PPT) and OpenDocument files
  • Multilingual document transcription across 170 languages, including low-resource scripts
  • Per-word confidence scores you can use to flag uncertain passages for human review

How Kompozy turns Mistral OCR 4 output into content

Mistral OCR 4 solves the front-door problem: it turns a locked PDF, a scanned report, a slide deck, or a photo of a whiteboard into clean, structured text you can actually work with. What it does not do is turn that text into anything publishable. That handoff is where Kompozy comes in. Take the markdown OCR 4 returns, drop it into Kompozy as a source, and the engine fans one document into a full content set — a carousel that walks through the report's key findings, a LinkedIn post and an X thread written in your own voice through your Persona Brief, a blog article, an email newsletter, and a short script for a persona or avatar video — then schedules and publishes the set across all nine connected platforms from one queue.

The leverage is in the asymmetry. A 40-page whitepaper or a chart-heavy deck is hours of reading nobody on social will do. OCR 4 gets the words out in seconds; Kompozy turns those words into the week's posts. Because OCR 4 preserves tables and document structure as markdown, the source you hand Kompozy is clean enough that the generated carousel and blog track the actual document instead of drifting around a messy copy-paste.

  1. Run your PDF, scanned document, or slide deck through Mistral OCR 4 to get clean markdown text.
  2. Paste that text into Kompozy as a source and choose your formats — carousel, blog, newsletter, text posts, persona video script.
  3. Let Kompozy generate each format in your voice via your Persona Brief, with brand-exact carousels rendered through HyperFrames.
  4. Review and approve the set in the pipeline.
  5. Schedule and publish across LinkedIn, X, Instagram, and the rest of the nine platforms from one queue.

Frequently asked questions

What is Mistral OCR 4?

Mistral OCR 4 is an OCR and document-understanding model from Mistral, released June 23, 2026 as mistral-ocr-4-0. It extracts structured, markdown-formatted text from documents and images, returning bounding boxes, typed-block classification (titles, tables, equations, signatures), and per-page and per-word confidence scores across 170 languages.

How much does Mistral OCR 4 cost?

Mistral lists the API at $4 per 1,000 pages standard and $2 per 1,000 pages via the Batch API (a 50% discount), plus its Document AI product at $5 per 1,000 pages. A self-hosted option is available to enterprise customers. Check the official pricing page for current rates.

What file types and languages does Mistral OCR 4 support?

It handles PDF, DOC, PPT, and OpenDocument formats along with images, and supports 170 languages across 10 language groups, with reported gains on rare and low-resource languages.

Can Mistral OCR 4 write social posts or make video?

No. OCR 4 extracts text from documents; it does not generate content or media. To turn the extracted text into posts, carousels, blogs, newsletters, or video, pass it to a content engine like Kompozy, which generates each format in your voice and publishes across platforms.

How is Mistral OCR 4 different from OCR 3?

OCR 4 is the successor to Mistral OCR 3. Mistral reports higher benchmark scores and stronger multilingual and low-resource handling, along with structured output (bounding boxes, typed blocks, confidence scores) and a single-container self-hosting option. Verify specifics against Mistral's announcement.

Related tools

  • ApertusA fully open, multilingual foundation model built in Switzerland for sovereign AI.

← All AI tools · Get started →