


Chandra OCR 2 achieves a state-of-the-art 85.9% on the olmOCR benchmark with layout-aware full-page decoding. Here's why it matters for document processing pipelines.
Head of Growth & Customer Success
Optical Character Recognition has existed for decades. Tesseract, the most widely-used open-source OCR engine, dates back to 1985. So why, in 2026, is a new OCR model trending on GitHub with 4,700+ stars and claiming state-of-the-art results?
Because the documents that matter in business are not simple pages of typed text. They are invoices with merged table cells, medical forms with checkboxes and handwritten notes, research papers with multi-column layouts and LaTeX equations, legal contracts with nested headers and footnotes. Traditional OCR reads the characters but destroys the structure. And without structure, the extracted text is nearly useless for downstream AI processing.
https://x.com/UnstructuredIO/status/2036602696371970195
Chandra OCR, developed by Datalab (a Brooklyn-based AI startup founded by Vik Paruchuri and Sandy Kwon in 2024), takes a fundamentally different approach. Instead of segmenting a page into blocks and processing each independently, Chandra decodes entire pages at once with full layout awareness. The result is structured output (Markdown, HTML, or JSON with bounding boxes) that preserves the document's semantic organization: tables remain tables, columns stay aligned, equations render as LaTeX, and forms retain their checkbox states.
Chandra 2, the latest release, achieves 85.9% overall accuracy on the olmOCR benchmark (the emerging standard for evaluating OCR on complex documents), the highest score of any model tested:
| Model | Overall Score |
|---|---|
| Chandra 2 | 85.9% |
| dots.ocr | 83.9% |
| Chandra 1 | 83.1% |
| olmOCR 2 | 78.5% |
| DeepSeek OCR | 75.4% |
Chandra 2 excels where traditional OCR struggles most:
Tables: 89.9% (handling merged cells, colspan/rowspan, nested tables)
Math equations: 89.3% (output as LaTeX)
Headers/footers: 92.5% (distinguishing structural elements from body text)
Multilingual: 77.8% average across 43 languages (vs. Gemini 2.5 Flash at 67.6%)
The 85.9% overall score represents a meaningful improvement over the next best model (dots.ocr at 83.9%) and a substantial leap over established alternatives like olmOCR 2 and DeepSeek OCR.
The architectural innovation in Chandra is the shift from pipeline-based OCR to full-page decoding using a vision-language model.
| Feature | Chandra OCR | Tesseract | AWS Textract | Azure Document AI |
|---|---|---|---|---|
| Layout awareness | Yes (native) | No | Yes | Yes |
| Table extraction | Yes (high accuracy) | No | Yes | Yes |
| Handwriting | Yes | Limited | Yes | Yes |
| Open source | Yes (free under $2M revenue) | Yes | No | No |
| Cost | Free (self-hosted) | Free | $1.50/1K pages | $1/1K pages |
https://x.com/meraborhan/status/2037308655201362123
Most OCR systems (including Datalab's own earlier products, Marker and Surya) work in stages:
Detect layout regions (text blocks, tables, images, equations)
Segment the page into individual regions
Recognize text within each region independently
Reassemble the results into a structured output
Each stage introduces error. Layout detection may split a table incorrectly. Segmentation may assign a column of text to the wrong block. Recognition may lose context about where a text region fits in the overall document structure. The errors compound.
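The compounding can be made concrete with a toy calculation. The per-stage accuracies below are hypothetical, chosen only to illustrate the effect:

```python
# Hypothetical per-stage accuracies for a staged OCR pipeline; when stages
# run independently, their errors compound multiplicatively.
stages = {
    "layout detection": 0.97,
    "segmentation": 0.97,
    "recognition": 0.97,
    "reassembly": 0.97,
}

end_to_end = 1.0
for accuracy in stages.values():
    end_to_end *= accuracy

# Four 97%-accurate stages yield only ~88.5% end to end.
print(f"end-to-end accuracy: {end_to_end:.1%}")
```

A single-pass model avoids this multiplication entirely: there is one decoding step, so there is no cross-stage error product.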
Chandra uses a vision-language model (built on Qwen3-VL architecture) that processes the entire page as a single image. The model simultaneously:
Identifies all content types (text, tables, images, equations, forms, handwriting)
Extracts and captions images and diagrams
Preserves table structures including colspan and rowspan
Reconstructs form layouts with checkbox states
Handles handwritten text alongside printed text
Outputs structured formats with full layout metadata (bounding boxes for every element)
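Concretely, layout-aware output with bounding boxes might look like the sketch below. The field names here are illustrative, not Chandra's documented schema:

```python
# Hypothetical shape of layout-aware OCR output: every block carries its
# type and bounding box, so document structure survives extraction.
page = {
    "page": 1,
    "blocks": [
        {"type": "heading", "bbox": [72, 40, 540, 72], "text": "Invoice #1042"},
        {"type": "table", "bbox": [72, 120, 540, 360], "cells": [
            {"row": 0, "col": 0, "colspan": 2, "text": "Line item"},
        ]},
        {"type": "equation", "bbox": [72, 400, 300, 430], "latex": r"q \cdot p"},
        {"type": "checkbox", "bbox": [72, 460, 88, 476], "checked": True},
    ],
}

# Downstream code can filter by block type instead of re-parsing flat text.
tables = [b for b in page["blocks"] if b["type"] == "table"]
print(len(tables), "table(s) found")
```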
By processing the full page at once, the model can use global context: understanding that a number in the bottom-right of a table is a total, that a margin annotation relates to the adjacent paragraph, that a multi-column layout should be read left-to-right within each column.
Chandra 2 is a 4-billion parameter model. This is small enough to run on a single GPU but large enough to capture the visual reasoning needed for complex document understanding.
H100 GPU: Up to 4 pages per second (~345,000 pages per day)
vLLM with 96 concurrent streams: 1.44 pages per second per stream
Max output tokens: 8,192 per page
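The daily figure follows directly from the per-second rate; a quick sanity check:

```python
# Back-of-envelope check of the quoted H100 throughput.
pages_per_second = 4
seconds_per_day = 24 * 60 * 60   # 86,400

pages_per_day = pages_per_second * seconds_per_day
print(f"{pages_per_day:,} pages/day")  # 345,600, i.e. the ~345K quoted above
```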
Chandra supports 90+ languages, with 77.8% average accuracy across the 43 languages tested in the multilingual benchmark. It is particularly strong on non-Latin scripts, where traditional OCR often falls apart.
Local via Hugging Face Transformers: Standard pip install, model download, and inference
High-throughput via vLLM: For production batch processing
Hosted API: Datalab offers a managed API for teams that do not want to manage GPU infrastructure
Quantized variants: Available for deployment on lower-end hardware or edge devices
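To gauge which tier of hardware a deployment needs, the weight memory of a 4B-parameter model can be estimated from bytes per parameter. This is a rough sketch that ignores activations and KV cache:

```python
# Approximate GPU memory for the weights alone of a 4B-parameter model.
PARAMS = 4e9
bytes_per_param = {"fp16/bf16": 2.0, "int8": 1.0, "int4": 0.5}

for fmt, nbytes in bytes_per_param.items():
    gib = PARAMS * nbytes / 2**30
    print(f"{fmt:9s} ~{gib:.1f} GiB")
```

At fp16 the weights come to roughly 7.5 GiB, which fits comfortably on a single data-center or even consumer GPU, and int4 quantization brings them under 2 GiB, which is why quantized variants can target lower-end and edge hardware.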
Chandra is open-source with some restrictions for commercial use above certain revenue thresholds (startups under $2M revenue can use it free). The hosted API provides a fully-licensed option for larger organizations.
The real significance of Chandra is not the benchmark number itself. It is what accurate, layout-aware OCR enables downstream.
A minimal usage sketch (package and model identifiers are illustrative and may differ from the released API):

```python
from chandra_ocr import ChandraOCR

# Load the model and run OCR on a document
ocr = ChandraOCR(model="datalab-to/chandra")
result = ocr.process("invoice.pdf")

for page in result.pages:
    # Tables come back as structured objects, not flat text
    for table in page.tables:
        print(table.to_dataframe())
    # Text blocks keep their reading order and layout metadata
    for paragraph in page.text_blocks:
        print(paragraph.text)
```

For email platforms like Maylee that need to process PDF attachments, extract invoice data, or parse document content for smart search, Chandra OCR's layout awareness means you get structured data instead of a flat text dump. Tables remain tables, headers remain headers, and the reading order follows the visual layout rather than the raw character stream.
The practical impact for business email workflows is significant. A user who receives a multi-page contract as a PDF attachment can have Maylee extract key terms, dates, and obligations automatically. An accountant receiving invoices can have amounts, line items, and due dates parsed into structured fields. These capabilities depend entirely on OCR quality, and Chandra's layout awareness makes them reliable for the first time in an open-source tool.
Retrieval Augmented Generation (RAG) systems are only as good as their document ingestion pipeline. If your OCR strips table structure from a financial report, the RAG system cannot answer questions about specific line items. If it fails to preserve the hierarchy of a legal contract, the system cannot distinguish a clause from a sub-clause.
Chandra's structured output (Markdown with table formatting, HTML with proper tags, JSON with bounding boxes) feeds directly into RAG pipelines. The generation model receives not just text, but text with structural context, dramatically improving answer quality for document-grounded questions.
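A minimal sketch of the ingestion side, assuming markdown output: chunk on headings so each table stays inside its section rather than being split mid-structure (the chunking strategy is illustrative, not part of Chandra):

```python
import re

def chunk_markdown(md: str) -> list[str]:
    """Split a markdown document into heading-delimited chunks."""
    chunks, current = [], []
    for line in md.splitlines():
        # Start a new chunk at each heading so section context is preserved
        if re.match(r"^#{1,6} ", line) and current:
            chunks.append("\n".join(current).strip())
            current = []
        current.append(line)
    if current:
        chunks.append("\n".join(current).strip())
    return chunks

doc = """# Q3 Report
Revenue grew 12%.

## Line items
| Item | Qty | Price |
|---|---|---|
| Widget | 3 | $5 |
"""
chunks = chunk_markdown(doc)
print(len(chunks), "chunks; the table travels intact inside its section")
```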
Finance and accounting teams process thousands of invoices, purchase orders, and forms. Traditional OCR extracts text but loses the table structure that maps line items to quantities and prices. Manual correction is expensive. Chandra's table extraction (89.9% accuracy, with colspan/rowspan support) reduces the correction burden significantly. One user (Purchaser.ai) reported six-figure cost savings from automating invoice processing with Chandra.
Legal documents are notoriously complex: nested sections, cross-references, footnotes, and multi-column formats. Chandra's header/footer detection (92.5%) and layout preservation make it possible to build AI systems that understand document structure, not just content. This enables clause extraction, contract comparison, and compliance checking that was previously unreliable with OCR-based approaches.
Healthcare digitization requires handling mixed content: printed text, handwritten physician notes, checkboxes, and form fields. Chandra's ability to process handwriting alongside printed text, within the context of the full page layout, addresses a long-standing gap in medical document processing.
Global organizations deal with documents in dozens of languages, often with mixed scripts on the same page. Chandra's 77.8% multilingual accuracy across 43 languages, compared to 67.6% for Gemini 2.5 Flash, makes it the strongest option for multilingual document pipelines.
OCR in 2026 is a crowded field, but the contenders occupy different niches:
Models like GPT-4o, Gemini 2.5 Flash, and Qwen3-VL can perform OCR as one of many capabilities. They are convenient but not specialized. Chandra outperforms them on structured document tasks while being far cheaper to run (4B parameters vs. hundreds of billions).
olmOCR 2 (78.5%): Strong open-source alternative, but 7.4 points behind Chandra 2
DeepSeek OCR (75.4%): Competitive on simple documents, weaker on tables and layout
dots.ocr (83.9%): The closest competitor, but Chandra 2 leads by 2 points overall and significantly more on tables and math
Tesseract: Mature and widely deployed, but weak on handwriting, tables, and complex layouts
PaddleOCR: Good table support under Apache license, but lower accuracy on the olmOCR benchmark
Chandra represents a generational leap over Datalab's previous products (Marker for PDF extraction, Surya for OCR). Those used pipeline architectures that Chandra's full-page approach has now superseded. Datalab positions Chandra as their flagship going forward.
For teams building document AI pipelines, Chandra fits at the ingestion layer.
https://x.com/LangChainAI/status/2037576986005061690
The integration with popular document processing frameworks like LangChain and LlamaIndex is straightforward. Chandra OCR outputs structured JSON that can be directly fed into RAG pipelines, vector databases, or any downstream processing system. This interoperability means you can adopt Chandra OCR without changing your existing document processing architecture.
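As an illustration of that interoperability, a small adapter can flatten block-level JSON into the text-plus-metadata records that LangChain or LlamaIndex document objects wrap. The JSON field names here are hypothetical:

```python
# Flatten hypothetical Chandra-style JSON pages into (text, metadata)
# records ready for a vector store or document framework.
def blocks_to_records(pages: list[dict]) -> list[dict]:
    records = []
    for page in pages:
        for block in page.get("blocks", []):
            records.append({
                "text": block.get("text", ""),
                "metadata": {
                    "page": page["page"],      # keep provenance for citations
                    "type": block["type"],     # heading, paragraph, table, ...
                    "bbox": block.get("bbox"),
                },
            })
    return records

pages = [{"page": 1, "blocks": [
    {"type": "heading", "bbox": [0, 0, 100, 20], "text": "Terms"},
    {"type": "paragraph", "bbox": [0, 30, 100, 90], "text": "Net 30 days."},
]}]
records = blocks_to_records(pages)
print(records[0]["metadata"]["type"])
```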
The GitHub repository includes detailed documentation, pre-trained model weights, and example notebooks for common use cases, including invoice processing, contract analysis, and academic paper parsing.
Document arrives (PDF, scanned image, fax, photo)
Chandra processes each page, outputting structured Markdown/HTML/JSON
Downstream systems consume the structured output for:
Embedding and indexing in vector databases for RAG
Structured data extraction (tables to databases)
Classification and routing
Summarization and analysis
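The routing step in that pipeline can be as simple as a dispatch table keyed on a classifier label; the labels and handler descriptions below are placeholders:

```python
# Toy router: send each processed document to a downstream handler
# based on a (hypothetical) classification label.
def route(doc_type: str) -> str:
    handlers = {
        "invoice": "extract line items into the accounting database",
        "contract": "index clauses for retrieval",
        "report": "embed sections for RAG",
    }
    # Unknown document types fall back to manual triage
    return handlers.get(doc_type, "queue for human review")

print(route("invoice"))
print(route("fax_coversheet"))
```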
The structured output is particularly valuable for AI-powered email and communication tools. When an email arrives with attached invoices, contracts, or reports, Chandra can extract the document content with full structure preserved, enabling AI systems to understand and respond to document-grounded questions. Products like Maylee, which auto-classify incoming emails (invoices, price requests, meeting notes) and draft contextual replies, benefit from this kind of structured document understanding: the AI can reference specific line items in an invoice or clauses in a contract when drafting a response.
Chandra's success demonstrates that specialized, moderately-sized models (4B parameters) can outperform both legacy OCR systems and general-purpose frontier LLMs on domain-specific tasks. This is consistent with a broader trend in AI: as the field matures, purpose-built models increasingly beat generalists at specific jobs while costing a fraction to run.
For teams currently using Tesseract, PaddleOCR, or LLM-based OCR, Chandra represents a meaningful upgrade path: better accuracy on the hard cases (tables, handwriting, math, multilingual content), structured output that feeds directly into AI pipelines, and deployment options ranging from local inference to managed API.
The 85.9% SOTA score on olmOCR is impressive, but the real measure is whether it reduces the manual correction burden that makes document AI projects expensive. Based on the benchmark breakdown, particularly the 89.9% table accuracy and 89.3% math accuracy, Chandra removes a significant portion of the error that has historically required human intervention.
For document-heavy industries (finance, legal, healthcare, government), accurate layout-aware OCR has been the missing piece that prevents full automation of document processing workflows. Chandra does not solve the entire problem, but it meaningfully raises the ceiling on what automated pipelines can handle without human review.
Chandra OCR is an open-source OCR model developed by Datalab that uses full-page decoding with layout awareness to extract text, tables, equations, forms, and handwriting from complex documents. The latest version, Chandra 2, achieves 85.9% state-of-the-art accuracy on the olmOCR benchmark.
Traditional OCR segments pages into blocks and processes each independently, losing structural context. Chandra processes entire pages at once using a vision-language model, preserving table structure, column layout, equation formatting, and the relationship between document elements.
Chandra outputs structured Markdown, HTML, or JSON with full layout metadata including bounding boxes for every element. Tables preserve colspan and rowspan, equations are output as LaTeX, and form fields retain their states.
Chandra 2 is a 4B parameter model. On an H100 GPU, it processes up to 4 pages per second. Quantized variants are available for lower-end hardware. Datalab also offers a hosted API for teams without GPU infrastructure.
Chandra supports 90+ languages with 77.8% average accuracy across 43 tested multilingual benchmarks, significantly outperforming alternatives like Gemini 2.5 Flash (67.6%) on multilingual document processing.
Chandra is open-source with some restrictions. Startups under $2M revenue can use it free. Larger organizations can use the hosted API from Datalab for a fully-licensed option. The model weights are available on Hugging Face.
Chandra's full-page decoding approach processes the entire table structure at once, preserving colspan and rowspan attributes. It achieves 89.9% accuracy on table extraction in the olmOCR benchmark, the highest among tested models.
Chandra can process handwritten text alongside printed text within the same page, using the full-page context to interpret handwriting in relation to form fields, annotations, and surrounding printed content. This is particularly valuable for medical records and archived documents.