Most OCR tools run one model over the entire page.
We run two. First, a layout model detects what's where.
Then a vision-language model reads each region separately.
Tables stay tables. Handwriting stays handwriting.
01
Upload
POST a PDF, PNG, JPG, TIFF, or HEIC. Up to 20MB per file.
02
Detect regions
Layout model finds text blocks, tables, formulas, figures, handwriting. Each gets a bounding box.
03
Read and return
VLM reads each region with the right context. You get Markdown + JSON with confidence scores.
/ integration
One endpoint.
curl -X POST https://api.hardparse.com/v1/parse \
  -H "Authorization: Bearer hp_your_key" \
  -F "file=@invoice.pdf"
const form = new FormData();
form.append('file', file);
const res = await fetch('https://api.hardparse.com/v1/parse', {
method: 'POST',
headers: { 'Authorization': `Bearer ${apiKey}` },
body: form,
});
const { markdown, regions } = await res.json();
/ capabilities
What it handles.
Scanned documents
Phone photos, faxes, 300dpi scans. The kind of files your users actually upload.
Tables
Rows, columns, cells. Output is a Markdown table you can parse or render.
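For instance, a two-column line-item table might come back as plain Markdown like this (the rows here are illustrative, not real output):

```
| Item     | Amount |
| -------- | ------ |
| Widget   | $40.00 |
| Shipping | $5.00  |
```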
Handwriting
Notes, filled-in forms, margin annotations. No templates.
Math
LaTeX output. Fractions, integrals, matrices.
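A scanned fraction, integral, or matrix, for example, might be returned as LaTeX along these lines (illustrative, not captured output):

```latex
\frac{p}{q} \qquad \int_0^1 x^2 \, dx \qquad
\begin{pmatrix} a & b \\ c & d \end{pmatrix}
```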
Bounding boxes
Every region: type, coordinates, confidence score. Reading order preserved.