Extract structured data from documents with high accuracy
| Use Case | Endpoint | Output |
|---|---|---|
| Get document text/markdown | POST /api/v1/parse |
Markdown + chunks |
| Extract form data (invoices, IDs, tax forms) | POST /api/v1/extract |
Structured JSON fields |
| Process large files or batches | POST /api/v1/extract/async |
Job ID (poll for results) |
Convert documents to clean Markdown with structure. Best for full text extraction and search indexing. Returns content in chunks with page coordinates.
Extract structured data from forms, invoices, IDs, and tax documents. Returns field-level data with confidence scores. Auto-detects document type.
Background processing for large files. Submit a job and poll for results or receive webhook notification on completion.
Get a signed URL for uploading files larger than 100MB. Returns document_id for processing.
Health check endpoint. Returns service status and component health.