Getting Started
Get up and running with Smole in minutes.
Quick Start
Smole is a document extraction API that converts documents to Markdown and extracts structured data using AI. Get started in minutes with just a few API calls. See for details on how the pipeline works.
1. Get your API key
Sign up for an account and create an API key from your dashboard. Your API key starts with ak_. See for best practices on storing your keys.
2. Register a schema
Create a JSON schema that defines the structure of data you want to extract. See for a complete guide on schema structure, supported types, and best practices.
Save the returned id - you'll use it for extraction jobs.
Tip: Don't want to write JSON Schema by hand? Use /api/schemas/generate to have AI create one from simple field definitions. See for a full example.
3. Upload a document
Send a document file with your schema ID to start extraction:
4. Poll for results
The pipeline returns a job ID. Poll the status endpoint until the job completes:
When status is "completed", the result field contains your extracted JSON data.
Tutorials
Dive deeper with step-by-step guides:
PDF to JSON API Guide
Convert digital and scanned PDFs to structured data
Python Integration Guide
Complete Python code for extraction, batching, and CSV export
JSON Schema Design Guide
Best practices for schema design and field naming
Automation Workflows
Build pipelines for batch processing and system integration
