LatestFebruary 24, 2026How to Convert PDFs to JSON with an APIA practical guide to converting PDF documents into structured JSON data using a REST API. Covers digital PDFs, scanned documents, and batch processing.SSmole TeamRead article
pdfFeb 23How to Extract Tables from PDFs into Structured DataExtract tables from PDF documents into structured JSON or CSV. Handle multi-column layouts, merged cells, and inconsistent formatting with schema-based extraction.
pythonFeb 22Extract Structured Data from Documents with PythonHow to extract structured JSON data from PDFs, scanned documents, and Word files using Python. Complete code examples with requests, error handling, and batch processing.
pdfFeb 21Convert PDF to CSV: Extract Tabular Data via APIConvert PDF documents to CSV files by extracting structured data via API. Turn invoices, reports, and tables into spreadsheet-ready formats with schema-based extraction.
ocrFeb 20How to Extract Data from Scanned DocumentsLearn how to extract structured data from scanned PDFs, photographed documents, and image-based files using OCR and schema-based extraction.
imagesFeb 19How to Extract Data from Images with an APIExtract structured data from photos, screenshots, and scanned images using OCR and schema-based extraction. Process receipts, business cards, forms, and documents captured on phones.
docxFeb 17Convert Word Documents (DOCX) to JSON via APIHow to extract structured JSON data from Word documents using a REST API. Convert DOCX files to structured data for contracts, reports, and forms.
javascriptFeb 16Extract Document Data with JavaScript and Node.jsHow to extract structured JSON from PDFs and documents using JavaScript and Node.js. Complete code examples with fetch, error handling, and batch processing.
invoicesFeb 14How to Automate Invoice Processing with an APIStep-by-step guide to automating invoice data extraction. Extract vendor details, line items, totals, and VAT from invoices into structured JSON using a REST API.
xlsxFeb 13Extract Data from Spreadsheets (XLSX) via APIExtract structured data from Excel spreadsheets and XLSX files using a REST API. Convert complex workbooks with multiple sheets, formulas, and formatting into clean JSON.
automationFeb 11How to Automate Data Entry from DocumentsEliminate manual data entry by automatically extracting structured data from documents. A practical guide for teams processing invoices, forms, receipts, and reports.
contractsFeb 10How to Extract Key Data from Contracts AutomaticallyA guide to extracting parties, dates, obligations, payment terms, and key clauses from contracts and legal agreements using schema-based document extraction.
batchFeb 8Batch Document Processing: Process Hundreds of Files via APIProcess large volumes of documents at scale using a REST API. Batch extract data from invoices, contracts, forms, and reports with parallel processing and error handling.
json-schemaFeb 7JSON Schema Guide for Document ExtractionEverything you need to know about designing JSON Schemas for document data extraction. Field naming, data types, nested objects, arrays, and real-world schema patterns.
receiptsFeb 6Receipt OCR API: Extract Data from Receipts AutomaticallyExtract store names, items, prices, totals, and payment methods from receipts using OCR and schema-based extraction. Works with photos, scans, and digital receipts.
automationFeb 1Automating Document Workflows with Smole APILearn how to build automated document processing pipelines that scale with your business.
json-schemaJan 25Building Effective JSON Schemas for Invoice ExtractionA practical guide to designing JSON schemas that maximize extraction accuracy for invoices and receipts.
aiJan 20OCR vs AI-Powered Extraction: What's the Difference?Understanding the key differences between traditional OCR and modern AI-powered document extraction.
tutorialJan 15Getting Started with Document ExtractionLearn how to extract structured data from documents using Smole's API in just a few minutes.