PDF OCR

PDF OCR Overview

The Qoest PDF OCR endpoint extracts metadata and per-page text from publicly accessible PDF URLs.

#Available Endpoint

  • POST /v1/pdf - Extract metadata and page text from a PDF URL (1 credit per page, minimum 1, maximum 30)

Credits are deducted only after a successful extraction; failed requests are not charged.
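The pricing rule above (1 credit per page, minimum 1, maximum 30) can be sketched as a small client-side estimate. The function name `estimate_credits` and the constants are illustrative assumptions, not part of the Qoest API; the service itself decides the actual charge.

```python
MIN_CREDITS = 1   # documented minimum charge per request
MAX_PAGES = 30    # documented page limit (and therefore maximum charge)


def estimate_credits(page_count: int) -> int:
    """Estimate the credits a successful extraction would cost.

    Hypothetical helper: mirrors the documented rule of 1 credit per
    page with a minimum of 1 and a maximum of 30.
    """
    if page_count < 0:
        raise ValueError("page_count cannot be negative")
    if page_count > MAX_PAGES:
        # The docs state that PDFs over 30 pages are rejected, so no
        # credit figure applies to them.
        raise ValueError(f"PDFs over {MAX_PAGES} pages are not accepted")
    return max(MIN_CREDITS, page_count)
```

For example, `estimate_credits(12)` returns 12, and a one-page PDF costs the 1-credit minimum.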

#Supported Inputs

  • PDF URLs ending in .pdf
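A client might check a URL against this rule before spending a request. The sketch below is an assumption about how to read "ending in .pdf": it takes the rule literally (no query string or fragment after the extension) and compares the extension case-insensitively, neither of which the documentation confirms. The name `looks_like_pdf_url` is hypothetical.

```python
from urllib.parse import urlparse


def looks_like_pdf_url(url: str) -> bool:
    """Return True if `url` is an http(s) URL whose last characters are ".pdf".

    Assumptions (not confirmed by the API docs): the extension check is
    case-insensitive, and URLs carrying a query string or fragment are
    treated as not ending in .pdf.
    """
    parts = urlparse(url)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        return False
    if parts.query or parts.fragment:
        return False
    return parts.path.lower().endswith(".pdf")
```

If the service turns out to accept query strings, the second check can simply be dropped.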

#Limits

  • PDFs are limited to a maximum of 30 pages
  • Requests for PDFs exceeding 30 pages return HTTP 400
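Because an over-limit PDF is rejected with a 400, client code may want to turn that status into an actionable message. This is a minimal sketch; `describe_status` is a hypothetical helper, and it assumes only what is stated above. A 400 may also be returned for other reasons the documentation does not list here, so the message is phrased as a hint.

```python
MAX_PAGES = 30  # documented page limit


def describe_status(status_code: int) -> str:
    """Map an HTTP status code from POST /v1/pdf to a short hint."""
    if 200 <= status_code < 300:
        return "success"
    if status_code == 400:
        # Documented cause: the PDF exceeds the page limit. Other causes
        # of a 400 are possible but not documented in this section.
        return f"rejected: check that the PDF has at most {MAX_PAGES} pages"
    return f"unhandled status {status_code}"
```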

#Authentication

Send your API token as a bearer token:

Authorization: Bearer YOUR_API_TOKEN

#Quick Start Example

curl -X POST /v1/pdf \
  -H "Authorization: Bearer YOUR_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{
    "url": "https://example.com/report.pdf"
  }'
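The same request can be sketched in Python with only the standard library. The base URL below is a placeholder assumption, since this page does not state the API host; `build_pdf_request` is likewise a hypothetical helper name. The function only builds the request, so it can be inspected before anything is sent.

```python
import json
import urllib.request

# Placeholder host: the real API base URL is not given in this section.
BASE_URL = "https://api.example.invalid"


def build_pdf_request(pdf_url: str, api_token: str,
                      base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a POST /v1/pdf request for `pdf_url`."""
    body = json.dumps({"url": pdf_url}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/v1/pdf",
        data=body,
        headers={
            "Authorization": f"Bearer {api_token}",  # bearer-token auth
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

To send it, pass the result to `urllib.request.urlopen` and decode the body as JSON. The response field names are not described in this section, so inspect the returned object rather than relying on assumed keys.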

#What's Included

  • PDF text extraction - Extract text page-by-page from PDF documents
  • Metadata - Get word count, page count, file size, and creation date
  • Built for production - Fast, reliable text extraction for real-world workloads