A zero-dependency PHP library to generate, parse, and convert documents. PDF, HTML, CSV, DOCX — one unified API.
Install
composer require paperdoc-dev/paperdoc-lib
Parse & Render
DOCX
Parse & Render
HTML
Parse & Render
CSV
Parse & Render
XLSX
Parse & Render
PPTX
Parse Only
OCR
Tesseract
LLM
AI augmentation
Core Features
Paperdoc handles the full document lifecycle — from ingestion to transformation — so you can focus on building your application.
Parse PDF, DOCX, HTML, Markdown, CSV, XLS, and PPTX files into a unified document model.
Convert between any supported format seamlessly. PDF to HTML, DOCX to Markdown, and more.
Extract text from scanned documents and images using Tesseract with intelligent post-processing.
Enrich documents with AI-powered summaries, translations, and structured data extraction.
Capabilities
From raw file ingestion to AI-enriched output, Paperdoc covers the entire document pipeline for your PHP applications.
Install via Composer and start parsing, converting, and generating documents in minutes.
Get startedCreate pixel-perfect PDFs from templates, HTML, or structured data.
Generate Word documents with headers, tables, images, and custom styles.
Render documents to clean, semantic HTML for web display.
Parse and generate spreadsheet files with full cell-level control.
Read and write Markdown with frontmatter and rich formatting.
Tesseract-powered text extraction with n-gram correction and post-processing.
PaperdocAgent for intelligent document analysis, summaries, and Q&A.
ServiceProvider, Facade, and Artisan commands for seamless Laravel usage.
Install Paperdoc via Composer and start parsing, converting, and generating documents in minutes.
Why Paperdoc
Paperdoc is built for PHP developers who need a single, reliable solution for all document formats — no juggling multiple packages.
| Feature | Others | Paperdoc |
|---|---|---|
| Unified API for all formats | ✕ | ✓ |
| PDF, DOCX, HTML, CSV, XLSX, PPTX | ✕ | ✓ |
| Built-in OCR with Tesseract | ✕ | ✓ |
| LLM-powered enrichment | ✕ | ✓ |
| Laravel integration (Facade, ServiceProvider) | ✕ | ✓ |
| Zero external PHP dependencies | ✓ | ✓ |
| Structured document model | ✕ | ✓ |
| Bi-directional conversion | ✕ | ✓ |
| Active maintenance & support | ✕ | ✓ |
Book a quick call or send us a message — we respond within hours, not days.
How it works
Paperdoc follows a clear pipeline to process your documents.
Ingest
01
Pass any supported file format to the Paperdoc factory.
Detect
02
Paperdoc identifies the format and selects the right parser.
Parse
03
Content is parsed into a unified Document model.
Enrich
04
Optionally extract text from images or augment with AI.
Convert
05
Render the document to any target format.
Export
06
Get your file, stream, or string — ready to use.
By the numbers
Paperdoc powers document processing for teams of all sizes — from solo developers to enterprise platforms handling thousands of files daily.
7+
Supported formats
<1s
Avg parse time
100%
PHP native
0
External deps
Testimonials
From SaaS platforms to internal tools, developers rely on Paperdoc to handle their document workflows without friction.
"We replaced three separate libraries with Paperdoc. PDF parsing, DOCX generation, OCR — everything through one clean API. Our document pipeline went from brittle to bulletproof."
Techfly
Lead Developer
FAQ
Everything you need to know about integrating Paperdoc into your PHP projects.
Need something else?
Reach out
Install via Composer and start processing documents in minutes.
No results found for ""
Try a different search term
Start typing to search commands...
Ask AI anything
Press Enter to send your question
AI is thinking...