Open source

Parse, render
& convert.

A zero-dependency PHP library to generate, parse, and convert documents. PDF, HTML, CSV, DOCX — one unified API.

PHP 8.2+ Zero deps Laravel ready

Core Features

One Library,
Every Format

Paperdoc handles the full document lifecycle — from ingestion to transformation — so you can focus on building your application.

Multi-Format Parsing

Parse PDF, DOCX, HTML, Markdown, CSV, XLS, and PPTX files into a unified document model.

Format Conversion

Convert between any supported format seamlessly. PDF to HTML, DOCX to Markdown, and more.

OCR Processing

Extract text from scanned documents and images using Tesseract with intelligent post-processing.

LLM Augmentation

Enrich documents with AI-powered summaries, translations, and structured data extraction.

Capabilities

Everything You Need to Process Documents

From raw file ingestion to AI-enriched output, Paperdoc covers the entire document pipeline for your PHP applications.

Ready to simplify
document handling?

Install via Composer and start parsing, converting, and generating documents in minutes.

Get started
01

PDF Generation

Create pixel-perfect PDFs from templates, HTML, or structured data.

02

DOCX Export

Generate Word documents with headers, tables, images, and custom styles.

03

HTML Rendering

Render documents to clean, semantic HTML for web display.

04

CSV & Excel

Parse and generate spreadsheet files with full cell-level control.

05

Markdown Support

Read and write Markdown with frontmatter and rich formatting.

06

OCR Pipeline

Tesseract-powered text extraction with n-gram correction and post-processing.

07

AI Agent

PaperdocAgent for intelligent document analysis, summaries, and Q&A.

08

Laravel Integration

ServiceProvider, Facade, and Artisan commands for seamless Laravel usage.

Ready to simplify document handling?

Install Paperdoc via Composer and start parsing, converting, and generating documents in minutes.

Get started

Why Paperdoc

Why choose Paperdoc
over alternatives?

Paperdoc is built for PHP developers who need a single, reliable solution for all document formats — no juggling multiple packages.

Feature Others Paperdoc
Unified API for all formats
PDF, DOCX, HTML, CSV, XLSX, PPTX
Built-in OCR with Tesseract
LLM-powered enrichment
Laravel integration (Facade, ServiceProvider)
Zero external PHP dependencies
Structured document model
Bi-directional conversion
Active maintenance & support

Need help choosing?

Book a quick call or send us a message — we respond within hours, not days.

Get started

How it works

From File to
Structured Data

Paperdoc follows a clear pipeline to process your documents.

Ingest

01

Load file

Pass any supported file format to the Paperdoc factory.

Detect

02

Auto-detect

Paperdoc identifies the format and selects the right parser.

Parse

03

Extract data

Content is parsed into a unified Document model.

Enrich

04

OCR & LLM

Optionally extract text from images or augment with AI.

Convert

05

Transform

Render the document to any target format.

Export

06

Output

Get your file, stream, or string — ready to use.

By the numbers

Built for Scale
and Reliability

Paperdoc powers document processing for teams of all sizes — from solo developers to enterprise platforms handling thousands of files daily.

7+

Supported formats

<1s

Avg parse time

100%

PHP native

0

External deps

Have questions? We're here to help.

Contact us

Testimonials

Teams Who Trust
Paperdoc

From SaaS platforms to internal tools, developers rely on Paperdoc to handle their document workflows without friction.

★★★★★
"We replaced three separate libraries with Paperdoc. PDF parsing, DOCX generation, OCR — everything through one clean API. Our document pipeline went from brittle to bulletproof."

Techfly

Lead Developer

FAQ

Common
Questions

Everything you need to know about integrating Paperdoc into your PHP projects.

Need something else?
Reach out

Contact us

Paperdoc requires PHP 8.2 or higher. It leverages modern PHP features like enums, fibers, and readonly properties for a clean, type-safe API.

No. Paperdoc is built with zero external PHP dependencies. It only requires standard PHP extensions (dom, mbstring, zip, zlib) that are commonly available.

Paperdoc integrates with Tesseract OCR for text extraction from scanned documents and images. It includes intelligent post-processing with n-gram correction to improve accuracy.

Absolutely. Paperdoc ships with a Laravel ServiceProvider, a Facade, and Artisan commands. Just install via Composer and start using Paperdoc::parse() or Paperdoc::render() immediately.

Paperdoc includes an LlmAugmenter and PaperdocAgent powered by neuron-ai. You can enrich documents with AI-generated summaries, translations, structured data extraction, and intelligent Q&A over your documents.

Parse, render &
convert — all in PHP.

Install via Composer and start processing documents in minutes.