#601

Global Rank · of 601 Skills

pdf-tools AI Agent Skill

View Source: oakoss/agent-skills

Safe

Installation

npx skills add oakoss/agent-skills --skill pdf-tools

44

Installs

PDF Tools

Full-lifecycle PDF engineering covering extraction, generation, modification, form filling, and security. Prioritizes JavaScript-first solutions (pdf-lib, unpdf, Puppeteer) with Python/CLI utilities for advanced scenarios.

When to use: Extracting structured data from PDFs, generating pixel-perfect PDFs from HTML/React, modifying existing PDFs, filling forms (fillable or non-fillable), or securing documents with encryption.

When NOT to use: Simple text file processing, image-only manipulation without PDF context, or tasks better handled by a word processor.

Quick Reference

Task Tool Key Point
Generate PDF from HTML Puppeteer / Playwright page.pdf(); use networkidle0 (Puppeteer) or networkidle (Playwright)
Extract text (lightweight) unpdf Edge/serverless compatible
Extract tables (AI) Vision model + Zod schema Multi-column and merged cell support
Extract tables (non-AI) pdfplumber (Python) Precise cell boundary detection
Modify, merge, split pdf-lib (or @pdfme/pdf-lib) Byte-level PDF manipulation in JS
Fill fillable forms pdf-lib (or @pdfme/pdf-lib) Inspect AcroForm fields before writing
Fill non-fillable forms Python annotation scripts Visual analysis + bounding box annotations
Encrypt PDF qpdf AES-256: qpdf --encrypt user owner 256 --
Repair corrupted PDF qpdf qpdf input.pdf --replace-input
Fast text extraction (CLI) poppler-utils pdftotext -layout input.pdf -
Merge thousands of files pypdf (Python) Lighter than headless browser
Batch queue processing BullMQ + unpdf Redis-backed with retry, concurrency, progress tracking
PDF/A archival compliance ghostscript + verapdf gs -dPDFA=2 for conversion; verapdf for validation
Tagged PDF (accessibility) Puppeteer tagged: true maps HTML semantics to PDF structure tags
Digital signatures @signpdf/* PKCS#7 signing with P12 certificates
PDF comparison unpdf + diff / pixelmatch Text diff or pixel-level visual diff between versions
Secure redaction pymupdf (fitz) apply_redactions() removes content bytes, not just visual overlay

Common Mistakes

Mistake Correct Pattern
Using canvas drawing commands for PDF generation Use Puppeteer/Playwright with HTML/CSS templates
Running Puppeteer in edge/serverless environments Use unpdf for edge; Puppeteer requires full Node.js
Extracting complex layouts with basic text parsers Use AI-assisted OCR or pdfplumber for multi-column text
Storing unencrypted PDFs with PII in public storage Apply AES-256 encryption via qpdf before storage
Relying on window.print() for server-side generation Use headless browser APIs (page.pdf()) for deterministic output
Using pypdf for complex layout extraction Use pdfplumber or AI OCR for multi-column or overlapping text
Skipping font embedding in containerized environments Embed Google Fonts or WOFF2 files with Puppeteer
Writing to flattened PDF form fields Inspect AcroForm fields with pdf-lib before writing
Using unmaintained pdf-lib for encrypted PDFs Use @cantoo/pdf-lib fork which adds encrypted PDF support

Delegation

  • Inspect PDF structure and diagnose extraction issues: Use Explore agent to examine AcroForm fields, encoding, and metadata
  • Build end-to-end document processing pipelines: Use Task agent to implement extraction, transformation, and generation workflows
  • Design PDF architecture for a new system: Use Plan agent to select tools and plan extraction, generation, or modification strategies

References

Installs

Installs 44
Global Rank #601 of 601

Security Audit

ath Safe
socket Safe
Alerts: 0 Score: 90
snyk Low
EU EU-Hosted Inference API

Power your AI Agents with the best open-source models.

Drop-in OpenAI-compatible API. No data leaves Europe.

Explore Inference API

GLM

GLM 5

$1.00 / $3.20

per M tokens

Kimi

Kimi K2.5

$0.60 / $2.80

per M tokens

MiniMax

MiniMax M2.5

$0.30 / $1.20

per M tokens

Qwen

Qwen3.5 122B

$0.40 / $3.00

per M tokens

How to use this skill

1

Install pdf-tools by running npx skills add oakoss/agent-skills --skill pdf-tools in your project directory. Run the install command above in your project directory. The skill file will be downloaded from GitHub and placed in your project.

2

No configuration needed. Your AI agent (Claude Code, Cursor, Windsurf, etc.) automatically detects installed skills and uses them as context when generating code.

3

The skill enhances your agent's understanding of pdf-tools, helping it follow established patterns, avoid common mistakes, and produce production-ready output.

What you get

Skills are plain-text instruction files — not executable code. They encode expert knowledge about frameworks, languages, or tools that your AI agent reads to improve its output. This means zero runtime overhead, no dependency conflicts, and full transparency: you can read and review every instruction before installing.

Compatibility

This skill works with any AI coding agent that supports the skills.sh format, including Claude Code (Anthropic), Cursor, Windsurf, Cline, Aider, and other tools that read project-level context files. Skills are framework-agnostic at the transport level — the content inside determines which language or framework it applies to.

Data sourced from the skills.sh registry and GitHub. Install counts and security audits are updated regularly.

EU Made in Europe

Chat with 100+ AI Models in one App.

Use Claude, ChatGPT, Gemini alongside with EU-Hosted Models like Deepseek, GLM-5, Kimi K2.5 and many more.

Customer Support