Global ranking: #601 of 601 skills

pdf-tools AI Agent Skill

View source: oakoss/agent-skills

Safe

Installation

npx skills add oakoss/agent-skills --skill pdf-tools

Installs: 44

PDF Tools

Full-lifecycle PDF engineering covering extraction, generation, modification, form filling, and security. Prioritizes JavaScript-first solutions (pdf-lib, unpdf, Puppeteer) with Python/CLI utilities for advanced scenarios.

When to use: Extracting structured data from PDFs, generating pixel-perfect PDFs from HTML/React, modifying existing PDFs, filling forms (fillable or non-fillable), or securing documents with encryption.

When NOT to use: Simple text file processing, image-only manipulation without PDF context, or tasks better handled by a word processor.

Quick Reference

| Task | Tool | Key Point |
| --- | --- | --- |
| Generate PDF from HTML | Puppeteer / Playwright | page.pdf(); use networkidle0 (Puppeteer) or networkidle (Playwright) |
| Extract text (lightweight) | unpdf | Edge/serverless compatible |
| Extract tables (AI) | Vision model + Zod schema | Multi-column and merged cell support |
| Extract tables (non-AI) | pdfplumber (Python) | Precise cell boundary detection |
| Modify, merge, split | pdf-lib (or @pdfme/pdf-lib) | Byte-level PDF manipulation in JS |
| Fill fillable forms | pdf-lib (or @pdfme/pdf-lib) | Inspect AcroForm fields before writing |
| Fill non-fillable forms | Python annotation scripts | Visual analysis + bounding box annotations |
| Encrypt PDF | qpdf | AES-256: qpdf --encrypt user owner 256 -- |
| Repair corrupted PDF | qpdf | qpdf input.pdf --replace-input |
| Fast text extraction (CLI) | poppler-utils | pdftotext -layout input.pdf - |
| Merge thousands of files | pypdf (Python) | Lighter than headless browser |
| Batch queue processing | BullMQ + unpdf | Redis-backed with retry, concurrency, progress tracking |
| PDF/A archival compliance | ghostscript + verapdf | gs -dPDFA=2 for conversion; verapdf for validation |
| Tagged PDF (accessibility) | Puppeteer | tagged: true maps HTML semantics to PDF structure tags |
| Digital signatures | @signpdf/* | PKCS#7 signing with P12 certificates |
| PDF comparison | unpdf + diff / pixelmatch | Text diff or pixel-level visual diff between versions |
| Secure redaction | pymupdf (fitz) | apply_redactions() removes content bytes, not just visual overlay |
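
The CLI rows in the table (qpdf encryption, pdftotext extraction) can be driven from a script. The following is a minimal Python sketch, assuming qpdf and poppler-utils are installed and on PATH. The helper names (qpdf_encrypt_args, pdftotext_args, run) and the file names are hypothetical and are not part of the skill itself.

```python
import subprocess


def qpdf_encrypt_args(src, dst, user_password, owner_password):
    """Build the qpdf argument list for 256-bit (AES-256) encryption."""
    # Layout: qpdf --encrypt <user-pw> <owner-pw> <key-length> -- <in> <out>
    return [
        "qpdf", "--encrypt", user_password, owner_password, "256",
        "--", str(src), str(dst),
    ]


def pdftotext_args(src):
    """Build the pdftotext argument list; the trailing '-' writes to stdout."""
    return ["pdftotext", "-layout", str(src), "-"]


def run(args):
    """Run a command in list form (no shell) and return its stdout as text."""
    # check=True raises CalledProcessError on any non-zero exit code.
    result = subprocess.run(args, check=True, capture_output=True, text=True)
    return result.stdout


if __name__ == "__main__":
    # Hypothetical file names; requires the tools to be installed.
    run(qpdf_encrypt_args("input.pdf", "encrypted.pdf", "user-pw", "owner-pw"))
    print(run(pdftotext_args("input.pdf")))
```

Passing arguments as a list rather than a shell string keeps passwords and file names with spaces from being reinterpreted by the shell.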

Common Mistakes

| Mistake | Correct Pattern |
| --- | --- |
| Using canvas drawing commands for PDF generation | Use Puppeteer/Playwright with HTML/CSS templates |
| Running Puppeteer in edge/serverless environments | Use unpdf for edge; Puppeteer requires full Node.js |
| Extracting complex layouts with basic text parsers | Use AI-assisted OCR or pdfplumber for multi-column text |
| Storing unencrypted PDFs with PII in public storage | Apply AES-256 encryption via qpdf before storage |
| Relying on window.print() for server-side generation | Use headless browser APIs (page.pdf()) for deterministic output |
| Using pypdf for complex layout extraction | Use pdfplumber or AI OCR for multi-column or overlapping text |
| Skipping font embedding in containerized environments | Embed Google Fonts or WOFF2 files with Puppeteer |
| Writing to flattened PDF form fields | Inspect AcroForm fields with pdf-lib before writing |
| Using unmaintained pdf-lib for encrypted PDFs | Use @cantoo/pdf-lib fork which adds encrypted PDF support |
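
One way to guard against the "unencrypted PDFs in public storage" mistake is to check a file's encryption status before upload. The sketch below is an illustration rather than the skill's own method. It assumes a qpdf release that supports the --is-encrypted option, which exits with 0 for an encrypted file and 2 for an unencrypted one. The function names are hypothetical.

```python
import subprocess


def interpret_is_encrypted(returncode):
    """Map the exit code of `qpdf --is-encrypted` to a boolean."""
    if returncode == 0:
        return True   # file is encrypted
    if returncode == 2:
        return False  # file is not encrypted
    raise RuntimeError(f"unexpected qpdf exit code: {returncode}")


def is_encrypted(path):
    """Ask qpdf whether the PDF at `path` is encrypted (requires qpdf on PATH)."""
    proc = subprocess.run(["qpdf", "--is-encrypted", str(path)], capture_output=True)
    return interpret_is_encrypted(proc.returncode)


def assert_safe_to_store(path):
    """Raise before upload if the PDF has not been encrypted."""
    if not is_encrypted(path):
        raise ValueError(f"{path} is not encrypted; encrypt it with qpdf first")
```

A storage pipeline could call assert_safe_to_store on each file just before upload, so an unencrypted document fails loudly instead of being published.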

Delegation

  • Inspect PDF structure and diagnose extraction issues: Use Explore agent to examine AcroForm fields, encoding, and metadata
  • Build end-to-end document processing pipelines: Use Task agent to implement extraction, transformation, and generation workflows
  • Design PDF architecture for a new system: Use Plan agent to select tools and plan extraction, generation, or modification strategies


Security audit

ath Safe
socket Safe
Warnings: 0 Score: 90
snyk Low

How to use this skill

1. Install pdf-tools by running npx skills add oakoss/agent-skills --skill pdf-tools in your project directory. The skill file is downloaded from GitHub and placed in your project.

2. No configuration is required. Your AI agent (Claude Code, Cursor, Windsurf, etc.) detects installed skills automatically and uses them as context when generating code.

3. The skill improves your agent's understanding of pdf-tools and helps it follow established patterns, avoid common mistakes, and produce production-ready code.

What you get

Skills are plain-text instruction files, not executable code. They encode expert knowledge about frameworks, languages, or tools that your AI agent reads to improve its output. This means zero runtime overhead, no dependency conflicts, and full transparency: you can read and review every instruction before installing.

Compatibility

This skill works with any AI coding agent that supports the skills.sh format, including Claude Code (Anthropic), Cursor, Windsurf, Cline, Aider, and other tools that read project-level context files. Skills are framework-agnostic at the transport level; the content determines which language or framework it applies to.

Data sourced from the skills.sh registry and GitHub. Install counts and security audits are updated regularly.
