Handles PDF operations in code: reading, writing, merging, splitting, rotating, watermarking, encrypting, extracting text and tables, OCR on scanned pages, and creating new PDFs from scratch.
Setup & Installation
What This Skill Does
Handles PDF operations in code: reading, writing, merging, splitting, rotating, watermarking, encrypting, extracting text and tables, OCR on scanned pages, and creating new PDFs from scratch. Covers both Python libraries (pypdf, pdfplumber, reportlab) and CLI tools (qpdf, pdftk, pdftotext).
Instead of piecing together which Python library handles which task, this skill maps every common PDF operation to the right tool with working code snippets ready to adapt.
When to use it
- Extracting tables from a multi-page PDF report into a spreadsheet
- Merging dozens of invoice PDFs into a single file for accounting
- Running OCR on scanned contracts to make them text-searchable
- Splitting a large PDF manual into per-chapter files
- Adding a watermark to all pages before sending a document externally
Similar Skills
pptx-generator
Generates, edits, and reads PowerPoint files using PptxGenJS and markitdown.
wp-project-triage
Inspects a WordPress repository and outputs a structured JSON report describing the project kind, tooling, tests, and version hints.
internal-comms
Writes internal communications in company-specific formats.
pptx
Handles everything involving .
