← 返回
未分类

instant-data-qa

View, extract, edit, and manipulate PDF files. Supports text extraction, text editing (overlay and replacement), merging, splitting, rotating pages, and getting PDF metadata. Use when working with PDF documents for reading content, adding/editing text, reorganizing pages, combining files, or extracting information.
user_33aa5c8b
未分类 community v1.0.100000000000000000000000000000000000000000000000 2 版本 100000 Key: 无需
★ 0
Stars
📥 122
下载
💾 0
安装
2
版本
#latest

概述

PDF Tools

Tools for viewing, extracting, and editing PDF files using Python libraries (pdfplumber and PyPDF2).

Quick Start

All scripts require dependencies:

pip3 install pdfplumber PyPDF2

Core Operations

Extract Text

Extract text from PDF (all pages or specific pages):

scripts/extract_text.py document.pdf
scripts/extract_text.py document.pdf -p 1 3 5
scripts/extract_text.py document.pdf -o output.txt

Get PDF Info

View metadata and structure:

scripts/pdf_info.py document.pdf
scripts/pdf_info.py document.pdf -f json

Merge PDFs

Combine multiple PDFs into one:

scripts/merge_pdfs.py file1.pdf file2.pdf file3.pdf -o merged.pdf

Split PDF

Split into individual pages:

scripts/split_pdf.py document.pdf -o output_dir/

Split by page ranges:

scripts/split_pdf.py document.pdf -o output_dir/ -m ranges -r "1-3,5-7,10-12"

Rotate Pages

Rotate all pages or specific pages:

scripts/rotate_pdf.py document.pdf -o rotated.pdf -r 90
scripts/rotate_pdf.py document.pdf -o rotated.pdf -r 180 -p 1 3 5

Edit Text

Add text overlay on a page:

scripts/edit_text.py document.pdf -o edited.pdf --overlay "New Text" --page 1 --x 100 --y 700
scripts/edit_text.py document.pdf -o edited.pdf --overlay "Watermark" --page 1 --x 200 --y 400 --font-size 20

Replace text (limited, works best for simple cases):

scripts/edit_text.py document.pdf -o edited.pdf --replace "Old Text" "New Text"

Note: PDF text editing is complex due to the format. The overlay method is more reliable than replacement.

Workflow Patterns

Viewing PDF Content

  1. Get basic info: scripts/pdf_info.py file.pdf
  2. Extract text to preview: scripts/extract_text.py file.pdf -p 1
  3. Extract full text if needed: scripts/extract_text.py file.pdf -o content.txt

Reorganizing PDFs

  1. Split into pages: scripts/split_pdf.py input.pdf -o pages/
  2. Merge selected pages: scripts/merge_pdfs.py pages/page_1.pdf pages/page_3.pdf -o reordered.pdf

Extracting Sections

  1. Get page count: scripts/pdf_info.py document.pdf
  2. Split by ranges: scripts/split_pdf.py document.pdf -o sections/ -m ranges -r "1-5,10-15"

Advanced Usage

For detailed library documentation and advanced patterns, see references/libraries.md.

Notes

  • Page numbers are 1-indexed in all scripts (page 1 = first page)
  • Text extraction works best with text-based PDFs (not scanned images)
  • Rotation angles: 90, 180, 270, or -90 (counterclockwise)
  • All scripts validate file existence before processing

版本历史

共 2 个版本

  • v1.0.100000000000000000000000000000000000000000000000 Initial release 当前
    2026-04-21 19:54 安全 安全
  • v1.0.0 Initial release
    2026-04-13 19:50 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

search-baidu

user_33aa5c8b
通过腾讯 ADP 平台的多模态插件能力,根据文本或图文组合输入生成图片、视频和 3D 模型,统一以 submit + 轮询 query 完成任务并返回成品 URL。覆盖文生图、图生图、文生视频、图生视频、文生 3D、图生 3D 全场景,并按
★ 0 📥 100

search-baidu

user_33aa5c8b
Use this skill whenever the user wants to do anything with PDF files. This includes reading or extracting text/tables fr
★ 0 📥 145