概述

Paper Deep Read

Three-layer academic paper analysis: Overview -> Method Detail -> Innovation.

All analysis is performed by the agent itself. The Python package handles PDF parsing and quality assessment only.

Pipeline

PDF -> parse_pdf(quality_check=True) -> Assess quality
  -> score >= 70: use extracted text
  -> score < 70 + multimodal: Read tool on PDF (VLM)
  -> score < 70 + text-only: OCR fallback
-> Layer 1 Overview -> Layer 2 Method Detail -> Layer 3 Innovation -> Markdown Report

Step 1: Parse PDF & Assess Quality

pip install pdfplumber pymupdf
python -m paper_deep_read <paper.pdf> -o parsed.json

The script always runs quality assessment (5 checks: garbled text, formula quality, text misalignment, empty sections, missing tables). Output includes quality.score (0-100).

Quality thresholds:

>= 70: Good. Use extracted content.
40-69: Degraded. Switch to fallback.
< 40: Critical. Must use fallback.

Fallback Chain

score < 70 AND model is multimodal -> Read PDF directly with Read tool (VLM understands formulas/tables natively)
score < 70 AND model is text-only  -> render pages + OCR (tencentcloud-ocr or similar)
both fail                         -> ask user

Do NOT ask user which fallback. Auto-detect model capability and proceed.

Step 2: Layer 1 - Overview

Extract in this exact order:

Background - Research context, motivation
Problem - General problem -> specific gap -> why it matters
Target Problem - Formal statement, input/output, constraints
Method - Name, category, core idea, architecture, key components
Experiments - Datasets, baselines, main results (with numbers)
Ablation - What each component contributes
Conclusion - Contributions, limitations, future work

Output format: Structured Markdown with tables for experiments/ablation.

Decision points:

Language ambiguity -> match paper's primary language, ask if truly uncertain
10+ formulas in paper -> offer overview-first, then user-selected deep-dive

Step 3: Layer 2 - Method Detail

For every mathematical formula:

Field	Content
-------	---------
Formula text	Exact as it appears
Purpose	What it computes
Symbol table	Each symbol: name, meaning, type, domain, shape
Intuition	Plain-language explanation
Connection	How it relates to other formulas
Complexity	O(...) if applicable

Also describe: overall architecture, data flow, training/inference pipeline, hyperparameters.

Formula template:

#### Formula [id]: [brief name]
**Formula:** [exact text]
**Purpose:** ...
**Symbols:**
| Symbol | Meaning | Type | Domain |
|--------|---------|------|--------|
| ... | ... | scalar/vector/matrix/function | R^d, ... |
**Intuition:** ...
**Connection:** links to formula [x]

Step 4: Layer 3 - Innovation & Optimization

Strengths (3-5) with evidence from paper
Weaknesses (3-5) with suggested fixes
Optimization Opportunities (3-5) - concrete, implementable
New Research Directions (3-5) - each with: title, motivation, connection, expected contribution, methodology sketch, target venues
Experiment Ideas - additional experiments to validate extensions

Ask user about their research direction to tailor suggestions.

Step 5: Output

Generate Markdown report (all 3 layers) -> save to workspace
Use deliver_attachments to deliver
Present Layer 1 inline, offer deep-dive on Layers 2/3

MCP Integration (Optional)

When available, use MCP tools for:

Knowledge base search (related papers)
Any tool that provides complementary analysis

Discover tools via ToolSearch with queries like ["knowledge", "search", "paper"].

Decision Points

#	Situation	Auto-behavior	Ask user only when
---	-----------	---------------	-------------------
1	Language ambiguity	Match paper language	Truly mixed CN/EN
2	10+ formulas	Overview first, offer drill-down	After overview
3	Section boundary unclear	Merge and analyze	—
7	Research domain unknown	Paper-specific suggestions	Always offer to customize
9	PDF quality < 70	Auto-fallback chain	Both fallbacks fail

Full decision definitions: paper_deep_read/schemas.py + paper_deep_read/prompts.py

版本历史

共 1 个版本

v1.0.0 Initial release 当前

2026-06-08 21:43 安全安全

安全检测

腾讯云安全 (Keen)

安全，无风险

查看报告

腾讯云安全 (Sanbu)