
Doc2Markdown

A lightweight document utility that converts files to Markdown (MD), built specifically for intelligent agents (e.g., OpenClaw, ClaudeCode) to read and process.
haoyt27
Uncategorized · clawhub · v1.0.10 (#latest) · 2 versions · API key: not required
Stars: 6 · Downloads: 688 · Installs: 6

Overview

doc2markdown

Document conversion assistant that automatically converts documents to Markdown (MD), saving output to the same directory as the source file. Designed to help intelligent agents read and process document content in various formats.

Quick Start

# Convert document (auto-polls for 60s, downloads if complete, returns doc ID if timeout)
node scripts/doc2markdown.js convert <file_path>     # Downloads MD package
node scripts/doc2markdown.js convert <file_path> --md  # Downloads single MD file

# Check status and download (for documents that exceeded timeout)
node scripts/doc2markdown.js check <doc_id> <original_file_path>     # Downloads MD package
node scripts/doc2markdown.js check <doc_id> <original_file_path> --md  # Downloads single MD file
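An agent running on Node could wrap the commands above roughly as follows. This is a minimal sketch, not part of the skill: `buildArgs` and `runDoc2Markdown` are hypothetical names, and because the script's output format is not documented here, the wrapper simply returns the raw exit status and output for the caller to inspect.

```javascript
const { spawnSync } = require('node:child_process');

// Build the argument list for the two documented commands.
// `command` is 'convert' or 'check'; `docId` is only required for 'check'.
function buildArgs({ command, filePath, docId, singleFile = false }) {
  if (command !== 'convert' && command !== 'check') {
    throw new Error(`Unknown command: ${command}`);
  }
  if (command === 'check' && !docId) {
    throw new Error('check requires the doc ID returned by convert');
  }
  const args = ['scripts/doc2markdown.js', command];
  if (command === 'check') args.push(docId);
  args.push(filePath);
  if (singleFile) args.push('--md'); // single MD file instead of the MD package
  return args;
}

// Hypothetical wrapper: runs the script synchronously and returns raw results.
function runDoc2Markdown(options) {
  const result = spawnSync('node', buildArgs(options), { encoding: 'utf8' });
  return { status: result.status, stdout: result.stdout, stderr: result.stderr };
}
```

Passing the arguments as an array (rather than a single shell string) avoids quoting problems when file paths contain spaces.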

Capabilities

  • Supported formats: docx, doc, pdf, ppt, pptx, xls, xlsx, jpg, jpeg, png, ceb, teb, caj, odt, ofd, cebx, odp, ott, wps, ods, et, dps, epub, chm, sdc, sdd, sdw, mobi, etc.
  • Preserves document structure, tables, and images
  • No API key or account required and zero external dependencies, although conversion itself runs on a remote service (see Data & Privacy)
  • Downloaded ZIP files are extracted to {doc_id}_{filename}/ under the source file's parent directory; single MD files are saved directly there
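For an agent that needs to locate results after a run, the output rule and format list above can be sketched as two small helpers. Both function names are hypothetical, and the sketch assumes that `{filename}` means the source file's base name including its extension, which the description does not spell out. Since the format list ends in "etc.", a miss in `isListedFormat` is a hint rather than a definitive rejection.

```javascript
const path = require('node:path');

// Extensions copied from the "Supported formats" bullet (the list is not exhaustive).
const LISTED_EXTENSIONS = new Set([
  'docx', 'doc', 'pdf', 'ppt', 'pptx', 'xls', 'xlsx', 'jpg', 'jpeg', 'png',
  'ceb', 'teb', 'caj', 'odt', 'ofd', 'cebx', 'odp', 'ott', 'wps', 'ods',
  'et', 'dps', 'epub', 'chm', 'sdc', 'sdd', 'sdw', 'mobi',
]);

// True when the file's extension appears in the documented list (case-insensitive).
function isListedFormat(filePath) {
  const ext = path.extname(filePath).slice(1).toLowerCase();
  return LISTED_EXTENSIONS.has(ext);
}

// Directory where output is expected to land for a given source file.
function expectedOutputDir(sourcePath, docId, singleFile = false) {
  const parentDir = path.dirname(sourcePath);
  if (singleFile) return parentDir; // --md: the MD file is saved directly here
  // Assumption: {filename} is the base name with extension.
  return path.join(parentDir, `${docId}_${path.basename(sourcePath)}`);
}
```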

When to Use

  • User requests to "read", "extract", "convert", or "view" a document
  • User provides a document path and asks about its content
  • User needs to summarize or analyze a document
  • User needs to convert document content to a Markdown package

Download Modes

This tool supports two download modes:

  • --md mode: Downloads a single merged MD file to the source file's parent directory. Images are not included.
  • MD package (default): Downloads and extracts a ZIP package to {doc_id}_{filename}/ in the source file's parent directory. Includes image files and tables; tables are rendered in HTML format.

Choosing the Right Mode

| User Intent | Example Phrases | Mode to Use |
| --- | --- | --- |
| Read / view / analyze a document | "read this file", "what's in this doc", "summarize this PDF" | --md (single MD file) |
| Explicitly convert to MD | "convert to MD", "export as markdown", "转成MD" | MD package (default, no --md); use --md only if the user specifically asks for a single file |
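The mode-selection rule can be expressed as a small lookup. This is an illustrative sketch only: the intent labels `'read'` and `'convert'` and the function name `chooseModeFlags` are hypothetical, and an agent would still need to classify the user's request into one of them.

```javascript
// Map a classified user intent to the CLI flags to append.
// 'read' covers read / view / analyze / summarize requests;
// 'convert' covers explicit "convert to MD" requests.
function chooseModeFlags(intent, { wantsSingleFile = false } = {}) {
  switch (intent) {
    case 'read':
      return ['--md']; // single MD file
    case 'convert':
      // MD package by default; --md only when a single file is explicitly requested.
      return wantsSingleFile ? ['--md'] : [];
    default:
      throw new Error(`Unrecognized intent: ${intent}`);
  }
}
```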

Workflow

convert — Convert Document

  1. Invoke file parsing service
  2. Auto-poll conversion status (up to 60 seconds)
  3. Completes within 60s → Auto-download to source file directory
  4. Exceeds 60s → Return doc ID for subsequent check query

check — Query and Download

  1. Provide the previously returned doc ID
  2. Download if complete, otherwise continue polling for 60 seconds
  3. Prompt to retry later if still not complete
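Both workflows above follow the same pattern: poll for up to 60 seconds, download on completion, and otherwise defer. The sketch below shows that pattern in isolation. It is not the skill's actual implementation: the status value `'done'`, the 2-second interval, and the names `nextStep` and `pollConversion` are assumptions, and `getStatus` stands in for whatever call queries the parsing service.

```javascript
const POLL_WINDOW_MS = 60 * 1000; // the 60-second window from the workflow

// Pure decision step: what to do given the latest status and the time elapsed.
function nextStep(status, elapsedMs, windowMs = POLL_WINDOW_MS) {
  if (status === 'done') return 'download'; // assumed status name
  return elapsedMs < windowMs ? 'poll-again' : 'defer';
}

// Poll `getStatus` (an async function supplied by the caller) until the
// conversion finishes or the window runs out. Resolves to 'download' or 'defer'.
async function pollConversion(getStatus, { intervalMs = 2000, windowMs = POLL_WINDOW_MS } = {}) {
  const startedAt = Date.now();
  for (;;) {
    const status = await getStatus();
    const step = nextStep(status, Date.now() - startedAt, windowMs);
    if (step !== 'poll-again') return step;
    await new Promise((resolve) => setTimeout(resolve, intervalMs));
  }
}
```

In the convert workflow, `'defer'` corresponds to returning the doc ID for a later check; in the check workflow, it corresponds to prompting the user to retry later.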

Data & Privacy

  • convert uploads files to the docchain cloud service (lab.hjcloud.com) for parsing. Results are returned as a ZIP archive and extracted locally, or as a single MD file when --md is used.
  • All transfers use HTTPS encryption.
  • Users should ensure that documents do not contain sensitive or confidential information unless they have verified the service's data handling practices.
  • Service endpoint: https://lab.hjcloud.com/llmdoc

Feedback & Support

For parsing errors, format issues, or other problems, please submit an issue on GitHub:

https://github.com/wct-lab/docchain-skills

Version History

2 versions in total

  • v1.0.10 (current)
    2026-05-03 04:10 · Security status: safe
  • v1.0.9
    2026-03-31 04:14

Security Checks

  • Tencent Cloud Security (Keen): safe, no risk
  • Tencent Cloud Security (Sanbu): safe, no risk
