← 返回
开发者工具 Key 中文

XCrawl Map

Use this skill for XCrawl map tasks, including site URL discovery, regex filtering, scope estimation, and crawl planning before full-site crawling.
用于XCrawl地图任务,包括全站爬取前的站点URL发现、正则过滤、范围估算和爬取规划。
wykings
开发者工具 clawhub v1.0.2 1 版本 99788.7 Key: 需要
★ 0
Stars
📥 1,417
下载
💾 43
安装
1
版本
#latest

概述

XCrawl Map

Overview

This skill uses XCrawl Map API to discover URLs for a site.

Default behavior is raw passthrough: return upstream API response bodies as-is.

Required Local Config

Before using this skill, the user must create a local config file and write XCRAWL_API_KEY into it.

Path: ~/.xcrawl/config.json

{
  "XCRAWL_API_KEY": "<your_api_key>"
}

Read API key from local config file only. Do not require global environment variables.

Credits and Account Setup

Using XCrawl APIs consumes credits.

If the user does not have an account or available credits, guide them to register at https://dash.xcrawl.com/.

After registration, they can activate the free 1000 credits plan before running requests.

Tool Permission Policy

Request runtime permissions for curl and node only.

Do not request Python, shell helper scripts, or other runtime permissions.

API Surface

  • Start map task: POST /v1/map
  • Base URL: https://run.xcrawl.com
  • Required header: Authorization: Bearer

Usage Examples

cURL

API_KEY="$(node -e "const fs=require('fs');const p=process.env.HOME+'/.xcrawl/config.json';const k=JSON.parse(fs.readFileSync(p,'utf8')).XCRAWL_API_KEY||'';process.stdout.write(k)")"

curl -sS -X POST "https://run.xcrawl.com/v1/map" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer ${API_KEY}" \
  -d '{"url":"https://example.com","filter":"/docs/.*","limit":2000,"include_subdomains":true,"ignore_query_parameters":false}'

Node

node -e '
const fs=require("fs");
const apiKey=JSON.parse(fs.readFileSync(process.env.HOME+"/.xcrawl/config.json","utf8")).XCRAWL_API_KEY;
const body={url:"https://example.com",filter:"/docs/.*",limit:3000,include_subdomains:true,ignore_query_parameters:false};
fetch("https://run.xcrawl.com/v1/map",{
  method:"POST",
  headers:{"Content-Type":"application/json",Authorization:`Bearer ${apiKey}`},
  body:JSON.stringify(body)
}).then(async r=>{console.log(await r.text());});
'

Request Parameters

Request endpoint and headers

  • Endpoint: POST https://run.xcrawl.com/v1/map
  • Headers:
  • Content-Type: application/json
  • Authorization: Bearer

Request body: top-level fields

FieldTypeRequiredDefaultDescription
---------:------
urlstringYes-Site entry URL
filterstringNo-Regex filter for URLs
limitintegerNo5000Max URLs (up to 100000)
include_subdomainsbooleanNotrueInclude subdomains
ignore_query_parametersbooleanNotrueIgnore URLs with query parameters

Response Parameters

FieldTypeDescription
---------
map_idstringTask ID
endpointstringAlways map
versionstringVersion
statusstringcompleted
urlstringEntry URL
dataobjectURL list data
started_atstringStart time (ISO 8601)
ended_atstringEnd time (ISO 8601)
total_credits_usedintegerTotal credits used

data fields:

  • links: URL list
  • total_links: URL count
  • credits_used: credits used
  • credits_detail: credit breakdown

Workflow

  1. Restate mapping objective.
    • Discovery only, selective crawl planning, or structure analysis.
  1. Build and execute POST /v1/map.
    • Keep filters explicit and reproducible.
  1. Return raw API response directly.
    • Do not synthesize URL-family summaries unless requested.

Output Contract

Return:

  • Endpoint used (POST /v1/map)
  • request_payload used for the request
  • Raw response body from map call
  • Error details when request fails

Do not generate summaries unless the user explicitly requests a summary.

Guardrails

  • Do not claim full site coverage if limit is reached.
  • Do not mix inferred URLs with returned URLs.
  • Do not hardcode provider-specific tool schemas in core logic.

版本历史

共 1 个版本

  • v1.0.2 当前
    2026-03-29 19:03 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

developer-tools

Github

steipete
使用 `gh` CLI 与 GitHub 交互,通过 `gh issue`、`gh pr`、`gh run` 和 `gh api` 管理议题、PR、CI 运行及高级查询。
★ 668 📥 323,973

XCrawl

wykings
此技能作为XCrawl的默认入口,用于直接发起请求,支持单URL抓取、格式选择、同步或异步执行及JSON处理。
★ 1 📥 2,539
developer-tools

CodeConductor.ai

larsonreever
AI驱动平台,提供快速全栈开发、智能体、工作流自动化及低代码AI集成的可扩展产品创建。
★ 66 📥 179,996