← 返回
开发者工具

RSS & Atom Feed Generator

Generate valid RSS 2.0 or Atom 1.0 feeds from web pages that contain post lists but lack a native feed. Triggers on phrases like "generate a feed for", "crea...
为缺少原生订阅源的文章列表网页生成 RSS 2.0 或 Atom 1.0 订阅源,支持"生成订阅源"等触发短语。
kidehen
开发者工具 clawhub v1.0.0 1 版本 100000 Key: 无需
★ 0
Stars
📥 616
下载
💾 7
安装
1
版本
#latest

概述

RSS / Atom Feed Generator (v1.0.0)

Generates valid RSS 2.0 or Atom 1.0 XML feeds from any web page that

lists posts, articles, or news items but does not publish its own feed.


Defaults & Settings

ParameterDefault value
-----------------------------------------------------------
Preferred formatRSS 2.0 (Atom 1.0 on request)
Max items20 (most-recent first)
Date fallbackPage scrape date (ISO 8601, UTC)
DescriptionFirst 280 chars of post excerpt/lede
EncodingUTF-8
Output modesRaw XML · HTML page · Both
Interaction styleFriendly and professional

Trigger Phrases

#User says …Action
-------------------------------------------------------------------------------------------
T1"Generate an RSS/Atom feed for {url}"Fetch page → detect posts → emit XML
T2"This page has no feed: {url}"Same as T1
T3"Create a feed from {url}"Same as T1
T4"Convert {url} posts to RSS"Same as T1
T5"Make a self-hostable feed for {url}"Emit XML + HTML wrapper
T6"Update / refresh the feed for {url}"Re-scrape and re-emit

Order of Operations

  1. Page Fetch — Retrieve the target URL by invoking the WEB_FETCH function

(defined in https://linkeddata.uriburner.com/chat/functions/openapi.yaml,

endpoint /WEB_FETCH). Use whichever available protocol applies — REST, MCP,

OPAL, or curl. Required parameter: url. Optional: headers, max_redirects,

timeout_seconds. WEB_FETCH retrieves the page just like a web browser and

returns the full page content for subsequent processing. See

protocol-routing.md for exact invocation

patterns per protocol.

  1. Feed Discovery — Check tags. If a feed already

exists, report it and stop (offer to proxy/mirror instead).

  1. Post Extraction — Use the heuristics in references/extraction-rules.md

to identify post entries (title, URL, date, author, summary).

  1. Feed Assembly — Build XML using the templates in

references/feed-templates.md. Validate structure against the checklist

in references/validation-checklist.md.

  1. Output — Present the feed XML in a code block; optionally wrap in the

HTML discovery page template from references/html-wrapper-template.md.

  1. Download Link — Save the .xml file to /mnt/user-data/outputs/ and

present it with present_files.


Extraction Heuristics (summary — full rules in references/)

Structural signals to look for

  • Repeated
    ,
  • , or
    blocks with consistent class names

(e.g., post, entry, article, blog-post, card)

Date handling

SituationStrategy
-----------------------------------------------------------------------
Use ISO value directly
Human-readable date textParse with locale awareness; emit RFC 822
No date foundUse today's date (UTC) with a comment
Relative ("3 days ago")Calculate from scrape time

URL normalisation

All item values must be absolute. Resolve against the page's

if present, otherwise against the origin of the page URL.


Feed Format Specs

RSS 2.0 required elements

<rss version="2.0">
  <channel>
    <title>, <link>, <description>   ← required channel fields
    <item>
      <title>, <link>, <guid>        ← required per item
      <pubDate>                      ← RFC 822 (e.g. Mon, 01 Jan 2024 00:00:00 +0000)
      <description>                  ← plain text or CDATA-wrapped HTML

Atom 1.0 required elements

<feed xmlns="http://www.w3.org/2005/Atom">
  <title>, <id>, <updated>           ← required feed fields
  <entry>
    <title>, <id>, <updated>         ← required per entry
    <link href="…" rel="alternate"/> ← post URL
    <summary> or <content>           ← excerpt or full body

Full templates → references/feed-templates.md


Output Modes

ModeDescription
-----------------------------------------------------------------------------------
xmlRaw feed XML in a fenced code block
fileSave to /mnt/user-data/outputs/-feed.xml and present link
htmlHTML discovery page with + embedded XML
bothfile + html wrapper saved as -feed-page.html

Default: file (saves XML and presents download link).


Error Handling

ProblemResponse
-----------------------------------------------------------------------------------------------
Page returns non-200Report HTTP status; suggest checking URL or auth
No repeating post structure foundShow raw HTML skeleton; ask user to identify the pattern
Feed already existsReport the existing feed URL; offer to mirror/augment
Dates unparseableUse today's date; flag items with
Relative URLs unresolvableAsk user for the site's base URL

Commands

CommandDescription
--------------------------------------------------------------------------------
/helpUsage guidance for this skill
`/format [rssatom]`Override output format
/limit [n]Set maximum number of feed items
/fulltextAttempt to embed full post body (fetches each post URL)
/validateRun checklist from references/validation-checklist.md
/previewShow first 3 items as formatted Markdown before XML

Operational Rules

  1. Always check for an existing feed before generating a synthetic one.
  2. Never fabricate post content — only use text found on the page.
  3. Escape &, <, > in text nodes; use CDATA for HTML description bodies.
  4. All / Atom values must be the canonical post URL.
  5. Sort items newest-first by default (pubDate DESC).
  6. Do not request or store credentials; only scrape publicly accessible pages.
  7. Clearly label generated feeds as synthetic (add tag).
  8. Respect robots.txt — do not scrape pages that disallow crawlers.

版本历史

共 1 个版本

  • v1.0.0 当前
    2026-03-19 22:14 安全 安全

🔗 相关推荐

ai-intelligence

Virtuoso Product Support

kidehen
为OpenLink Virtuoso服务器提供技术支持与数据库管理,包括RDF视图生成、SPARQL查询及全面数据库操作。
★ 0 📥 1,100
developer-tools

Gog

steipete
Google Workspace 命令行工具,支持 Gmail、日历、云端硬盘、通讯录、表格和文档。
★ 921 📥 185,755
developer-tools

CodeConductor.ai

larsonreever
AI驱动平台,提供快速全栈开发、智能体、工作流自动化及低代码AI集成的可扩展产品创建。
★ 66 📥 179,936