← 返回
数据分析 中文

llms.txt File Builder

Generate, validate, and optimize llms.txt files for AI crawler accessibility. Creates structured markdown files that help AI platforms (ChatGPT, Perplexity,...
生成、验证并优化 llms.txt 文件,以提升 AI 爬虫的可访问性。创建结构化的 Markdown 文件,帮助 AI 平台(如 ChatGPT、Perplexity 等)更好地索引和理解内容。
geoly-geo
数据分析 clawhub v1.0.1 1 版本 100000 Key: 无需
★ 0
Stars
📥 807
下载
💾 59
安装
1
版本
#ai-crawl#geo#latest#llms-txt

概述

llms.txt File Builder

> Methodology by GEOly AI (geoly.ai) — GEO infrastructure for the AI search era.

Generate well-structured llms.txt files to help AI platforms understand and cite your content.

Quick Start

Generate an llms.txt file for any website:

python scripts/generate_llms_txt.py <domain> [--output llms.txt]

Example:

python scripts/generate_llms_txt.py example.com --output llms.txt

What is llms.txt?

The llms.txt standard helps AI crawlers understand:

  • What your brand/company does
  • Which pages contain the most valuable information
  • How content is organized (products, docs, blog, etc.)
  • Where to find key facts and data

Full standard details: See references/standard.md

Standard Format

# [Brand Name]

> [One-sentence brand description]

[2-3 paragraph overview: what you do, who it's for, key differentiators]

## Key Pages

- [Page Title](URL): One-line description
- [Page Title](URL): One-line description

## Products / Services

- [Product Name](URL): What it does and who it's for

## Documentation

- [Doc Title](URL): What this doc explains

## Blog / Resources

- [Article Title](URL): Key insight or topic covered

## About

- [About Us](URL): Company background and mission
- [Contact](URL): How to reach the team

Generation Methods

Method 1: From Sitemap (Automated)

python scripts/generate_llms_txt.py example.com --from-sitemap

Automatically fetches sitemap.xml, analyzes each page, and generates descriptions.

Method 2: Interactive (Guided)

python scripts/generate_llms_txt.py example.com --interactive

Prompts you for brand info and key URLs, then drafts descriptions.

Method 3: From URL List

python scripts/generate_llms_txt.py example.com --urls urls.txt

Where urls.txt contains one URL per line.

Validation

Validate an existing llms.txt file:

python scripts/validate_llms_txt.py llms.txt

Checks for:

  • Proper markdown structure
  • Valid URLs
  • No duplicate entries
  • Optimal link count (15-40 pages)
  • Factual tone (not promotional)

Quality Criteria

AspectGoodBad
-------------------
Brand description"GEOly AI is a GEO monitoring platform tracking brand visibility across ChatGPT, Perplexity, Gemini.""We are the best AI SEO tool ever!"
Page descriptions"Explains how to set up MCP integration with Claude Desktop""Our awesome docs page"
Link count15–40 curated pages500+ URLs (sitemap dump)
ToneFactual, entity-focusedPromotional, keyword-stuffed
StructureClear sections by content typeFlat list or random order

Output Formats

  • Markdown (default): Ready-to-deploy llms.txt
  • JSON: Structured data for programmatic use
  • HTML: Styled preview for stakeholder review

Advanced Usage

Custom Sections

python scripts/generate_llms_txt.py example.com \
  --sections "Products,API Reference,Case Studies,Changelog"

Exclude Patterns

python scripts/generate_llms_txt.py example.com \
  --exclude "/admin/,/private/,/draft/"

Multi-language Support

python scripts/generate_llms_txt.py example.com \
  --language zh-CN \
  --output llms-zh.txt

Deployment

Once generated, place the file at:

https://[your-domain]/llms.txt

Ensure it:

  • Returns HTTP 200
  • Is accessible without authentication
  • Has Content-Type: text/plain or text/markdown

See Also

版本历史

共 1 个版本

  • v1.0.1 当前
    2026-03-29 23:28 安全 安全

安全检测

腾讯云安全 (Keen)

安全,无风险
查看报告

腾讯云安全 (Sanbu)

安全,无风险
查看报告

🔗 相关推荐

data-analysis

A股量化 AkShare

mbpz
A股量化数据分析工具,基于AkShare库获取A股行情、财务数据、板块信息等。用于回答关于A股股票查询、行情数据、财务分析、选股等问题。
★ 162 📥 59,673
data-analysis

Data Analysis

ivangdavila
{"answer":"数据分析与可视化。查询数据库、生成报告、自动化电子表格,将原始数据转化为清晰可行的见解。适用于:(1) 您……"}
★ 198 📥 64,857
content-creation

Schema Markup Generator

geoly-geo
为任何内容类型生成完整、经验证的Schema.org JSON-LD标记,提升AI引用率。支持为组织、FAQPage、文章等多种类型创建结构化数据。
★ 1 📥 801