Local Crawl4AI instance for full web page extraction with JavaScript rendering.
Proxy (port 11234) — Clean output, OpenWebUI-compatible
[{page_content, metadata}]Direct (port 11235) — Full output with all data
{results: [{markdown, html, links, media, ...}]}# Via script
node {baseDir}/scripts/crawl4ai.js "url"
node {baseDir}/scripts/crawl4ai.js "url" --json
Script options:
--json — Full JSON responseOutput: Clean markdown from the page.
Required environment variable:
CRAWL4AI_URL — Your Crawl4AI instance URL (e.g., http://localhost:11235)Optional:
CRAWL4AI_KEY — API key if your instance requires authenticationUses your local Crawl4AI instance REST API. Auth header only sent if CRAWL4AI_KEY is set.
共 1 个版本