F
firecrawl-crawl
by @firecrawlv
4.7(108)
批量提取整个网站或网站部分的页面内容,用于爬取站点或文档部分。
安装方式
npx skills add firecrawl/cli --skill firecrawl-crawlcompare_arrows
Before / After 效果对比
1 组使用前
从整个网站或特定部分批量提取内容时,需要手动遍历链接并处理抓取逻辑,过程复杂且容易遗漏。
使用后
使用Firecrawl Crawl批量提取整个网站或特定目录下的所有页面内容,自动处理深度限制和链接跟踪,高效构建数据集。
SKILL.md
firecrawl crawl
Bulk extract content from a website. Crawls pages following links up to a depth/limit.
When to use
- You need content from many pages on a site (e.g., all
/docs/) - You want to extract an entire site section
- Step 4 in the workflow escalation pattern: search → scrape → map → crawl → browser
Quick start
# Crawl a docs section
firecrawl crawl "<url>" --include-paths /docs --limit 50 --wait -o .firecrawl/crawl.json
# Full crawl with depth limit
firecrawl crawl "<url>" --max-depth 3 --wait --progress -o .firecrawl/crawl.json
# Check status of a running crawl
firecrawl crawl <job-id>
Options
| Option | Description |
|---|---|
--wait | Wait for crawl to complete before returning |
--progress | Show progress while waiting |
--limit <n> | Max pages to crawl |
--max-depth <n> | Max link depth to follow |
--include-paths <paths> | Only crawl URLs matching these paths |
--exclude-paths <paths> | Skip URLs matching these paths |
--delay <ms> | Delay between requests |
--max-concurrency <n> | Max parallel crawl workers |
--pretty | Pretty print JSON output |
-o, --output <path> | Output file path |
Tips
- Always use
--waitwhen you need the results immediately. Without it, crawl returns a job ID for async polling. - Use
--include-pathsto scope the crawl — don't crawl an entire site when you only need one section. - Crawl consumes credits per page. Check
firecrawl credit-usagebefore large crawls.
See also
- firecrawl-scrape — scrape individual pages
- firecrawl-map — discover URLs before deciding to crawl
- firecrawl-download — download site to local files (uses map + scrape)
用户评价 (0)
发表评价
效果
易用性
文档
兼容性
暂无评价
统计数据
安装量43.5K
评分4.7 / 5.0
版本
更新日期2026年5月23日
对比案例1 组
用户评分
4.7(108)
5
23%
4
52%
3
23%
2
2%
1
0%
为此 Skill 评分
0.0
兼容平台
🔧Claude Code
🔧OpenClaw
🔧OpenCode
🔧Codex
🔧Gemini CLI
🔧GitHub Copilot
🔧Amp
🔧Kimi CLI
时间线
创建2026年3月16日
最后更新2026年5月23日