
firecrawl-crawl

by @firecrawl
Rating: 4.7 (108)

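Bulk-extract page content from an entire website or from one section of it. Intended for crawling whole sites or documentation sections.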

Tags: web-crawling, data-indexing, firecrawl-api, site-mapping, content-discovery
Installation
npx skills add firecrawl/cli --skill firecrawl-crawl
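Before / After comparison

Before

Bulk-extracting content from an entire website or a specific section means traversing links by hand and writing the scraping logic yourself, which is complicated and prone to missing pages.

After

Firecrawl Crawl bulk-extracts the content of every page on a site or under a specific directory, handling depth limits and link following automatically, so datasets can be built efficiently.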

SKILL.md

firecrawl crawl

Bulk extract content from a website. Crawls pages following links up to a depth/limit.

When to use

  • You need content from many pages on a site (e.g., all /docs/)
  • You want to extract an entire site section
  • Step 4 in the workflow escalation pattern: search → scrape → map → crawl → browser

Quick start

# Crawl a docs section
firecrawl crawl "<url>" --include-paths /docs --limit 50 --wait -o .firecrawl/crawl.json

# Full crawl with depth limit
firecrawl crawl "<url>" --max-depth 3 --wait --progress -o .firecrawl/crawl.json

# Check status of a running crawl
firecrawl crawl <job-id>
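
The first quick-start command can be wrapped in a small function so the same scoped crawl is easy to repeat. This is a minimal sketch, not part of the firecrawl CLI: `crawl_section`, the `DRY_RUN` variable, and the `https://example.com` URL are all hypothetical, and only flags listed in this document are used. By default it prints the command instead of running it.

```shell
# crawl_section URL PATH [LIMIT]
# Hypothetical helper (an assumption, not a firecrawl feature).
# Builds the scoped crawl command from the quick start above.
crawl_section() {
  url=$1
  path=$2
  limit=${3:-50}   # default page limit mirrors the quick-start example

  # Reuse the positional parameters as the argument list so the URL
  # stays a single argument even if it contains special characters.
  set -- firecrawl crawl "$url" --include-paths "$path" --limit "$limit" --wait -o .firecrawl/crawl.json

  if [ "${DRY_RUN:-1}" = "1" ]; then
    # Dry run (the default): show the command without executing it.
    printf '%s\n' "$*"
  else
    # Creating the output directory is harmless if the CLI also creates it.
    mkdir -p .firecrawl
    "$@"
  fi
}

# Prints the command that would run for a 25-page crawl of /docs.
crawl_section "https://example.com" /docs 25
```

Setting `DRY_RUN=0` before the call runs the crawl for real, which consumes credits.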

Options

Option                     Description
--wait                     Wait for crawl to complete before returning
--progress                 Show progress while waiting
--limit <n>                Max pages to crawl
--max-depth <n>            Max link depth to follow
--include-paths <paths>    Only crawl URLs matching these paths
--exclude-paths <paths>    Skip URLs matching these paths
--delay <ms>               Delay between requests
--max-concurrency <n>      Max parallel crawl workers
--pretty                   Pretty print JSON output
-o, --output <path>        Output file path
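
Several of these options are usually combined, for example a page limit with a request delay and low concurrency to keep a crawl slow and narrow. The sketch below assembles such a flag string from optional settings. `build_crawl_flags` and the `CRAWL_*` environment variable names are hypothetical; only flags from the options table above are emitted, and nothing is executed.

```shell
# build_crawl_flags
# Hypothetical helper (an assumption, not a firecrawl feature).
# Reads optional CRAWL_* variables and prints the matching flags.
build_crawl_flags() {
  flags=""
  if [ -n "${CRAWL_LIMIT:-}" ]; then flags="$flags --limit $CRAWL_LIMIT"; fi
  if [ -n "${CRAWL_MAX_DEPTH:-}" ]; then flags="$flags --max-depth $CRAWL_MAX_DEPTH"; fi
  if [ -n "${CRAWL_EXCLUDE:-}" ]; then flags="$flags --exclude-paths $CRAWL_EXCLUDE"; fi
  if [ -n "${CRAWL_DELAY_MS:-}" ]; then flags="$flags --delay $CRAWL_DELAY_MS"; fi
  if [ -n "${CRAWL_CONCURRENCY:-}" ]; then flags="$flags --max-concurrency $CRAWL_CONCURRENCY"; fi
  # Strip the leading space left by the first appended flag.
  printf '%s\n' "${flags# }"
}

# Print the flags for a slow, narrow crawl. The subshell keeps the
# example settings from leaking into the rest of the session.
(
  CRAWL_LIMIT=100
  CRAWL_DELAY_MS=500
  CRAWL_CONCURRENCY=2
  build_crawl_flags
)
```

The result could then be spliced into a command such as `firecrawl crawl "<url>" $(build_crawl_flags) --wait -o .firecrawl/crawl.json`. The substitution is left unquoted on purpose so the flags split into separate words, which means this approach only suits values without spaces.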

Tips

  • Always use --wait when you need the results immediately. Without it, crawl returns a job ID for async polling.
  • Use --include-paths to scope the crawl — don't crawl an entire site when you only need one section.
  • Crawl consumes credits per page. Check firecrawl credit-usage before large crawls.
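
The credit tip above can be turned into a simple guard that runs before a crawl is started. This is a sketch under stated assumptions: `needs_credit_check` is a hypothetical helper, and the 100-page threshold is an arbitrary illustrative value, not a Firecrawl recommendation. The script only prints a reminder and never calls the CLI itself.

```shell
# needs_credit_check PAGES [THRESHOLD]
# Hypothetical helper (an assumption, not a firecrawl feature).
# Succeeds when the planned page count exceeds the threshold.
needs_credit_check() {
  pages=$1
  threshold=${2:-100}   # arbitrary default, chosen for illustration only
  [ "$pages" -gt "$threshold" ]
}

# Planned value for --limit in the upcoming crawl.
PLANNED_PAGES=500

if needs_credit_check "$PLANNED_PAGES"; then
  echo "Large crawl planned ($PLANNED_PAGES pages)."
  echo "Run 'firecrawl credit-usage' before starting it."
else
  echo "Small crawl planned ($PLANNED_PAGES pages)."
fi
```

Because each crawled page consumes credits, tying the check to the same number passed to `--limit` keeps the guard and the crawl in step.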

Statistics

Installs: 43.5K
Rating: 4.7 / 5.0
Last updated: May 23, 2026
Comparison cases: 1

User ratings

4.7 (108)

Stars    Share
5        23%
4        52%
3        23%
2        2%
1        0%

Compatible platforms

  • Claude Code
  • OpenClaw
  • OpenCode
  • Codex
  • Gemini CLI
  • GitHub Copilot
  • Amp
  • Kimi CLI

Timeline

Created: March 16, 2026
Last updated: May 23, 2026