F

firecrawl

by @firecrawlv
4.7(952)

Firecrawl技能提供命令行接口和代理能力,支持网页抓取、内容搜索等功能,帮助用户高效获取和处理网络数据,是后端数据处理的利器。

web-scrapingdata-extractionapi-developmentbackend-servicescli-toolsGitHub
安装方式
npx skills add firecrawl/cli --skill firecrawl
compare_arrows

Before / After 效果对比

1
使用前

手动编写爬虫程序复杂,数据抓取效率低下。难以快速获取网页内容,影响数据分析和信息检索。

使用后

智能辅助Firecrawl数据抓取,简化操作。显著提升数据获取效率,加速信息检索和内容分析。

SKILL.md

firecrawl

Firecrawl CLI

Web scraping, search, and browser automation CLI. Returns clean markdown optimized for LLM context windows.

Run firecrawl --help or firecrawl <command> --help for full option details.

Prerequisites

Must be installed and authenticated. Check with firecrawl --status.

  🔥 firecrawl cli v1.8.0

  ● Authenticated via FIRECRAWL_API_KEY
  Concurrency: 0/100 jobs (parallel scrape limit)
  Credits: 500,000 remaining

  • Concurrency: Max parallel jobs. Run parallel operations up to this limit.

  • Credits: Remaining API credits. Each scrape/crawl consumes credits.

If not ready, see rules/install.md. For output handling guidelines, see rules/security.md.

firecrawl search "query" --scrape --limit 3

Workflow

Follow this escalation pattern:

  • Search - No specific URL yet. Find pages, answer questions, discover sources.

  • Scrape - Have a URL. Extract its content directly.

  • Map + Scrape - Large site or need a specific subpage. Use map --search to find the right URL, then scrape it.

  • Crawl - Need bulk content from an entire site section (e.g., all /docs/).

  • Browser - Scrape failed because content is behind interaction (pagination, modals, form submissions, multi-step navigation).

Need Command When

Find pages on a topic search No specific URL yet

Get a page's content scrape Have a URL, page is static or JS-rendered

Find URLs within a site map Need to locate a specific subpage

Bulk extract a site section crawl Need many pages (e.g., all /docs/)

AI-powered data extraction agent Need structured data from complex sites

Interact with a page browser Content requires clicks, form fills, pagination, or login

Download a site to files download Save an entire site as local files

For detailed command reference, use the individual skill for each command (e.g., firecrawl-search, firecrawl-browser) or run firecrawl <command> --help.

Scrape vs browser:

  • Use scrape first. It handles static pages and JS-rendered SPAs.

  • Use browser when you need to interact with a page, such as clicking buttons, filling out forms, navigating through a complex site, infinite scroll, or when scrape fails to grab all the content you need.

  • Never use browser for web searches - use search instead.

Avoid redundant fetches:

  • search --scrape already fetches full page content. Don't re-scrape those URLs.

  • Check .firecrawl/ for existing data before fetching again.

Output & Organization

Unless the user specifies to return in context, write results to .firecrawl/ with -o. Add .firecrawl/ to .gitignore. Always quote URLs - shell interprets ? and & as special characters.

firecrawl search "react hooks" -o .firecrawl/search-react-hooks.json --json
firecrawl scrape "<url>" -o .firecrawl/page.md

Naming conventions:

.firecrawl/search-{query}.json
.firecrawl/search-{query}-scraped.json
.firecrawl/{site}-{path}.md

Never read entire output files at once. Use grep, head, or incremental reads:

wc -l .firecrawl/file.md && head -50 .firecrawl/file.md
grep -n "keyword" .firecrawl/file.md

Single format outputs raw content. Multiple formats (e.g., --format markdown,links) output JSON.

Working with Results

These patterns are useful when working with file-based output (-o flag) for complex tasks:

# Extract URLs from search
jq -r '.data.web[].url' .firecrawl/search.json

# Get titles and URLs
jq -r '.data.web[] | "\(.title): \(.url)"' .firecrawl/search.json

Parallelization

Run independent operations in parallel. Check firecrawl --status for concurrency limit:

firecrawl scrape "<url-1>" -o .firecrawl/1.md &
firecrawl scrape "<url-2>" -o .firecrawl/2.md &
firecrawl scrape "<url-3>" -o .firecrawl/3.md &
wait

For browser, launch separate sessions for independent tasks and operate them in parallel via --session <id>.

Credit Usage

firecrawl credit-usage
firecrawl credit-usage --json --pretty -o .firecrawl/credits.json

Weekly Installs14.5KRepositoryfirecrawl/cliGitHub Stars190First SeenJan 21, 2026Security AuditsGen Agent Trust HubPassSocketPassSnykWarnInstalled onopencode12.9Kcodex12.9Kgemini-cli12.8Kgithub-copilot12.3Kkimi-cli11.7Kamp11.7K

用户评价 (0)

发表评价

效果
易用性
文档
兼容性

暂无评价

统计数据

安装量58.9K
评分4.7 / 5.0
版本
更新日期2026年5月23日
对比案例1 组

用户评分

4.7(952)
5
36%
4
49%
3
14%
2
1%
1
0%

为此 Skill 评分

0.0

兼容平台

🔧Claude Code
🔧OpenClaw
🔧OpenCode
🔧Codex
🔧Gemini CLI
🔧GitHub Copilot
🔧Amp
🔧Kimi CLI

时间线

创建2026年3月17日
最后更新2026年5月23日