首页/AI 智能体外部交互/firecrawl-build-scrape
F

firecrawl-build-scrape

by @firecrawlv
4.6(11)

从已知URL提取完整网页内容,支持检索增强、摘要生成和内容监控

web-scrapingdata-extractioncontent-creationapi-integrationautomationGitHub
安装方式
npx skills add firecrawl/skills --skill firecrawl-build-scrape
compare_arrows

Before / After 效果对比

1
使用前

手动配置爬虫或使用浏览器开发者工具复制HTML,再解析提取文本内容,一个页面需要10-15分钟

使用后

输入URL自动提取结构化内容,处理动态渲染和反爬机制,30秒获取干净文本和元数据

SKILL.md

firecrawl-build-scrape

Firecrawl Build Scrape

Use this when the application already has the URL and needs content from one page.

Use This When

  • the feature starts from a known URL

  • you need page content for retrieval, summarization, enrichment, or monitoring

  • you want the default extraction primitive before considering /interact

Default Recommendations

  • Return markdown unless the feature truly needs another format.

  • Use onlyMainContent for article-like pages where nav and chrome add noise.

  • Add waits or other rendering options only when the page needs them.

Common Product Patterns

  • knowledge ingestion from known URLs

  • enrichment from a company, product, or docs page

  • pricing, changelog, and documentation extraction

  • page-level quality checks or monitoring

Escalation Rules

Implementation Notes

  • Keep the integration narrow: one feature, one URL, one extraction contract.

  • Treat /scrape as the default primitive for downstream LLM or indexing pipelines.

  • Request richer formats only when the consumer needs them, such as links, screenshots, or branding data.

Docs (Source of Truth)

Read the source-of-truth page for your project language before writing integration code:

See Also

Weekly Installs1.6KRepositoryfirecrawl/skillsGitHub Stars2First Seen5 days agoSecurity AuditsGen Agent Trust HubPassSocketPassSnykWarnInstalled onopencode1.6Kcodex1.6Kantigravity1.6Kclaude-code1.6Kgithub-copilot1.6Kamp1.6K

用户评价 (0)

发表评价

效果
易用性
文档
兼容性

暂无评价

统计数据

安装量26.9K
评分4.6 / 5.0
版本
更新日期2026年5月23日
对比案例1 组

用户评分

4.6(11)
5
55%
4
36%
3
9%
2
0%
1
0%

为此 Skill 评分

0.0

兼容平台

🔧Claude Code

时间线

创建2026年4月14日
最后更新2026年5月23日