firecrawl-parse

by @firecrawlv
4.5(120)

"firecrawl parse" converts local documents (such as PDF, DOCX, and HTML) into clean Markdown. It can also produce an AI-generated summary or answer a question about the document, which helps with pulling out key information quickly. It suits automated document-processing and data-analysis workflows.

Tags: document-parsing, markdown-conversion, ai-summary, data-extraction, cli-tool
Installation
git clone https://github.com/firecrawl/cli.git
Before / After Comparison

Before

Manually extracting key information from many local documents (PDF, DOCX), or converting them into AI-ready text, is slow and error-prone. The resulting formatting is often messy, which makes the data harder to use.

After

Local documents are converted to clean Markdown automatically, with optional AI summaries and Q&A. This shortens document processing and yields cleaner text for downstream AI applications.

SKILL.md

firecrawl parse

Turn a local document into clean markdown on disk. Supports PDF, DOCX, DOC, ODT, RTF, XLSX, XLS, HTML/HTM/XHTML.

When to use

  • You have a file on disk (not a URL) and want its text as markdown
  • User drops a PDF/DOCX and asks what it says, or to summarize it
  • Use scrape instead when the source is a URL

Quick start

Always save to .firecrawl/ with -o — parsed docs can be hundreds of KB and blow up context if streamed to stdout. Add .firecrawl/ to .gitignore.

mkdir -p .firecrawl

# File → markdown
firecrawl parse ./paper.pdf -o .firecrawl/paper.md

# AI summary
firecrawl parse ./paper.pdf -S -o .firecrawl/paper-summary.md

# Ask a question about the doc
firecrawl parse ./paper.pdf -Q "What are the main conclusions?" \
  -o .firecrawl/paper-qa.md

Then inspect the file with head, grep, rg, etc., or read it incrementally. Don't load the whole thing at once.
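A minimal sketch of that incremental reading, using only standard tools. The helper names (preview, outline, window) are hypothetical and not part of the firecrawl CLI; the paths in the usage comments assume the quick-start output above.

```shell
# preview FILE [N]: print the byte size, then the first N lines (default 40).
preview() {
  wc -c < "$1"
  head -n "${2:-40}" "$1"
}

# outline FILE: list lines that start with '#', with line numbers.
# These are usually markdown headings, though '#' comments inside
# code blocks will match too.
outline() {
  grep -n '^#' "$1" || true   # no match is not treated as an error
}

# window FILE START END: print only lines START..END.
window() {
  sed -n "${2},${3}p" "$1"
}

# Usage (paths are illustrative):
#   preview .firecrawl/paper.md 20
#   outline .firecrawl/paper.md
#   window  .firecrawl/paper.md 120 180
```

Running outline first gives line numbers to feed into window, so only the relevant section is ever read.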

Options

| Option | Description |
| --- | --- |
| -S, --summary | AI-generated summary |
| -Q, --query <prompt> | Ask a question about the parsed content |
| -o, --output <path> | Output file path (always use this) |
| -f, --format <fmt> | markdown (default), html, summary |
| --timeout <ms> | Timeout for the parse job |
| --timing | Show request duration |

Tips

  • Quote paths with spaces: firecrawl parse "./My Doc.pdf" -o .firecrawl/mydoc.md.
  • Max upload size: 50 MB per file.
  • Credits: roughly 1 per PDF page; an HTML file costs a flat 1.
  • Check .firecrawl/ before re-parsing the same file.
  • Before batch processing and similar workflows, check your credit balance with the firecrawl credit-usage command.
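The "check .firecrawl/ before re-parsing" tip can be sketched as a small wrapper. This is an illustration under stated assumptions, not part of the CLI: parse_once is a hypothetical helper name, and both the output naming scheme (.firecrawl/<basename>.md) and the freshness rule (reuse unless the source is newer) are choices made for this example. It relies only on the firecrawl parse ... -o form shown in the quick start.

```shell
# parse_once SRC: parse SRC into .firecrawl/<name>.md unless a usable copy exists.
# Hypothetical helper; naming scheme and freshness rule are assumptions.
parse_once() {
  local src="$1"
  local name
  name="$(basename "$src")"   # "./My Doc.pdf" -> "My Doc.pdf"
  name="${name%.*}"           # drop the extension -> "My Doc"
  local out=".firecrawl/${name}.md"

  # Reuse the earlier result if it is non-empty and the source
  # has not been modified since it was written.
  if [ -s "$out" ] && ! [ "$src" -nt "$out" ]; then
    echo "cached: $out"
    return 0
  fi

  mkdir -p .firecrawl
  firecrawl parse "$src" -o "$out" && echo "parsed: $out"
}

# Usage (file name is illustrative):
#   parse_once ./paper.pdf   # first call parses and spends credits
#   parse_once ./paper.pdf   # second call reuses .firecrawl/paper.md
```

Because the path is always quoted, file names with spaces work without extra handling. Two sources with the same base name but different extensions would collide under this naming scheme, so adjust it if that can happen.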

Statistics

Installs: 14.8K
Rating: 4.5 / 5.0
Version
Updated: May 23, 2026
Comparisons: 1

User Rating

4.5(120)
5 stars: 37%
4 stars: 43%
3 stars: 13%
2 stars: 5%
1 star: 2%

Compatible Platforms

🤖claude-code

Timeline

Created: May 8, 2026
Last Updated: May 23, 2026