defuddle
Defuddle CLI extracts clean markdown content from web pages, removing clutter and navigation to save tokens. It's ideal for processing online documentation, articles, and blog posts, enhancing efficiency for AI analysis or reading.
git clone https://github.com/kepano/obsidian-skills.gitBefore / After Comparison
1 组When processing web content using traditional methods or direct scraping, a significant amount of irrelevant information (e.g., ads, navigation, footers) is included. This degrades the reading experience and consumes excessive AI tokens, increasing costs and processing time.
Defuddle CLI automatically cleans web pages, extracting only the core content and outputting it as Markdown. This dramatically improves reading efficiency and significantly reduces the number of tokens required for AI processing, cutting costs and accelerating analysis.
Defuddle
Use Defuddle CLI to extract clean readable content from web pages. Prefer over WebFetch for standard web pages — it removes navigation, ads, and clutter, reducing token usage.
If not installed: npm install -g defuddle
Usage
Always use --md for markdown output:
defuddle parse <url> --md
Save to file:
defuddle parse <url> --md -o content.md
Extract specific metadata:
defuddle parse <url> -p title
defuddle parse <url> -p description
defuddle parse <url> -p domain
Output formats
| Flag | Format |
|---|---|
--md | Markdown (default choice) |
--json | JSON with both HTML and markdown |
| (none) | HTML |
-p <name> | Specific metadata property |
User Reviews (0)
Write a Review
No reviews yet
Statistics
User Rating
Rate this Skill