agent data_extraction skill risk: low
Defuddle CLI Markdown Web Extraction
Instructs the model to use Defuddle CLI to extract clean markdown from web page URLs instead of WebFetch, including when to use it, installation steps, command examples, output for…
- External action: medium
SKILL 1 file
SKILL.md
---
name: defuddle
description: "Extract clean markdown content from web pages using Defuddle CLI, removing clutter and navigation to save tokens. Use instead of WebFetch when the user provides a URL to read or analyze, for online documentation, articles, blog posts, or any standard web page."
---

# Defuddle

Use Defuddle CLI to extract clean readable content from web pages. Prefer over WebFetch for standard web pages: it removes navigation, ads, and clutter, reducing token usage.

## When to Use

- Use when the user provides a normal webpage URL to read, summarize, or analyze.
- Prefer it over noisy page-fetch approaches when token efficiency matters.
- Use for docs, articles, blog posts, and similar public web content.

If not installed: `npm install -g defuddle`

## Usage

Always use `--md` for markdown output:

```bash
defuddle parse <url> --md
```

Save to file:

```bash
defuddle parse <url> --md -o content.md
```

Extract specific metadata:

```bash
defuddle parse <url> -p title
defuddle parse <url> -p description
defuddle parse <url> -p domain
```

## Output formats

| Flag | Format |
|------|--------|
| `--md` | Markdown (default choice) |
| `--json` | JSON with both HTML and markdown |
| (none) | HTML |
| `-p <name>` | Specific metadata property |

## Limitations

- Use this skill only when the task clearly matches the scope described above.
- Do not treat the output as a substitute for environment-specific validation, testing, or expert review.
- Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing.
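The install check and the `--md` commands from the skill can be combined into one small shell sketch. The helper names `ensure_defuddle` and `extract_markdown` are hypothetical and not part of Defuddle; only the `defuddle parse <url> --md [-o file]` and `npm install -g defuddle` commands come from the skill text.

```bash
# Sketch: make sure the defuddle CLI is available, then extract markdown.
# ensure_defuddle and extract_markdown are hypothetical helper names.

ensure_defuddle() {
  # command -v succeeds when defuddle is already available
  if ! command -v defuddle >/dev/null 2>&1; then
    # Install step taken from the skill text
    npm install -g defuddle
  fi
}

extract_markdown() {
  # $1: URL of the page, $2: optional output file
  local url="$1" out="${2:-}"
  ensure_defuddle || return 1
  if [ -n "$out" ]; then
    defuddle parse "$url" --md -o "$out"
  else
    defuddle parse "$url" --md
  fi
}
```

For example, `extract_markdown "https://example.com/post" content.md` would save the page as markdown, while omitting the second argument prints it to standard output.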
INPUTS
- <url> REQUIRED
URL of the web page to parse
REQUIRED CONTEXT
- webpage URL
ROLES & RULES
- Prefer over WebFetch for standard web pages
- Always use `--md` for markdown output
- Use this skill only when the task clearly matches the scope described above
- Do not treat the output as a substitute for environment-specific validation, testing, or expert review
- Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing
EXPECTED OUTPUT
- Format
  - markdown
- Constraints
  - always use --md for markdown output
  - prefer over WebFetch for standard web pages
EXAMPLES
Includes multiple bash command examples for parsing URLs with different flags, saving output, and extracting metadata.
CAVEATS
- Dependencies
  - Requires Defuddle CLI installation via npm if not present
- Missing context
  - How to integrate or pass the extracted content onward
  - Tool version or compatibility constraints
- Ambiguities
  - The phrase 'any standard web page' is broad and undefined.
  - Does not specify handling for command failures or non-zero exits.
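Since the skill does not say what to do when the command fails, one possible approach is sketched below. It assumes that `defuddle` signals failure with a non-zero exit status, which the skill does not confirm, and it also treats empty output as a failure. `fetch_markdown` is a hypothetical helper name.

```bash
# Sketch: run defuddle and treat a non-zero exit or empty output as failure.
# Assumption: defuddle exits non-zero on error (not stated in the skill).
fetch_markdown() {
  local url="$1" content status
  # Capture the markdown; discard defuddle's own error output
  content="$(defuddle parse "$url" --md 2>/dev/null)"
  status=$?
  if [ "$status" -ne 0 ]; then
    echo "defuddle failed for $url (exit $status)" >&2
    return "$status"
  fi
  if [ -z "$content" ]; then
    echo "defuddle returned no content for $url" >&2
    return 1
  fi
  # Success: emit the markdown so the caller can pipe or redirect it
  printf '%s\n' "$content"
}
```

A caller could pass the content onward with `fetch_markdown "$url" > content.md`, and fall back to another fetch method when the function returns a non-zero status.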
QUALITY
- Overall: 0.76
- Clarity: 0.82
- Specificity: 0.78
- Reusability: 0.80
- Completeness: 0.68
IMPROVEMENT SUGGESTIONS
- Add an explicit error-handling subsection after the Usage examples.
- Replace the vague 'any standard web page' with a short list of included/excluded page types.
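To illustrate what an included/excluded list might look like in practice, here is a minimal sketch of a scope check. The specific rules (only `http`/`https` URLs, and skipping a handful of binary or media file extensions) are illustrative assumptions, not something the skill defines, and `is_standard_page` is a hypothetical helper name.

```bash
# Sketch: a hypothetical scope check for "standard web page" URLs.
# The included and excluded patterns are illustrative assumptions only.
is_standard_page() {
  case "$1" in
    http://*|https://*) ;;   # only plain web URLs are in scope
    *) return 1 ;;
  esac
  # Strip the query string and fragment before checking the extension
  local path="${1%%[?#]*}"
  case "$path" in
    *.pdf|*.zip|*.png|*.jpg|*.mp4) return 1 ;;   # binary or media files
  esac
  return 0
}
```

A caller might run `is_standard_page "$url" && defuddle parse "$url" --md` and route anything that fails the check to a different tool.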