agent data_extraction skill risk: low

Defuddle CLI Markdown Web Extraction

Instructs the model to use Defuddle CLI to extract clean markdown from web page URLs instead of WebFetch, including when to use it, installation steps, command examples, output for…

External action: medium

SKILL 1 file

SKILL.md

Download

---
name: defuddle
description: "Extract clean markdown content from web pages using Defuddle CLI, removing clutter and navigation to save tokens. Use instead of WebFetch when the user provides a URL to read or analyze, for online documentation, articles, blog posts, or any standard web page."
---
# Defuddle

Use Defuddle CLI to extract clean readable content from web pages. Prefer over WebFetch for standard web pages — it removes navigation, ads, and clutter, reducing token usage.

## When to Use
- Use when the user provides a normal webpage URL to read, summarize, or analyze.
- Prefer it over noisy page-fetch approaches when token efficiency matters.
- Use for docs, articles, blog posts, and similar public web content.

If not installed: `npm install -g defuddle`

## Usage

Always use `--md` for markdown output:

```bash
defuddle parse <url> --md
```

Save to file:

```bash
defuddle parse <url> --md -o content.md
```

Extract specific metadata:

```bash
defuddle parse <url> -p title
defuddle parse <url> -p description
defuddle parse <url> -p domain
```

## Output formats

| Flag | Format |
|------|--------|
| `--md` | Markdown (default choice) |
| `--json` | JSON with both HTML and markdown |
| (none) | HTML |
| `-p <name>` | Specific metadata property |

## Limitations
- Use this skill only when the task clearly matches the scope described above.
- Do not treat the output as a substitute for environment-specific validation, testing, or expert review.
- Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing.

INPUTS

<url> REQUIRED: URL of the web page to parse

REQUIRED CONTEXT

webpage URL

ROLES & RULES

Prefer over WebFetch for standard web pages
Always use `--md` for markdown output
Use this skill only when the task clearly matches the scope described above
Do not treat the output as a substitute for environment-specific validation, testing, or expert review
Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing

EXPECTED OUTPUT

Format

markdown

Constraints

always use --md for markdown output
prefer over WebFetch for standard web pages

EXAMPLES

Includes multiple bash command examples for parsing URLs with different flags, saving output, and extracting metadata.

CAVEATS

Dependencies

Requires Defuddle CLI installation via npm if not present

Missing context

How to integrate or pass the extracted content onward
Tool version or compatibility constraints

Ambiguities

The phrase 'any standard web page' is broad and undefined.
Does not specify handling for command failures or non-zero exits.

QUALITY

OVERALL: 0.76
CLARITY: 0.82
SPECIFICITY: 0.78
REUSABILITY: 0.80
COMPLETENESS: 0.68

IMPROVEMENT SUGGESTIONS

Add an explicit error-handling subsection after the Usage examples.
Replace the vague 'any standard web page' with a short list of included/excluded page types.

USAGE

Copy the prompt above and paste it into your AI of choice — Claude, ChatGPT, Gemini, or anywhere else you're working. Replace any placeholder sections with your own context, then ask for the output.