Skip to main content
NEW · APP STORE Now on iOS · macOS · iPad Android & Windows soon GET IT
Prompts SEO Sitemap Analyzer and Generator

agent marketing skill risk: low

SEO Sitemap Analyzer and Generator

Defines two modes for analyzing an existing sitemap against validation checks, quality signals, and common issues, or generating a new sitemap via business-type detection, template…

SKILL 2 files

SKILL.md
---
name: seo-sitemap
description: ")"
---
# Sitemap Analysis & Generation

## Mode 1: Analyze Existing Sitemap

### Validation Checks
- Valid XML format
- URL count <50,000 per file (protocol limit)
- All URLs return HTTP 200
- `<lastmod>` dates are accurate (not all identical)
- No deprecated tags: `<priority>` and `<changefreq>` are ignored by Google
- Sitemap referenced in robots.txt
- Compare crawled pages vs sitemap; flag missing pages

### Quality Signals
- Sitemap index file if >50k URLs
- Split by content type (pages, posts, images, videos)
- No non-canonical URLs in sitemap
- No noindexed URLs in sitemap
- No redirected URLs in sitemap
- HTTPS URLs only (no HTTP)

### Common Issues
| Issue | Severity | Fix |
|-------|----------|-----|
| >50k URLs in single file | Critical | Split with sitemap index |
| Non-200 URLs | High | Remove or fix broken URLs |
| Noindexed URLs included | High | Remove from sitemap |
| Redirected URLs included | Medium | Update to final URLs |
| All identical lastmod | Low | Use actual modification dates |
| Priority/changefreq used | Info | Can remove (ignored by Google) |

## Mode 2: Generate New Sitemap

### Process
1. Ask for business type (or auto-detect from existing site)
2. Load industry template from `../seo-plan/assets/` directory
3. Interactive structure planning with user
4. Apply quality gates:
   - ⚠️ WARNING at 30+ location pages (require 60%+ unique content)
   - 🛑 HARD STOP at 50+ location pages (require justification)
5. Generate valid XML output
6. Split at 50k URLs with sitemap index
7. Generate STRUCTURE.md documentation

### Safe Programmatic Pages (OK at scale)
✅ Integration pages (with real setup docs)
✅ Template/tool pages (with downloadable content)
✅ Glossary pages (200+ word definitions)
✅ Product pages (unique specs, reviews)
✅ User profile pages (user-generated content)

### Penalty Risk (avoid at scale)
❌ Location pages with only city name swapped
❌ "Best [tool] for [industry]" without industry-specific value
❌ "[Competitor] alternative" without real comparison data
❌ AI-generated pages without human review and unique value

## Sitemap Format

### Standard Sitemap
```xml
<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>https://example.com/page</loc>
    <lastmod>2026-02-07</lastmod>
  </url>
</urlset>
```

### Sitemap Index (for >50k URLs)
```xml
<?xml version="1.0" encoding="UTF-8"?>
<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <sitemap>
    <loc>https://example.com/sitemap-pages.xml</loc>
    <lastmod>2026-02-07</lastmod>
  </sitemap>
  <sitemap>
    <loc>https://example.com/sitemap-posts.xml</loc>
    <lastmod>2026-02-07</lastmod>
  </sitemap>
</sitemapindex>
```

## Error Handling

- **URL unreachable**: Report the HTTP status code and suggest checking if the site is live
- **No sitemap found**: Check common locations (/sitemap.xml, /sitemap_index.xml, robots.txt reference) before reporting "not found"
- **Invalid XML format**: Report specific parsing errors with line numbers
- **Rate limiting detected**: Back off and report partial results with a note about retry timing

## Output

### For Analysis
- `VALIDATION-REPORT.md`: analysis results
- Issues list with severity
- Recommendations

### For Generation
- `sitemap.xml` (or split files with index)
- `STRUCTURE.md`: site architecture documentation
- URL count and organization summary

REQUIRED CONTEXT

  • existing sitemap URL or site URL for analysis
  • business type or site content for generation

EXPECTED OUTPUT

Format
markdown
Schema
markdown_files · VALIDATION-REPORT.md, Issues list with severity, Recommendations, sitemap.xml, STRUCTURE.md, URL count and organization summary
Constraints
  • produce VALIDATION-REPORT.md or sitemap.xml plus STRUCTURE.md
  • include severity-labeled issues for analysis mode
  • output valid XML for generation mode

SUCCESS CRITERIA

  • Perform all validation checks
  • Apply quality gates
  • Generate valid XML output
  • Split at 50k URLs with index when needed
  • Generate STRUCTURE.md documentation

FAILURE MODES

  • May include non-canonical or noindexed URLs
  • May fail to split files over 50k URLs
  • May generate low-value location pages at scale

CAVEATS

Dependencies
  • ../seo-plan/assets/ directory
  • business type input or site for auto-detect
Missing context
  • How the prompt should be invoked (chat vs single-turn)
  • Exact schema or examples of industry templates
  • Preferred output length or tone for reports
Ambiguities
  • Description field contains only ")" which is invalid.
  • Relative path '../seo-plan/assets/' is hardcoded and not portable.
  • Interactive steps (e.g., 'Ask for business type', 'Interactive structure planning') do not specify conversation format or termination conditions.

QUALITY

OVERALL
0.71
CLARITY
0.78
SPECIFICITY
0.72
REUSABILITY
0.65
COMPLETENESS
0.68

IMPROVEMENT SUGGESTIONS

  • Replace the broken description with a concise one-sentence purpose statement.
  • Make the template directory a configurable placeholder instead of a hardcoded relative path.
  • Add explicit instructions for multi-turn interaction format and when to stop asking questions.

USAGE

Copy the prompt above and paste it into your AI of choice — Claude, ChatGPT, Gemini, or anywhere else you're working. Replace any placeholder sections with your own context, then ask for the output.

MORE FOR AGENT