agent marketing skill risk: low
SEO Sitemap Analyzer and Generator
Defines two modes for analyzing an existing sitemap against validation checks, quality signals, and common issues, or generating a new sitemap via business-type detection, template…
SKILL.md
---
name: seo-sitemap
description: ")"
---
# Sitemap Analysis & Generation
## Mode 1: Analyze Existing Sitemap
### Validation Checks
- Valid XML format
- URL count ≤50,000 per file (protocol limit; each file must also be no larger than 50 MB uncompressed)
- All URLs return HTTP 200
- `<lastmod>` dates are accurate (not all identical)
- No reliance on `<priority>` or `<changefreq>`: Google ignores both tags
- Sitemap referenced in robots.txt
- Compare crawled pages vs sitemap; flag missing pages
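
The offline checks in this list (well-formed XML, URL count, identical `<lastmod>` values, ignored tags) can be sketched with the standard library. This is a minimal illustration, not the skill's actual implementation; the function name `check_sitemap` and the wording of the findings are assumptions, and the HTTP, robots.txt, and crawl-comparison checks are left out because they need network access.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"
MAX_URLS_PER_FILE = 50_000  # per-file limit from the sitemap protocol


def check_sitemap(xml_text: str) -> list[str]:
    """Return human-readable findings for one sitemap file (hypothetical helper)."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError as exc:
        line, column = exc.position  # where the parser gave up
        return [f"Invalid XML at line {line}, column {column}: {exc}"]

    findings = []
    urls = root.findall(f"{SITEMAP_NS}url")
    if len(urls) > MAX_URLS_PER_FILE:
        findings.append(f"{len(urls)} URLs exceeds the {MAX_URLS_PER_FILE} per-file limit")

    # Identical dates on every entry suggest lastmod is stamped at build time.
    dated = [d for d in (u.findtext(f"{SITEMAP_NS}lastmod") for u in urls) if d]
    if len(dated) > 1 and len(set(dated)) == 1:
        findings.append("All <lastmod> values are identical")

    for tag in ("priority", "changefreq"):
        if root.find(f".//{SITEMAP_NS}{tag}") is not None:
            findings.append(f"<{tag}> present (ignored by Google)")
    return findings
```

An empty list means none of these offline checks fired; it says nothing about whether the URLs actually return HTTP 200.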
### Quality Signals
- Sitemap index file if >50k URLs
- Split by content type (pages, posts, images, videos)
- No non-canonical URLs in sitemap
- No noindexed URLs in sitemap
- No redirected URLs in sitemap
- HTTPS URLs only (no HTTP)
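
Once a crawl has recorded the status, canonical target, and indexability of each URL, the exclusion rules above reduce to a few comparisons. The sketch below assumes a hypothetical `PageFacts` record filled in by some earlier crawl step; neither the record nor `exclusion_reasons` comes from the skill itself.

```python
from dataclasses import dataclass
from typing import Optional
from urllib.parse import urlsplit


@dataclass
class PageFacts:
    """Hypothetical record of what a crawl learned about one sitemap URL."""
    url: str
    status: int               # HTTP status of the first response
    canonical: Optional[str]  # rel=canonical target, if the page declares one
    noindex: bool             # True if a noindex directive was found


def exclusion_reasons(page: PageFacts) -> list[str]:
    """List the reasons this URL should not appear in the sitemap."""
    reasons = []
    if urlsplit(page.url).scheme != "https":
        reasons.append("not HTTPS")
    if 300 <= page.status < 400:
        reasons.append("redirects")
    elif page.status != 200:
        reasons.append(f"returns HTTP {page.status}")
    if page.noindex:
        reasons.append("noindexed")
    if page.canonical and page.canonical != page.url:
        reasons.append("non-canonical")
    return reasons
```

A URL with an empty reason list is a reasonable candidate to keep. The canonical comparison here is a plain string match, so trailing-slash or case differences would need normalizing first.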
### Common Issues
| Issue | Severity | Fix |
|-------|----------|-----|
| >50k URLs in single file | Critical | Split with sitemap index |
| Non-200 URLs | High | Remove or fix broken URLs |
| Noindexed URLs included | High | Remove from sitemap |
| Redirected URLs included | Medium | Update to final URLs |
| All identical lastmod | Low | Use actual modification dates |
| Priority/changefreq used | Info | Can remove (ignored by Google) |
## Mode 2: Generate New Sitemap
### Process
1. Ask for business type (or auto-detect from existing site)
2. Load industry template from `../seo-plan/assets/` directory
3. Interactive structure planning with user
4. Apply quality gates:
   - ⚠️ WARNING at 30+ location pages (require 60%+ unique content)
   - 🛑 HARD STOP at 50+ location pages (require justification)
5. Generate valid XML output
6. Split at 50k URLs with sitemap index
7. Generate STRUCTURE.md documentation
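
The quality gates in step 4 can be expressed as a small decision function. The thresholds come from the list above; everything else is an assumption made for illustration: the function name, the `(level, passed)` return shape, the way unique content is measured as a ratio, and the reading that the 60% uniqueness requirement still applies once the hard stop is reached.

```python
WARNING_AT = 30          # location pages
HARD_STOP_AT = 50        # location pages
MIN_UNIQUE_RATIO = 0.60  # share of unique content required from the warning level up


def location_page_gate(page_count: int, unique_ratio: float,
                       justification: str = "") -> tuple[str, bool]:
    """Return (gate level, whether that gate's requirements are met)."""
    unique_enough = unique_ratio >= MIN_UNIQUE_RATIO
    if page_count >= HARD_STOP_AT:
        # Assumption: a hard stop needs a written justification AND unique content.
        return ("hard_stop", bool(justification.strip()) and unique_enough)
    if page_count >= WARNING_AT:
        return ("warning", unique_enough)
    return ("ok", True)
```

Returning the level separately from the pass/fail flag lets the caller still show the warning to the user even when the uniqueness requirement is satisfied.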
### Safe Programmatic Pages (OK at scale)
✅ Integration pages (with real setup docs)
✅ Template/tool pages (with downloadable content)
✅ Glossary pages (200+ word definitions)
✅ Product pages (unique specs, reviews)
✅ User profile pages (user-generated content)
### Penalty Risk (avoid at scale)
❌ Location pages with only city name swapped
❌ "Best [tool] for [industry]" without industry-specific value
❌ "[Competitor] alternative" without real comparison data
❌ AI-generated pages without human review and unique value
## Sitemap Format
### Standard Sitemap
```xml
<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>https://example.com/page</loc>
    <lastmod>2026-02-07</lastmod>
  </url>
</urlset>
```
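
One way to produce a file in this format is to build it with `xml.etree.ElementTree` rather than string concatenation, so characters such as `&` in URLs are escaped automatically. This is a sketch under assumptions: `build_sitemap` and its `(url, lastmod)` input shape are illustrative, and `ET.indent` requires Python 3.9 or later.

```python
import xml.etree.ElementTree as ET

SITEMAP_XMLNS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(entries: list[tuple[str, str]]) -> bytes:
    """Serialize (absolute URL, YYYY-MM-DD lastmod) pairs as a sitemap file."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_XMLNS)
    for loc, lastmod in entries:
        url = ET.SubElement(urlset, "url")
        ET.SubElement(url, "loc").text = loc          # text is XML-escaped on output
        ET.SubElement(url, "lastmod").text = lastmod
    ET.indent(urlset)  # pretty-print; Python 3.9+
    return ET.tostring(urlset, encoding="UTF-8", xml_declaration=True)
```

The result is bytes ready to write to `sitemap.xml`, for example with `pathlib.Path("sitemap.xml").write_bytes(...)`.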
### Sitemap Index (for >50k URLs)
```xml
<?xml version="1.0" encoding="UTF-8"?>
<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <sitemap>
    <loc>https://example.com/sitemap-pages.xml</loc>
    <lastmod>2026-02-07</lastmod>
  </sitemap>
  <sitemap>
    <loc>https://example.com/sitemap-posts.xml</loc>
    <lastmod>2026-02-07</lastmod>
  </sitemap>
</sitemapindex>
```
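
Splitting at the 50,000-URL limit and writing the matching index might look like the sketch below. The helper names and the numbered `sitemap-{n}.xml` file naming are assumptions for illustration; a real split would more likely follow the content-type names shown in the example above.

```python
import xml.etree.ElementTree as ET

SITEMAP_XMLNS = "http://www.sitemaps.org/schemas/sitemap/0.9"
MAX_URLS_PER_FILE = 50_000


def chunk_urls(urls: list[str], size: int = MAX_URLS_PER_FILE) -> list[list[str]]:
    """Split a URL list into consecutive chunks of at most `size` entries."""
    return [urls[i:i + size] for i in range(0, len(urls), size)]


def build_index(base_url: str, file_count: int, lastmod: str) -> bytes:
    """Serialize a sitemap index pointing at sitemap-1.xml .. sitemap-N.xml."""
    index = ET.Element("sitemapindex", xmlns=SITEMAP_XMLNS)
    for n in range(1, file_count + 1):
        sitemap = ET.SubElement(index, "sitemap")
        # Hypothetical naming scheme; adjust to the site's own file names.
        ET.SubElement(sitemap, "loc").text = f"{base_url}/sitemap-{n}.xml"
        ET.SubElement(sitemap, "lastmod").text = lastmod
    return ET.tostring(index, encoding="UTF-8", xml_declaration=True)
```

Each chunk would then be written as its own sitemap file, with the index listing one `<sitemap>` entry per chunk.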
## Error Handling
- **URL unreachable**: Report the HTTP status code and suggest checking if the site is live
- **No sitemap found**: Check common locations (/sitemap.xml, /sitemap_index.xml, robots.txt reference) before reporting "not found"
- **Invalid XML format**: Report specific parsing errors with line numbers
- **Rate limiting detected**: Back off and report partial results with a note about retry timing
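
The "no sitemap found" rule implies an ordered list of places to look before giving up. A minimal sketch of building that list follows; the function name is an assumption, and it takes the robots.txt body as a string so that fetching stays outside the example.

```python
from urllib.parse import urljoin

COMMON_PATHS = ("/sitemap.xml", "/sitemap_index.xml")


def sitemap_candidates(site_url: str, robots_txt: str = "") -> list[str]:
    """Return candidate sitemap URLs to try, in order, without duplicates."""
    candidates = [urljoin(site_url, path) for path in COMMON_PATHS]
    for line in robots_txt.splitlines():
        # robots.txt directives are "Field: value"; field names are case-insensitive.
        field, _, value = line.partition(":")
        if field.strip().lower() == "sitemap" and value.strip():
            candidates.append(value.strip())
    return list(dict.fromkeys(candidates))  # de-duplicate, keep first-seen order
```

Only after every candidate has been tried and failed would the analysis report "not found".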
## Output
### For Analysis
- `VALIDATION-REPORT.md`: analysis results
- Issues list with severity
- Recommendations
### For Generation
- `sitemap.xml` (or split files with index)
- `STRUCTURE.md`: site architecture documentation
- URL count and organization summary
REQUIRED CONTEXT
- existing sitemap URL or site URL for analysis
- business type or site content for generation
EXPECTED OUTPUT
- Format: markdown
- Schema: markdown_files · VALIDATION-REPORT.md, Issues list with severity, Recommendations, sitemap.xml, STRUCTURE.md, URL count and organization summary
- Constraints
  - produce VALIDATION-REPORT.md or sitemap.xml plus STRUCTURE.md
  - include severity-labeled issues for analysis mode
  - output valid XML for generation mode
SUCCESS CRITERIA
- Perform all validation checks
- Apply quality gates
- Generate valid XML output
- Split at 50k URLs with index when needed
- Generate STRUCTURE.md documentation
FAILURE MODES
- May include non-canonical or noindexed URLs
- May fail to split files over 50k URLs
- May generate low-value location pages at scale
CAVEATS
- Dependencies
  - ../seo-plan/assets/ directory
  - business type input or site for auto-detect
- Missing context
  - How the prompt should be invoked (chat vs single-turn)
  - Exact schema or examples of industry templates
  - Preferred output length or tone for reports
- Ambiguities
  - Description field contains only ")" which is invalid.
  - Relative path '../seo-plan/assets/' is hardcoded and not portable.
  - Interactive steps (e.g., 'Ask for business type', 'Interactive structure planning') do not specify conversation format or termination conditions.
QUALITY
- OVERALL: 0.71
- CLARITY: 0.78
- SPECIFICITY: 0.72
- REUSABILITY: 0.65
- COMPLETENESS: 0.68
IMPROVEMENT SUGGESTIONS
- Replace the broken description with a concise one-sentence purpose statement.
- Make the template directory a configurable placeholder instead of a hardcoded relative path.
- Add explicit instructions for multi-turn interaction format and when to stop asking questions.