agent coding skill risk: low
Codebase Architecture Deepening Advisor
The prompt directs the model to explore a codebase using CONTEXT.md and docs/adr/, identify shallow modules via the deletion test and specific glossary terms, then present numbered…
SKILL 4 files
SKILL.md
---
name: improve-codebase-architecture
description: "Find deepening opportunities in a codebase, informed by the domain language in CONTEXT.md and the decisions in docs/adr/. Use when the user wants to improve architecture, find refactoring opportunities, consolidate tightly-coupled modules, or make a codebase more testable and AI-navigable."
---
# Improve Codebase Architecture
Surface architectural friction and propose **deepening opportunities** — refactors that turn shallow modules into deep ones. The aim is testability and AI-navigability.
## Glossary
Use these terms exactly in every suggestion. Consistent language is the point — don't drift into "component," "service," "API," or "boundary." Full definitions in [LANGUAGE.md](LANGUAGE.md).
- **Module** — anything with an interface and an implementation (function, class, package, slice).
- **Interface** — everything a caller must know to use the module: types, invariants, error modes, ordering, config. Not just the type signature.
- **Implementation** — the code inside.
- **Depth** — leverage at the interface: a lot of behaviour behind a small interface. **Deep** = high leverage. **Shallow** = interface nearly as complex as the implementation.
- **Seam** — where an interface lives; a place behaviour can be altered without editing in place. (Use this, not "boundary.")
- **Adapter** — a concrete thing satisfying an interface at a seam.
- **Leverage** — what callers get from depth.
- **Locality** — what maintainers get from depth: change, bugs, knowledge concentrated in one place.
Key principles (see [LANGUAGE.md](LANGUAGE.md) for the full list):
- **Deletion test**: imagine deleting the module. If complexity vanishes, it was a pass-through. If complexity reappears across N callers, it was earning its keep.
- **The interface is the test surface.**
- **One adapter = hypothetical seam. Two adapters = real seam.**
This skill is _informed_ by the project's domain model. The domain language gives names to good seams; ADRs record decisions the skill should not re-litigate.
## Process
### 1. Explore
Read the project's domain glossary and any ADRs in the area you're touching first.
Then use the Agent tool with `subagent_type=Explore` to walk the codebase. Don't follow rigid heuristics — explore organically and note where you experience friction:
- Where does understanding one concept require bouncing between many small modules?
- Where are modules **shallow** — interface nearly as complex as the implementation?
- Where have pure functions been extracted just for testability, but the real bugs hide in how they're called (no **locality**)?
- Where do tightly-coupled modules leak across their seams?
- Which parts of the codebase are untested, or hard to test through their current interface?
Apply the **deletion test** to anything you suspect is shallow: would deleting it concentrate complexity, or just move it? A "yes, concentrates" is the signal you want.
### 2. Present candidates
Present a numbered list of deepening opportunities. For each candidate:
- **Files** — which files/modules are involved
- **Problem** — why the current architecture is causing friction
- **Solution** — plain English description of what would change
- **Benefits** — explained in terms of locality and leverage, and also in how tests would improve
**Use CONTEXT.md vocabulary for the domain, and [LANGUAGE.md](LANGUAGE.md) vocabulary for the architecture.** If `CONTEXT.md` defines "Order," talk about "the Order intake module" — not "the FooBarHandler," and not "the Order service."
**ADR conflicts**: if a candidate contradicts an existing ADR, only surface it when the friction is real enough to warrant revisiting the ADR. Mark it clearly (e.g. _"contradicts ADR-0007 — but worth reopening because…"_). Don't list every theoretical refactor an ADR forbids.
Do NOT propose interfaces yet. Ask the user: "Which of these would you like to explore?"
### 3. Grilling loop
Once the user picks a candidate, drop into a grilling conversation. Walk the design tree with them — constraints, dependencies, the shape of the deepened module, what sits behind the seam, what tests survive.
Side effects happen inline as decisions crystallize:
- **Naming a deepened module after a concept not in `CONTEXT.md`?** Add the term to `CONTEXT.md` — same discipline as `/grill-with-docs` (see [CONTEXT-FORMAT.md](../grill-with-docs/CONTEXT-FORMAT.md)). Create the file lazily if it doesn't exist.
- **Sharpening a fuzzy term during the conversation?** Update `CONTEXT.md` right there.
- **User rejects the candidate with a load-bearing reason?** Offer an ADR, framed as: _"Want me to record this as an ADR so future architecture reviews don't re-suggest it?"_ Only offer when the reason would actually be needed by a future explorer to avoid re-suggesting the same thing — skip ephemeral reasons ("not worth it right now") and self-evident ones. See [ADR-FORMAT.md](../grill-with-docs/ADR-FORMAT.md).
- **Want to explore alternative interfaces for the deepened module?** See [INTERFACE-DESIGN.md](INTERFACE-DESIGN.md).
REQUIRED CONTEXT
- CONTEXT.md domain language
- docs/adr/ decisions
- LANGUAGE.md glossary
- codebase source
TOOLS REQUIRED
- agent
ROLES & RULES
- Use these terms exactly in every suggestion.
- Don't drift into component, service, API, or boundary.
- Read the project's domain glossary and any ADRs first.
- Use the Agent tool with subagent_type=Explore.
- Apply the deletion test to anything suspected shallow.
- Use CONTEXT.md vocabulary for the domain and LANGUAGE.md vocabulary for architecture.
- Mark ADR conflicts clearly only when friction warrants revisiting.
- Do NOT propose interfaces yet.
- Ask the user which candidate to explore.
- Add new terms to CONTEXT.md when naming deepened modules.
- Update CONTEXT.md when sharpening fuzzy terms.
- Offer an ADR only for load-bearing rejection reasons.
EXPECTED OUTPUT
- Format
- structured_report
- Schema
- markdown_sections · Files, Problem, Solution, Benefits
- Constraints
- use glossary terms exactly
- present numbered list of candidates with Files/Problem/Solution/Benefits
- use CONTEXT.md and LANGUAGE.md vocabulary
- do not propose interfaces; ask which candidate to explore
SUCCESS CRITERIA
- Surface architectural friction
- Propose deepening opportunities
- Use glossary terms exactly
- Explain benefits in locality and leverage
- Improve testability and AI-navigability
FAILURE MODES
- Drift into forbidden vocabulary
- Propose interfaces before user selects candidate
- List ADR conflicts without real friction
- Ignore deletion test
CAVEATS
- Dependencies
- Requires CONTEXT.md
- Requires docs/adr/
- Requires LANGUAGE.md
- Requires Agent tool with Explore subagent
- Missing context
- How the Agent tool is invoked and what it returns
- Whether CONTEXT.md and LANGUAGE.md must already exist or are created on demand
- Ambiguities
- References external files (LANGUAGE.md, CONTEXT.md, ADR-FORMAT.md) without specifying how they are provided or located.
- Agent tool usage described only at high level ('use the Agent tool with subagent_type=Explore') without input/output contract.
QUALITY
- OVERALL
- 0.79
- CLARITY
- 0.78
- SPECIFICITY
- 0.85
- REUSABILITY
- 0.72
- COMPLETENESS
- 0.80
IMPROVEMENT SUGGESTIONS
- Add explicit placeholders such as {{CONTEXT_MD_PATH}} and {{LANGUAGE_MD_PATH}} so the prompt can be reused without editing.
- Specify the exact tool call format and expected return schema for the Explore subagent.
USAGE
Copy the prompt above and paste it into your AI of choice — Claude, ChatGPT, Gemini, or anywhere else you're working. Replace any placeholder sections with your own context, then ask for the output.
MORE FOR AGENT
- Rapid App MVP Prototyperagentcoding
- AI-First Design Handoff Specs Generatoragentcoding
- Test-Driven Development Workflow Rulesagentcoding
- Structured Autonomy Implementation Agentagentcoding
- PROGRESS.md Manager for Agentic Codingagentcoding
- Hard Bug Diagnosis Disciplineagentcoding
- Git Development Branch Finisheragentcoding
- Code Review Feedback Reception Protocolagentcoding
- Systematic Debugging Process Guideagentcoding
- Matplotlib Python Plotting Guideagentcoding
- LaTeX Paper PDF Compileragentcoding
- Full Output Enforcement for Code Generationagentcoding
- PyTorch Geometric GNN Implementation Guideagentcoding
- Premium React UI Design Architectagentcoding
- Astropy Python Astronomy Library Guideagentcoding
- Book SFT Style Transfer Pipelineagentcoding
- Event Sourcing and CQRS Architectagentcoding
- FluidSim Python CFD Simulation Guideagentcoding
- NetworkX Python Graph Analysis Toolkitagentcoding
- Phase-Gated Debugging Protocol Enforceragentcoding
- SimPy Discrete-Event Simulation Guideagentcoding
- Phase-Gated Code Debugging Protocolagentcoding
- Biopython Molecular Biology Toolkit Guideagentcoding
- Haskell Advanced Type Systems Expertagentcoding
- Anime.js Complex Animation Workflowagentcoding