agent research skill risk: low
Exa AI Web Search Workflow
The prompt provides step-by-step instructions for handling Exa-powered web searches, including argument parsing, script location and execution with filters, result presentation in…
- External action: medium
SKILL 1 file
---
name: auto-claude-code-research-in-sleep-exa-search-d97c63b4
description: "AI-powered web search via Exa with content extraction. Use when user says \"exa search\", \"web search with content\", \"find similar pages\", or needs broad web results beyond academic databases (arXiv, Semantic Scholar)."
---
# Exa AI-Powered Web Search
Search query: $ARGUMENTS
## Role & Positioning
Exa is the **broad web search** source with built-in content extraction:
| Skill | Best for |
|------|----------|
| `/arxiv` | Direct preprint search and PDF download |
| `/semantic-scholar` | Published venue papers (IEEE, ACM, Springer), citation counts |
| `/deepxiv` | Layered reading: search, brief, section map, section reads |
| `/exa-search` | Broad web search: blogs, docs, news, companies, research papers — with content extraction |
Use Exa when you need results beyond academic databases, or when you want content (highlights, full text, summaries) extracted alongside search results.
## Constants
- **EXA_FETCHER** — canonical name `exa_search.py`, resolved per
[`shared-references/integration-contract.md`](../shared-references/integration-contract.md) §2
(Policy D1 — standalone `/exa-search` has no documented fallback,
so unresolved helper terminates with an explicit error).
- **MAX_RESULTS = 10** — Default number of results to return.
> Overrides (append to arguments):
> - `/exa-search "RAG pipelines" — max: 5` — top 5 results
> - `/exa-search "diffusion models" — category: research paper` — research papers only
> - `/exa-search "startup funding" — category: news, start date: 2025-01-01` — recent news
> - `/exa-search "transformer" — content: text, max chars: 8000` — full text mode
> - `/exa-search "transformer" — content: summary` — LLM-generated summaries
> - `/exa-search "transformer" — domains: arxiv.org,huggingface.co` — domain filter
> - `/exa-search "https://arxiv.org/abs/2301.07041" — similar` — find similar pages
## Setup
Exa requires the `exa-py` SDK and an API key:
```bash
pip install exa-py
```
Set your API key:
```bash
export EXA_API_KEY=your-key-here
```
Get a key from [exa.ai](https://exa.ai).
## Workflow
### Step 1: Parse Arguments
Parse `$ARGUMENTS` for:
- **query**: The search query (required) or a URL (for `find-similar` mode)
- **similar**: If present, use `find-similar` mode instead of search
- **max**: Override MAX_RESULTS
- **category**: `research paper`, `news`, `company`, `personal site`, `financial report`, `people`
- **content**: `highlights` (default), `text`, `summary`, `none`
- **max chars**: Max characters for content extraction
- **type**: Search type — `auto` (default), `neural`, `fast`, `instant`
- **domains**: Comma-separated include domains
- **exclude domains**: Comma-separated exclude domains
- **include text**: Phrase that must appear in results
- **exclude text**: Phrase to exclude from results
- **start date**: ISO 8601 date — only results after this
- **end date**: ISO 8601 date — only results before this
- **location**: Two-letter ISO country code
### Step 2: Locate Script
Resolve `$EXA_FETCHER` via the canonical strict-safe chain (see
[`shared-references/integration-contract.md`](../shared-references/integration-contract.md) §2).
Policy D1 cascade: there is no native inline fallback for Exa
(retrieval requires the `exa-py` SDK + API key, which lives in the
fetcher), so unresolved helper means the SKILL cannot produce its
primary output — fail with explicit remediation.
```bash
cd "$(git rev-parse --show-toplevel 2>/dev/null || pwd)" || exit 1
if [ -z "${ARIS_REPO:-}" ] && [ -f .aris/installed-skills.txt ]; then
ARIS_REPO=$(awk -F'\t' '$1=="repo_root"{print $2; exit}' .aris/installed-skills.txt 2>/dev/null) || true
fi
EXA_FETCHER=".aris/tools/exa_search.py"
[ -f "$EXA_FETCHER" ] || EXA_FETCHER="tools/exa_search.py"
[ -f "$EXA_FETCHER" ] || { [ -n "${ARIS_REPO:-}" ] && EXA_FETCHER="$ARIS_REPO/tools/exa_search.py"; }
[ -f "$EXA_FETCHER" ] || {
echo "ERROR: exa_search.py not resolved at .aris/tools/, tools/, or \$ARIS_REPO/tools/." >&2
echo " Fix: rerun bash tools/install_aris.sh, export ARIS_REPO, or copy the helper to tools/." >&2
echo " Also ensure 'exa-py' is installed: pip install exa-py" >&2
exit 1
}
```
### Step 3: Execute Search
**Standard search:**
```bash
python3 "$EXA_FETCHER" search "QUERY" --max 10 --content highlights
```
**With filters:**
```bash
python3 "$EXA_FETCHER" search "QUERY" --max 10 \
--category "research paper" \
--start-date 2025-01-01 \
--content text --max-chars 8000
```
**Find similar pages:**
```bash
python3 "$EXA_FETCHER" find-similar "URL" --max 5 --content highlights
```
**Get content for known URLs:**
```bash
python3 "$EXA_FETCHER" get-contents "URL1" "URL2" --content text
```
### Step 4: Present Results
Format results as a structured table:
```
| # | Title | Authors | Venue/Publisher | URL | Date | Key Content |
|---|-------|---------|-----------------|-----|------|-------------|
```
For each result:
- Show title and URL
- Show published date if available
- Show highlights, text excerpt, or summary depending on content mode
- Flag particularly relevant results
- **For `category: "research paper"` hits only** — also record authors
(from Exa's `author`/`authors` fields, or fallback: parse from the
result snippet) and venue/publisher (from `publisher`, `source`, or
the domain hosting the paper). These are needed by Step 6's wiki
hook; if either is unavailable for a given hit, skip wiki ingest
for that one hit and log a note.
### Step 5: Offer Follow-up
After presenting results, suggest:
- **Deepen**: "I can fetch full text for any of these results"
- **Find similar**: "I can find pages similar to any result"
- **Narrow**: "I can re-search with domain/date/text filters"
### Step 6: Update Research Wiki (if active, research-paper results only)
**Required when `research-wiki/` exists AND the search returned
results of `category: "research paper"`**; skip silently otherwise.
General web results (blog posts, docs, news) are **not** ingested —
the wiki is for papers only.
When the predicates hold, resolve `$WIKI_SCRIPT` per the canonical
chain at
[`shared-references/wiki-helper-resolution.md`](../shared-references/wiki-helper-resolution.md)
(Variant B — warn-and-skip). For each research paper hit, try to
recover an arXiv ID from the URL (`arxiv.org/abs/<id>`); if present,
use `--arxiv-id`. Otherwise fall back to manual metadata:
```bash
if [ -d research-wiki/ ] and query category was "research paper":
cd "$(git rev-parse --show-toplevel 2>/dev/null || pwd)" || exit 1
ARIS_REPO="${ARIS_REPO:-$(awk -F'\t' '$1=="repo_root"{print $2; exit}' .aris/installed-skills.txt 2>/dev/null)}"
WIKI_SCRIPT=".aris/tools/research_wiki.py"
[ -f "$WIKI_SCRIPT" ] || WIKI_SCRIPT="tools/research_wiki.py"
[ -f "$WIKI_SCRIPT" ] || { [ -n "${ARIS_REPO:-}" ] && WIKI_SCRIPT="$ARIS_REPO/tools/research_wiki.py"; }
[ -f "$WIKI_SCRIPT" ] || {
echo "WARN: research_wiki.py not found; exa-search results delivered, wiki ingest skipped. Fix: bash tools/install_aris.sh, export ARIS_REPO, or cp <ARIS-repo>/tools/research_wiki.py tools/." >&2
WIKI_SCRIPT=""
}
[ -n "$WIKI_SCRIPT" ] && for each research-paper hit in results:
if URL matches arxiv.org/abs/<id>:
python3 "$WIKI_SCRIPT" ingest_paper research-wiki/ \
--arxiv-id "<id>"
else:
python3 "$WIKI_SCRIPT" ingest_paper research-wiki/ \
--title "<title>" --authors "<authors joined by , >" \
--year <year> --venue "<venue or publisher>"
```
The helper handles slug / dedup / page / index / log — **do not
handwrite `papers/<slug>.md`**. See
[`shared-references/integration-contract.md`](../shared-references/integration-contract.md).
## Key Rules
- Always check that `EXA_API_KEY` is set before searching
- Default to `highlights` content mode for a good balance of speed and context
- Use `category: "research paper"` when the user is clearly looking for academic content
- Use `text` content mode when the user needs full page content
- Combine with `/arxiv` or `/semantic-scholar` for comprehensive literature coverage
INPUTS
- $ARGUMENTS REQUIRED
search query or URL plus optional filter flags
e.g. RAG pipelines --max 5
- $EXA_FETCHER REQUIRED
path to exa_search.py helper
e.g. .aris/tools/exa_search.py
- $WIKI_SCRIPT
path to research_wiki.py helper
e.g. .aris/tools/research_wiki.py
REQUIRED CONTEXT
- $ARGUMENTS (search query or URL)
OPTIONAL CONTEXT
- max
- category
- content mode
- max-chars
- type
- domains
- start date
- end date
- location
TOOLS REQUIRED
- code_execution
ROLES & RULES
- Always check that `EXA_API_KEY` is set before searching
- Default to `highlights` content mode for a good balance of speed and context
- Use `category: "research paper"` when the user is clearly looking for academic content
- Use `text` content mode when the user needs full page content
- Combine with `/arxiv` or `/semantic-scholar` for comprehensive literature coverage
EXPECTED OUTPUT
- Format
- markdown
- Schema
- markdown_table · #, Title, Authors, Venue/Publisher, URL, Date, Key Content
- Constraints
- present results as structured table with columns #, Title, Authors, Venue/Publisher, URL, Date, Key Content
- include follow-up suggestions after results
- conditionally run wiki ingest only for research-paper category results when research-wiki/ exists
SUCCESS CRITERIA
- Parse arguments and execute search via resolved fetcher script
- Present results in the specified structured table
- Offer deepen/find-similar/narrow follow-ups
- Ingest qualifying research papers into wiki when conditions met
FAILURE MODES
- Unresolved EXA_FETCHER script terminates with explicit error
- Wiki ingest skipped silently for non-paper results or missing research-wiki/
- Missing authors/venue for a paper hit skips wiki ingest for that result
EXAMPLES
Includes multiple command examples for argument overrides, search execution modes, filters, find-similar, and wiki ingestion.
CAVEATS
- Dependencies
- EXA_FETCHER (exa_search.py) resolved via integration-contract.md
- EXA_API_KEY environment variable
- Optional research-wiki/ directory and WIKI_SCRIPT via wiki-helper-resolution.md
- Ambiguities
- References external documents (integration-contract.md, wiki-helper-resolution.md) whose content is not provided in the prompt itself.
- Step 6 condition "if [ -d research-wiki/ ] and query category was 'research paper'" mixes shell syntax with natural language.
QUALITY
- OVERALL
- 0.71
- CLARITY
- 0.78
- SPECIFICITY
- 0.88
- REUSABILITY
- 0.42
- COMPLETENESS
- 0.82
IMPROVEMENT SUGGESTIONS
- Extract the long shell resolution blocks into a single reusable helper function or reference to reduce duplication between EXA_FETCHER and WIKI_SCRIPT resolution.
- Add an explicit 'no results' handling instruction in Step 4 to make output behavior fully deterministic.
USAGE
Copy the prompt above and paste it into your AI of choice — Claude, ChatGPT, Gemini, or anywhere else you're working. Replace any placeholder sections with your own context, then ask for the output.
MORE FOR AGENT
- Creative Thinking Frameworks for CS Researchagentresearch
- Academic Paper Figure Generatoragentresearch
- Deep Investigation Agent for Geopolitics Researchagentresearch
- Customer Research Analyst and Synthesizeragentresearch
- Gemini Research Paper Literature Searchagentresearch
- Research Formula Derivation Package Builderagentresearch
- Research Session Provenance Recorderagentresearch
- BIDS Neuroscience Data Organizeragentresearch
- Research Experiment Plan Roadmap Builderagentresearch
- ARA Research Artifact Compileragentresearch
- Research Proposal Experiment Roadmap Generatoragentresearch
- ML AI Theorem Proof Package Writeragentresearch
- Research Formula Derivation Package Builderagentresearch
- Scientific ML Catalog Assistantagentresearch
- OpenMM MDAnalysis Molecular Dynamics Workflowagentresearch
- Publication-Quality Paper Figure Generatoragentresearch
- ML Research Idea Generator and Rankeragentresearch
- ML Paper Figure and Table Generatoragentresearch
- Competitor Profiling Intelligence Analystagentresearch
- Research Method Novelty Checkeragentresearch
- Research Refine and Experiment Planning Pipelineagentresearch
- ML Ablation Study Planneragentresearch
- Research Agent Validation Best Practicesagentresearch
- AlphaXiv arXiv Paper Lookup Workflowagentresearch
- AlphaXiv Single-Paper Lookup and Summarizeragentresearch