agent research skill risk: low
Exa AI Web Search Workflow
Instructs on performing Exa-powered web searches via argument parsing, script location and execution with filters like category/date/content, result presentation in tables, follow-…
- External action: medium
SKILL 1 file
SKILL.md
---
name: auto-claude-code-research-in-sleep-exa-search
description: "AI-powered web search via Exa with content extraction. Use when user says \"exa search\", \"web search with content\", \"find similar pages\", or needs broad web results beyond academic databases (arXiv, Semantic Scholar)."
---
# Exa AI-Powered Web Search
Search query: $ARGUMENTS
## Role & Positioning
Exa is the **broad web search** source with built-in content extraction:
| Skill | Best for |
|------|----------|
| `/arxiv` | Direct preprint search and PDF download |
| `/semantic-scholar` | Published venue papers (IEEE, ACM, Springer), citation counts |
| `/deepxiv` | Layered reading: search, brief, section map, section reads |
| `/exa-search` | Broad web search: blogs, docs, news, companies, research papers — with content extraction |
Use Exa when you need results beyond academic databases, or when you want content (highlights, full text, summaries) extracted alongside search results.
## Constants
- **EXA_FETCHER** — canonical name `exa_search.py`, resolved per
[`shared-references/integration-contract.md`](../shared-references/integration-contract.md) §2
(Codex-side chain: `$ARIS_REPO/tools/` → `tools/` → `~/.codex/skills/exa-search/`).
Policy D1 — standalone `/exa-search` has no documented fallback,
so unresolved helper terminates with an explicit error.
- **MAX_RESULTS = 10** — Default number of results to return.
> Overrides (append to arguments):
> - `/exa-search "RAG pipelines" — max: 5` — top 5 results
> - `/exa-search "diffusion models" — category: research paper` — research papers only
> - `/exa-search "startup funding" — category: news, start date: 2025-01-01` — recent news
> - `/exa-search "transformer" — content: text, max chars: 8000` — full text mode
> - `/exa-search "transformer" — content: summary` — LLM-generated summaries
> - `/exa-search "transformer" — domains: arxiv.org,huggingface.co` — domain filter
> - `/exa-search "https://arxiv.org/abs/2301.07041" — similar` — find similar pages
## Setup
Exa requires the `exa-py` SDK and an API key:
```bash
pip install exa-py
```
Set your API key:
```bash
export EXA_API_KEY=your-key-here
```
Get a key from [exa.ai](https://exa.ai).
## Workflow
### Step 1: Parse Arguments
Parse `$ARGUMENTS` for:
- **query**: The search query (required) or a URL (for `find-similar` mode)
- **similar**: If present, use `find-similar` mode instead of search
- **max**: Override MAX_RESULTS
- **category**: `research paper`, `news`, `company`, `personal site`, `financial report`, `people`
- **content**: `highlights` (default), `text`, `summary`, `none`
- **max chars**: Max characters for content extraction
- **type**: Search type — `auto` (default), `neural`, `fast`, `instant`
- **domains**: Comma-separated include domains
- **exclude domains**: Comma-separated exclude domains
- **include text**: Phrase that must appear in results
- **exclude text**: Phrase to exclude from results
- **start date**: ISO 8601 date — only results after this
- **end date**: ISO 8601 date — only results before this
- **location**: Two-letter ISO country code
### Step 2: Locate Script
```bash
# Resolve $EXA_FETCHER via the canonical strict-safe Codex chain.
if [ -z "${ARIS_REPO:-}" ] && [ -f .aris/installed-skills-codex.txt ]; then
ARIS_REPO=$(awk -F'\t' '$1=="repo_root"{print $2; exit}' .aris/installed-skills-codex.txt 2>/dev/null) || true
fi
EXA_FETCHER=""
[ -n "${ARIS_REPO:-}" ] && [ -f "$ARIS_REPO/tools/exa_search.py" ] && EXA_FETCHER="$ARIS_REPO/tools/exa_search.py"
[ -z "$EXA_FETCHER" ] && [ -f tools/exa_search.py ] && EXA_FETCHER="tools/exa_search.py"
[ -z "$EXA_FETCHER" ] && [ -f ~/.codex/skills/exa-search/exa_search.py ] && EXA_FETCHER="$HOME/.codex/skills/exa-search/exa_search.py"
[ -z "$EXA_FETCHER" ] && {
echo "ERROR: exa_search.py not resolved at \$ARIS_REPO/tools/, tools/, or ~/.codex/skills/exa-search/." >&2
echo " Fix: rerun bash tools/install_aris_codex.sh, export ARIS_REPO, or copy the helper to tools/." >&2
echo " Also ensure 'exa-py' is installed: pip install exa-py" >&2
exit 1
}
```
If not found, tell the user:
```
exa_search.py not found. Run install_aris_codex.sh, set ARIS_REPO to your ARIS repo root, or install/copy the helper into the project/global Codex skill path; then install exa-py:
pip install exa-py
```
### Step 3: Execute Search
**Standard search:**
```bash
python3 "$EXA_FETCHER" search "QUERY" --max 10 --content highlights
```
**With filters:**
```bash
python3 "$EXA_FETCHER" search "QUERY" --max 10 \
--category "research paper" \
--start-date 2025-01-01 \
--content text --max-chars 8000
```
**Find similar pages:**
```bash
python3 "$EXA_FETCHER" find-similar "URL" --max 5 --content highlights
```
**Get content for known URLs:**
```bash
python3 "$EXA_FETCHER" get-contents "URL1" "URL2" --content text
```
### Step 4: Present Results
Format results as a structured table:
```
| # | Title | Authors | Venue/Publisher | URL | Date | Key Content |
|---|-------|---------|-----------------|-----|------|-------------|
```
For each result:
- Show title and URL
- Show published date if available
- Show highlights, text excerpt, or summary depending on content mode
- Flag particularly relevant results
- **For `category: "research paper"` hits only** — also record authors
(from Exa's `author`/`authors` fields, or fallback: parse from the
result snippet) and venue/publisher (from `publisher`, `source`, or
the domain hosting the paper). These are needed by Step 6's wiki
hook; if either is unavailable for a given hit, skip wiki ingest
for that one hit and log a note.
### Step 5: Offer Follow-up
After presenting results, suggest:
- **Deepen**: "I can fetch full text for any of these results"
- **Find similar**: "I can find pages similar to any result"
- **Narrow**: "I can re-search with domain/date/text filters"
### Step 6: Update Research Wiki (if active, research-paper results only)
**Required when `research-wiki/` exists AND the search returned
results of `category: "research paper"`**; skip silently otherwise.
General web results (blog posts, docs, news) are **not** ingested —
the wiki is for papers only.
For each research paper hit, try to recover an arXiv ID from the URL
(`arxiv.org/abs/<id>`); if present, use `--arxiv-id`. Otherwise fall
back to manual metadata:
```
if [ -d research-wiki/ ] and query category was "research paper":
WIKI_SCRIPT=""
[ -n "$ARIS_REPO" ] && [ -f "$ARIS_REPO/tools/research_wiki.py" ] && WIKI_SCRIPT="$ARIS_REPO/tools/research_wiki.py"
[ -z "$WIKI_SCRIPT" ] && [ -f tools/research_wiki.py ] && WIKI_SCRIPT="tools/research_wiki.py"
[ -z "$WIKI_SCRIPT" ] && [ -f ~/.codex/skills/research-wiki/research_wiki.py ] && WIKI_SCRIPT="$HOME/.codex/skills/research-wiki/research_wiki.py"
for each research-paper hit in results:
if URL matches arxiv.org/abs/<id>:
[ -n "$WIKI_SCRIPT" ] && python3 "$WIKI_SCRIPT" ingest_paper research-wiki/ \
--arxiv-id "<id>"
else:
[ -n "$WIKI_SCRIPT" ] && python3 "$WIKI_SCRIPT" ingest_paper research-wiki/ \
--title "<title>" --authors "<authors joined by , >" \
--year <year> --venue "<venue or publisher>"
```
The helper handles slug / dedup / page / index / log — **do not
handwrite `papers/<slug>.md`**. See
[`shared-references/integration-contract.md`](../shared-references/integration-contract.md).
## Key Rules
- Always check that `EXA_API_KEY` is set before searching
- Default to `highlights` content mode for a good balance of speed and context
- Use `category: "research paper"` when the user is clearly looking for academic content
- Use `text` content mode when the user needs full page content
- Combine with `/arxiv` or `/semantic-scholar` for comprehensive literature coverage
INPUTS
- $ARGUMENTS REQUIRED
search query or URL plus optional filter flags
e.g. RAG pipelines --max 5
REQUIRED CONTEXT
- $ARGUMENTS (search query or URL)
OPTIONAL CONTEXT
- max
- category
- content mode
- dates
- domains
- similar flag
ROLES & RULES
- Always check that `EXA_API_KEY` is set before searching
- Default to `highlights` content mode for a good balance of speed and context
- Use `category: "research paper"` when the user is clearly looking for academic content
- Use `text` content mode when the user needs full page content
- Combine with `/arxiv` or `/semantic-scholar` for comprehensive literature coverage
- If not found, tell the user the resolution error message
- For `category: "research paper"` hits only — also record authors and venue/publisher
EXPECTED OUTPUT
- Format
- markdown
- Schema
- markdown_table · #, Title, Authors, Venue/Publisher, URL, Date, Key Content
- Constraints
- use structured table for results
- include specific columns for research papers
- suggest follow-ups after results
SUCCESS CRITERIA
- Parse all supported argument filters from $ARGUMENTS
- Resolve EXA_FETCHER via the documented three-location chain or emit explicit error
- Present every result in the mandated markdown table
- Offer deepen/find-similar/narrow follow-ups
- Ingest research-paper results into research-wiki/ when the directory exists
FAILURE MODES
- May terminate if exa_search.py cannot be located in any of the three paths
- May skip wiki ingestion silently for non-paper results or missing arxiv IDs
- May produce incomplete author/venue fields when Exa metadata is absent
EXAMPLES
Includes seven override examples for max/category/content/domains/similar flags plus four bash execution examples for search/find-similar/get-contents.
CAVEATS
- Dependencies
- EXA_API_KEY environment variable
- exa_search.py script (via ARIS_REPO/tools/, tools/, or ~/.codex/skills/exa-search/)
- research-wiki/ directory (optional, only for paper ingestion)
- research_wiki.py script (optional)
- Missing context
- Exact shell environment assumptions (bash vs zsh, OS paths)
- How ARIS_REPO is normally populated if .aris file is absent
- Ambiguities
- Does not specify the exact action or error message when EXA_API_KEY is unset (only states to check it).
- Step 2 error message differs slightly from the inline script error handling.
QUALITY
- OVERALL
- 0.71
- CLARITY
- 0.78
- SPECIFICITY
- 0.92
- REUSABILITY
- 0.38
- COMPLETENESS
- 0.85
IMPROVEMENT SUGGESTIONS
- Add an explicit early check + error block for missing EXA_API_KEY before any script resolution.
- Extract the long path-resolution logic into a single reusable helper function or note to reduce duplication between EXA_FETCHER and WIKI_SCRIPT.
- Make the result table columns conditional on category so non-paper results do not show empty Authors/Venue columns.
USAGE
Copy the prompt above and paste it into your AI of choice — Claude, ChatGPT, Gemini, or anywhere else you're working. Replace any placeholder sections with your own context, then ask for the output.
MORE FOR AGENT
- Creative Thinking Frameworks for CS Researchagentresearch
- Academic Paper Figure Generatoragentresearch
- Deep Investigation Agent for Geopolitics Researchagentresearch
- Customer Research Analyst and Synthesizeragentresearch
- Gemini Research Paper Literature Searchagentresearch
- Research Formula Derivation Package Builderagentresearch
- Research Session Provenance Recorderagentresearch
- BIDS Neuroscience Data Organizeragentresearch
- Research Experiment Plan Roadmap Builderagentresearch
- ARA Research Artifact Compileragentresearch
- Research Proposal Experiment Roadmap Generatoragentresearch
- ML AI Theorem Proof Package Writeragentresearch
- Research Formula Derivation Package Builderagentresearch
- Scientific ML Catalog Assistantagentresearch
- OpenMM MDAnalysis Molecular Dynamics Workflowagentresearch
- Publication-Quality Paper Figure Generatoragentresearch
- ML Research Idea Generator and Rankeragentresearch
- ML Paper Figure and Table Generatoragentresearch
- Competitor Profiling Intelligence Analystagentresearch
- Research Method Novelty Checkeragentresearch
- Research Refine and Experiment Planning Pipelineagentresearch
- ML Ablation Study Planneragentresearch
- Research Agent Validation Best Practicesagentresearch
- AlphaXiv arXiv Paper Lookup Workflowagentresearch
- AlphaXiv Single-Paper Lookup and Summarizeragentresearch