agent research skill risk: low

Exa AI Web Search Workflow

Instructs on performing Exa-powered web searches via argument parsing, script location and execution with filters like category/date/content, result presentation in tables, follow-…

External action: medium

SKILL 1 file

SKILL.md

Download

---
name: auto-claude-code-research-in-sleep-exa-search
description: "AI-powered web search via Exa with content extraction. Use when user says \"exa search\", \"web search with content\", \"find similar pages\", or needs broad web results beyond academic databases (arXiv, Semantic Scholar)."
---
# Exa AI-Powered Web Search

Search query: $ARGUMENTS

## Role & Positioning

Exa is the **broad web search** source with built-in content extraction:

| Skill | Best for |
|------|----------|
| `/arxiv` | Direct preprint search and PDF download |
| `/semantic-scholar` | Published venue papers (IEEE, ACM, Springer), citation counts |
| `/deepxiv` | Layered reading: search, brief, section map, section reads |
| `/exa-search` | Broad web search: blogs, docs, news, companies, research papers — with content extraction |

Use Exa when you need results beyond academic databases, or when you want content (highlights, full text, summaries) extracted alongside search results.

## Constants

- **EXA_FETCHER** — canonical name `exa_search.py`, resolved per
  [`shared-references/integration-contract.md`](../shared-references/integration-contract.md) §2
  (Codex-side chain: `$ARIS_REPO/tools/` → `tools/` → `~/.codex/skills/exa-search/`).
  Policy D1 — standalone `/exa-search` has no documented fallback,
  so unresolved helper terminates with an explicit error.
- **MAX_RESULTS = 10** — Default number of results to return.

> Overrides (append to arguments):
> - `/exa-search "RAG pipelines" — max: 5` — top 5 results
> - `/exa-search "diffusion models" — category: research paper` — research papers only
> - `/exa-search "startup funding" — category: news, start date: 2025-01-01` — recent news
> - `/exa-search "transformer" — content: text, max chars: 8000` — full text mode
> - `/exa-search "transformer" — content: summary` — LLM-generated summaries
> - `/exa-search "transformer" — domains: arxiv.org,huggingface.co` — domain filter
> - `/exa-search "https://arxiv.org/abs/2301.07041" — similar` — find similar pages

## Setup

Exa requires the `exa-py` SDK and an API key:

```bash
pip install exa-py
```

Set your API key:
```bash
export EXA_API_KEY=your-key-here
```

Get a key from [exa.ai](https://exa.ai).

## Workflow

### Step 1: Parse Arguments

Parse `$ARGUMENTS` for:
- **query**: The search query (required) or a URL (for `find-similar` mode)
- **similar**: If present, use `find-similar` mode instead of search
- **max**: Override MAX_RESULTS
- **category**: `research paper`, `news`, `company`, `personal site`, `financial report`, `people`
- **content**: `highlights` (default), `text`, `summary`, `none`
- **max chars**: Max characters for content extraction
- **type**: Search type — `auto` (default), `neural`, `fast`, `instant`
- **domains**: Comma-separated include domains
- **exclude domains**: Comma-separated exclude domains
- **include text**: Phrase that must appear in results
- **exclude text**: Phrase to exclude from results
- **start date**: ISO 8601 date — only results after this
- **end date**: ISO 8601 date — only results before this
- **location**: Two-letter ISO country code

### Step 2: Locate Script

```bash
# Resolve $EXA_FETCHER via the canonical strict-safe Codex chain.
if [ -z "${ARIS_REPO:-}" ] && [ -f .aris/installed-skills-codex.txt ]; then
    ARIS_REPO=$(awk -F'\t' '$1=="repo_root"{print $2; exit}' .aris/installed-skills-codex.txt 2>/dev/null) || true
fi
EXA_FETCHER=""
[ -n "${ARIS_REPO:-}" ] && [ -f "$ARIS_REPO/tools/exa_search.py" ] && EXA_FETCHER="$ARIS_REPO/tools/exa_search.py"
[ -z "$EXA_FETCHER" ] && [ -f tools/exa_search.py ] && EXA_FETCHER="tools/exa_search.py"
[ -z "$EXA_FETCHER" ] && [ -f ~/.codex/skills/exa-search/exa_search.py ] && EXA_FETCHER="$HOME/.codex/skills/exa-search/exa_search.py"
[ -z "$EXA_FETCHER" ] && {
  echo "ERROR: exa_search.py not resolved at \$ARIS_REPO/tools/, tools/, or ~/.codex/skills/exa-search/." >&2
  echo "       Fix: rerun bash tools/install_aris_codex.sh, export ARIS_REPO, or copy the helper to tools/." >&2
  echo "       Also ensure 'exa-py' is installed: pip install exa-py" >&2
  exit 1
}
```

If not found, tell the user:
```
exa_search.py not found. Run install_aris_codex.sh, set ARIS_REPO to your ARIS repo root, or install/copy the helper into the project/global Codex skill path; then install exa-py:
pip install exa-py
```

### Step 3: Execute Search

**Standard search:**
```bash
python3 "$EXA_FETCHER" search "QUERY" --max 10 --content highlights
```

**With filters:**
```bash
python3 "$EXA_FETCHER" search "QUERY" --max 10 \
  --category "research paper" \
  --start-date 2025-01-01 \
  --content text --max-chars 8000
```

**Find similar pages:**
```bash
python3 "$EXA_FETCHER" find-similar "URL" --max 5 --content highlights
```

**Get content for known URLs:**
```bash
python3 "$EXA_FETCHER" get-contents "URL1" "URL2" --content text
```

### Step 4: Present Results

Format results as a structured table:

```
| # | Title | Authors | Venue/Publisher | URL | Date | Key Content |
|---|-------|---------|-----------------|-----|------|-------------|
```

For each result:
- Show title and URL
- Show published date if available
- Show highlights, text excerpt, or summary depending on content mode
- Flag particularly relevant results
- **For `category: "research paper"` hits only** — also record authors
  (from Exa's `author`/`authors` fields, or fallback: parse from the
  result snippet) and venue/publisher (from `publisher`, `source`, or
  the domain hosting the paper). These are needed by Step 6's wiki
  hook; if either is unavailable for a given hit, skip wiki ingest
  for that one hit and log a note.

### Step 5: Offer Follow-up

After presenting results, suggest:
- **Deepen**: "I can fetch full text for any of these results"
- **Find similar**: "I can find pages similar to any result"
- **Narrow**: "I can re-search with domain/date/text filters"

### Step 6: Update Research Wiki (if active, research-paper results only)

**Required when `research-wiki/` exists AND the search returned
results of `category: "research paper"`**; skip silently otherwise.
General web results (blog posts, docs, news) are **not** ingested —
the wiki is for papers only.

For each research paper hit, try to recover an arXiv ID from the URL
(`arxiv.org/abs/<id>`); if present, use `--arxiv-id`. Otherwise fall
back to manual metadata:

```
if [ -d research-wiki/ ] and query category was "research paper":
    WIKI_SCRIPT=""
    [ -n "$ARIS_REPO" ] && [ -f "$ARIS_REPO/tools/research_wiki.py" ] && WIKI_SCRIPT="$ARIS_REPO/tools/research_wiki.py"
    [ -z "$WIKI_SCRIPT" ] && [ -f tools/research_wiki.py ] && WIKI_SCRIPT="tools/research_wiki.py"
    [ -z "$WIKI_SCRIPT" ] && [ -f ~/.codex/skills/research-wiki/research_wiki.py ] && WIKI_SCRIPT="$HOME/.codex/skills/research-wiki/research_wiki.py"
    for each research-paper hit in results:
        if URL matches arxiv.org/abs/<id>:
            [ -n "$WIKI_SCRIPT" ] && python3 "$WIKI_SCRIPT" ingest_paper research-wiki/ \
                --arxiv-id "<id>"
        else:
            [ -n "$WIKI_SCRIPT" ] && python3 "$WIKI_SCRIPT" ingest_paper research-wiki/ \
                --title "<title>" --authors "<authors joined by , >" \
                --year <year> --venue "<venue or publisher>"
```

The helper handles slug / dedup / page / index / log — **do not
handwrite `papers/<slug>.md`**. See
[`shared-references/integration-contract.md`](../shared-references/integration-contract.md).

## Key Rules
- Always check that `EXA_API_KEY` is set before searching
- Default to `highlights` content mode for a good balance of speed and context
- Use `category: "research paper"` when the user is clearly looking for academic content
- Use `text` content mode when the user needs full page content
- Combine with `/arxiv` or `/semantic-scholar` for comprehensive literature coverage

INPUTS

$ARGUMENTS REQUIRED

search query or URL plus optional filter flags

e.g. RAG pipelines --max 5

REQUIRED CONTEXT

$ARGUMENTS (search query or URL)

OPTIONAL CONTEXT

max
category
content mode
dates
domains
similar flag

ROLES & RULES

Always check that `EXA_API_KEY` is set before searching
Default to `highlights` content mode for a good balance of speed and context
Use `category: "research paper"` when the user is clearly looking for academic content
Use `text` content mode when the user needs full page content
Combine with `/arxiv` or `/semantic-scholar` for comprehensive literature coverage
If not found, tell the user the resolution error message
For `category: "research paper"` hits only — also record authors and venue/publisher

EXPECTED OUTPUT

Format

markdown

Schema

markdown_table · #, Title, Authors, Venue/Publisher, URL, Date, Key Content

Constraints

use structured table for results
include specific columns for research papers
suggest follow-ups after results

SUCCESS CRITERIA

Parse all supported argument filters from $ARGUMENTS
Resolve EXA_FETCHER via the documented three-location chain or emit explicit error
Present every result in the mandated markdown table
Offer deepen/find-similar/narrow follow-ups
Ingest research-paper results into research-wiki/ when the directory exists

FAILURE MODES

May terminate if exa_search.py cannot be located in any of the three paths
May skip wiki ingestion silently for non-paper results or missing arxiv IDs
May produce incomplete author/venue fields when Exa metadata is absent

EXAMPLES

Includes seven override examples for max/category/content/domains/similar flags plus four bash execution examples for search/find-similar/get-contents.

CAVEATS

Dependencies

EXA_API_KEY environment variable
exa_search.py script (via ARIS_REPO/tools/, tools/, or ~/.codex/skills/exa-search/)
research-wiki/ directory (optional, only for paper ingestion)
research_wiki.py script (optional)

Missing context

Exact shell environment assumptions (bash vs zsh, OS paths)
How ARIS_REPO is normally populated if .aris file is absent

Ambiguities

Does not specify the exact action or error message when EXA_API_KEY is unset (only states to check it).
Step 2 error message differs slightly from the inline script error handling.

QUALITY

OVERALL: 0.71
CLARITY: 0.78
SPECIFICITY: 0.92
REUSABILITY: 0.38
COMPLETENESS: 0.85

IMPROVEMENT SUGGESTIONS

Add an explicit early check + error block for missing EXA_API_KEY before any script resolution.
Extract the long path-resolution logic into a single reusable helper function or note to reduce duplication between EXA_FETCHER and WIKI_SCRIPT.
Make the result table columns conditional on category so non-paper results do not show empty Authors/Venue columns.

USAGE

Copy the prompt above and paste it into your AI of choice — Claude, ChatGPT, Gemini, or anywhere else you're working. Replace any placeholder sections with your own context, then ask for the output.