agent research skill risk: low
AlphaXiv arXiv Paper Lookup Workflow
The prompt defines a workflow for parsing arXiv IDs from user arguments and fetching tiered content from AlphaXiv overview, full markdown, or arXiv LaTeX source to produce structur…
- External action: low
SKILL 1 file
SKILL.md
---
name: alphaxiv
description: "Quick single-paper lookup via AlphaXiv LLM-optimized summaries with tiered source fallback. Use when user says \"explain this paper\", \"summarize paper\", pastes an arXiv/AlphaXiv URL, or provides a bare arXiv ID for quick understanding - not for broad literature search."
---
# AlphaXiv Paper Lookup
Lookup paper: $ARGUMENTS
> Quick single-paper reader with tiered source fallback (overview → full markdown → LaTeX source). Powered by [AlphaXiv](https://alphaxiv.org).
## Role & Positioning
This skill is the **quick single-paper reader** that returns LLM-optimized summaries:
| Skill | Source | Best for |
|-------|--------|----------|
| `/arxiv` | arXiv API | Batch search, PDF download, metadata |
| `/deepxiv` | DeepXiv SDK | Progressive section-level reading |
| `/semantic-scholar` | S2 API | Published venue metadata, citation counts |
| **`/alphaxiv`** | **alphaxiv.org** | **Instant LLM-optimized summary of one paper, with LaTeX source fallback** |
**Do NOT use this skill for** topic discovery, broad literature search, or multi-paper surveys — use `/research-lit` or `/arxiv` instead.
## Constants
- **OVERVIEW_URL** = `https://alphaxiv.org/overview/{PAPER_ID}.md`
- **ABS_URL** = `https://alphaxiv.org/abs/{PAPER_ID}.md`
- **ARXIV_SRC_URL** = `https://arxiv.org/src/{PAPER_ID}`
> Overrides (append to arguments):
> - `/alphaxiv 2401.12345` — quick overview
> - `/alphaxiv "https://arxiv.org/abs/2401.12345"` — auto-extract ID
> - `/alphaxiv 2401.12345 - depth: src` — force LaTeX source inspection
> - `/alphaxiv 2401.12345 - depth: abs` — force full markdown
## Workflow
### Step 1: Parse Arguments & Extract Paper ID
Parse `$ARGUMENTS` to extract a bare arXiv paper ID. Accept these input formats:
- `https://arxiv.org/abs/2401.12345` or `https://arxiv.org/abs/2401.12345v2`
- `https://arxiv.org/pdf/2401.12345`
- `https://alphaxiv.org/overview/2401.12345`
- `https://alphaxiv.org/abs/2401.12345`
- `2401.12345` or `2401.12345v2`
Strip version suffixes (`v1`, `v2`, ...) for API calls. Store as `PAPER_ID`.
Parse optional directives:
- **`- depth: overview|abs|src`**: force a specific tier instead of cascading
### Step 2: Fetch AlphaXiv Overview (Tier 1 — Fastest)
Fetch the structured overview from `https://alphaxiv.org/overview/{PAPER_ID}.md`.
This returns a **structured, LLM-optimized report** designed for machine consumption. Use this as the default and preferred source.
If the overview answers the user's question, **stop here**. Do not fetch deeper tiers unnecessarily.
If the request fails (HTTP 404 — paper not yet processed) or the content is insufficient, proceed to Step 3.
### Step 3: Fetch Full AlphaXiv Markdown (Tier 2 — More Detail)
Fetch the full paper markdown from `https://alphaxiv.org/abs/{PAPER_ID}.md`.
This provides the full paper body as markdown. Use when the user needs:
- Specific methodology details
- Detailed experimental results
- Particular sections not covered in the overview
If this still does not answer the question, proceed to Step 4.
### Step 4: Fetch arXiv LaTeX Source (Tier 3 — Deepest)
When the overview and full markdown are both insufficient (e.g., the user asks about equations, proofs, appendix details, or implementation specifics), download the paper's LaTeX source from `https://arxiv.org/src/{PAPER_ID}`.
The source is a `.tar.gz` archive. Download it to a temporary directory, extract it, and list the `.tex` files inside.
Then inspect **only** the files needed to answer the question. Prioritize:
1. Top-level `*.tex` files (usually the main document)
2. Files referenced by `\input{}` or `\include{}`
3. Appendices, tables, or sections directly related to the user's question
**Do NOT read the entire source tree by default.** Read selectively.
Temporary source artifacts live under `/tmp`. Do not rely on persistence.
### Step 5: Present Results
#### Default Answer Shape
```markdown
## [Paper Title]
- **arXiv**: [PAPER_ID] — https://arxiv.org/abs/[PAPER_ID]
- **Source depth**: overview | abs | src
### Summary
[2-3 sentence summary]
### Key Points
- [point 1]
- [point 2]
- [point 3]
### Answer to Your Question
[Direct answer if the user asked a specific question]
```
If the user only asks for one specific detail, answer it directly — skip the full template.
#### Suggest Follow-Up Skills
```text
/arxiv "PAPER_ID" - download - download the PDF to local library
/deepxiv "PAPER_ID" - section: Methods - read a specific section progressively
/research-lit "related topic" - multi-source literature survey
/novelty-check "idea from paper" - verify novelty against this paper's area
```
## Update Research Wiki (if active)
**Required when `research-wiki/` exists in the project**; skip silently
otherwise. After presenting the paper summary, ingest the single paper
that was read:
```
if [ -d research-wiki/ ]:
ARIS_REPO="${ARIS_REPO:-$(awk -F'\t' '$1=="repo_root"{print $2; exit}' .aris/installed-skills-codex.txt 2>/dev/null)}"
WIKI_SCRIPT=""
[ -n "$ARIS_REPO" ] && [ -f "$ARIS_REPO/tools/research_wiki.py" ] && WIKI_SCRIPT="$ARIS_REPO/tools/research_wiki.py"
[ -z "$WIKI_SCRIPT" ] && [ -f tools/research_wiki.py ] && WIKI_SCRIPT="tools/research_wiki.py"
[ -z "$WIKI_SCRIPT" ] && [ -f ~/.codex/skills/research-wiki/research_wiki.py ] && WIKI_SCRIPT="$HOME/.codex/skills/research-wiki/research_wiki.py"
[ -n "$WIKI_SCRIPT" ] && python3 "$WIKI_SCRIPT" ingest_paper research-wiki/ \
--arxiv-id "<paper_arxiv_id>" \
[--thesis "<one-line thesis from the Tier 1 overview>"]
```
The helper handles metadata fetch, slug, dedup, page creation, index
rebuild, and log append — **do not handwrite `papers/<slug>.md`**. See
[`shared-references/integration-contract.md`](../shared-references/integration-contract.md).
If wiki was not present at read time, the user can backfill via
`python3 "$WIKI_SCRIPT" sync research-wiki/ --arxiv-ids <id>` after resolving `WIKI_SCRIPT` as above.
## Key Rules
- **Overview first**: `overview` is the fastest path and must always be tried before deeper tiers. Only escalate when needed.
- **Minimal reads**: At `src` tier, read only the files that answer the question. Full-tree reads waste tokens.
- **Cross-platform**: When downloading and extracting the source archive, prefer cross-platform approaches (e.g., Python stdlib) over platform-specific commands to ensure Windows/WSL compatibility.
- **No PDF parsing**: This skill reads structured markdown and LaTeX source, not raw PDFs. For PDF content, suggest `/arxiv` with download.
- **Rate limiting**: arXiv source download may rate-limit. If HTTP 429 occurs, wait 5 seconds and retry once. If still blocked, report the error and suggest `/deepxiv` as alternative.
- **Complementary, not competing**: This skill complements `/arxiv` (search + download) and `/deepxiv` (progressive reading). Do not re-implement their functionality.
## Integration with Other Skills
### As enrichment in `/research-lit`
`/research-lit` can use this skill's Tier 1 (overview) as a fast enrichment step between search and deep analysis. After finding arXiv papers in Step 1, fetch AlphaXiv overviews to quickly assess relevance before committing to full-text reads:
```
Step 1: Search → list of arXiv IDs
Step 1.5: AlphaXiv overview for top 5-8 papers (this skill, Tier 1 only)
Step 2: Deep analysis only for papers that pass the relevance filter
```
This saves significant tokens by filtering out marginally relevant papers before deep reading.
### As follow-up from other skills
After `/research-lit`, `/novelty-check`, or `/idea-discovery` surface a specific paper, users can invoke `/alphaxiv PAPER_ID` for a fast deep-dive without re-running the full survey.
INPUTS
- ARGUMENTS REQUIRED
user-supplied paper identifier or URL plus optional depth flag
e.g. 2401.12345 - depth: src
- PAPER_ID REQUIRED
extracted bare arXiv ID without version suffix
e.g. 2401.12345
REQUIRED CONTEXT
- $ARGUMENTS containing paper ID or arXiv/AlphaXiv URL
OPTIONAL CONTEXT
- - depth: overview|abs|src directive
TOOLS REQUIRED
- web_fetch
- file_download
- shell_execute
ROLES & RULES
- Do NOT use this skill for topic discovery, broad literature search, or multi-paper surveys
- Strip version suffixes for API calls
- If the overview answers the user's question, stop here
- Do NOT read the entire source tree by default
- Do NOT rely on persistence
- Overview first: always try overview before deeper tiers
- Minimal reads: at src tier read only files that answer the question
- Prefer cross-platform approaches for source extraction
- No PDF parsing
- If HTTP 429 occurs, wait 5 seconds and retry once
- Do not re-implement functionality of /arxiv or /deepxiv
- do not handwrite papers/<slug>.md
EXPECTED OUTPUT
- Format
- markdown
- Schema
- markdown_sections · Paper Title, arXiv, Source depth, Summary, Key Points, Answer to Your Question, Suggest Follow-Up Skills
- Constraints
- use default answer shape with title, arXiv link, source depth, summary, key points, and optional direct answer
- suggest follow-up skills after results
- conditionally run wiki ingestion script if research-wiki/ exists
SUCCESS CRITERIA
- Extract paper ID from any supported URL or bare ID format
- Always attempt overview tier first
- Escalate tiers only when needed
- Read LaTeX source selectively
- Present results in the default answer shape unless user asks for one specific detail
- Suggest follow-up skills after answer
FAILURE MODES
- May fetch deeper tiers unnecessarily
- May read too many LaTeX files
- May fail to locate WIKI_SCRIPT when research-wiki is present
EXAMPLES
Includes input format examples, override directives with depth flags, default markdown answer template, follow-up skill suggestions, and wiki ingestion code snippet.
CAVEATS
- Dependencies
- $ARGUMENTS
- research-wiki/ directory (optional)
- ARIS_REPO or local research_wiki.py script (conditional)
- Missing context
- Target runtime environment or CLI framework assumed for $ARGUMENTS parsing
- Definition of research-wiki/ directory structure and expected thesis format
- Ambiguities
- Does not specify exact criteria for determining when overview or markdown content is 'insufficient' to escalate tiers.
- Wiki integration script discovery logic is complex and may fail silently under unstated conditions.
QUALITY
- OVERALL
- 0.83
- CLARITY
- 0.78
- SPECIFICITY
- 0.88
- REUSABILITY
- 0.82
- COMPLETENESS
- 0.85
IMPROVEMENT SUGGESTIONS
- Add a short 'When to escalate' decision table with concrete signals (e.g., missing sections, equation references).
- Extract the three URL templates into a single configurable constants block at the top for easier maintenance.
USAGE
Copy the prompt above and paste it into your AI of choice — Claude, ChatGPT, Gemini, or anywhere else you're working. Replace any placeholder sections with your own context, then ask for the output.
MORE FOR AGENT
- Creative Thinking Frameworks for CS Researchagentresearch
- Academic Paper Figure Generatoragentresearch
- Deep Investigation Agent for Geopolitics Researchagentresearch
- Customer Research Analyst and Synthesizeragentresearch
- Gemini Research Paper Literature Searchagentresearch
- Research Formula Derivation Package Builderagentresearch
- Research Session Provenance Recorderagentresearch
- BIDS Neuroscience Data Organizeragentresearch
- Research Experiment Plan Roadmap Builderagentresearch
- ARA Research Artifact Compileragentresearch
- Research Proposal Experiment Roadmap Generatoragentresearch
- ML AI Theorem Proof Package Writeragentresearch
- Research Formula Derivation Package Builderagentresearch
- Scientific ML Catalog Assistantagentresearch
- OpenMM MDAnalysis Molecular Dynamics Workflowagentresearch
- Publication-Quality Paper Figure Generatoragentresearch
- ML Research Idea Generator and Rankeragentresearch
- ML Paper Figure and Table Generatoragentresearch
- Competitor Profiling Intelligence Analystagentresearch
- Research Method Novelty Checkeragentresearch
- Research Refine and Experiment Planning Pipelineagentresearch
- ML Ablation Study Planneragentresearch
- Research Agent Validation Best Practicesagentresearch
- AlphaXiv Single-Paper Lookup and Summarizeragentresearch
- End-to-End Autonomous Research Pipelineagentresearch