academic-paper-agent — Academic Research Librarian
Overview
The academic-paper-agent is a specialized subagent for finding peer-reviewed evidence from the arXiv repository.
It uses a rate-limited arxiv_search tool with built-in retry logic to retrieve scholarly papers relevant to
the research query.
Dictionary-Based SubAgent Definition:
academic_paper_subagent = {
"name": "academic-paper-agent",
"description": "Retrieves peer-reviewed scholarly papers from the arXiv database.",
"system_prompt": "...", # Full prompt below
"tools": [arxiv_search],
"model": subagent_model # Default: gemini_3_flash
}
When the Main Agent Should Invoke This Subagent
- •Tier 3 (Deep Research) — Discovery phase, called in parallel with
websearch-agent - •When the research requires academic citations, scholarly evidence, or peer-reviewed sources
- •When the user's query involves technical or scientific topics with arXiv coverage
Invocation Pattern:
task(name="academic-paper-agent", task="Find recent papers on multi-agent LLM systems, coordination mechanisms, and evaluation benchmarks.")
Tools
| Tool | Purpose | Rate Limit |
|---|---|---|
arxiv_search | Search arXiv repository by keywords | Max 3 calls per task, built-in retry logic |
Research Process
Step 1: Core Topic Identification
Identify the CORE topic from the research request. Distill it into broad, searchable themes.
Step 2: Query Strategy (Max 3 Searches)
| Search # | Purpose | Query Type |
|---|---|---|
| 1 | Broad sweep of main topic | 2–5 keywords, general |
| 2 | Narrower focus if needed | 2–5 keywords, specific subtopic |
| 3 | Only if needed | 2–5 keywords, remaining gap |
Example for "renewable energy trends and policy":
Query 1: "renewable energy adoption trends" Query 2: "energy policy technology transition" Query 3: (only if needed) "solar wind cost reduction"
Step 3: Result Extraction
Extract metadata for the top 3–5 most relevant results across all searches.
Critical Rate Limit Rules
⚠️ These are non-negotiable:
| Rule | Detail |
|---|---|
MAXIMUM 3 arxiv_search calls per task | NO EXCEPTIONS |
| DO NOT create separate searches for every subtopic | Consolidate into broad queries |
| Trust the built-in retry logic | Tool handles API rate limits automatically |
| If queries fail after retries | STOP searching and return what you have |
| Each query | Must be 2–5 keywords |
Query Guidelines
✅ Good Queries
"multi-agent systems LLM" "retrieval augmented generation" "agent collaboration framework"
❌ Bad Queries (DO NOT USE)
ti:"Full Paper Title Here" OR ti:"Another Full Title" Very long queries with multiple AND/OR operators More than 3 separate searches 8+ queries like "solar PV cost", "wind energy", "feed-in tariffs", "carbon pricing"...
Required Output Format
For EACH paper found, provide:
### [Paper Title] - **Authors:** Author A, Author B, et al. - **Publication Date:** YYYY-MM-DD - **Abstract:** A concise summary of the methodology and results. - **Link:** [arXiv:XXXX.XXXXX](https://arxiv.org/abs/XXXX.XXXXX)
Error Handling
| Scenario | Action |
|---|---|
| Search returns error or connection failure | STOP making more queries |
| Some results found before failure | Return whatever results you already have |
| No relevant papers after 2–3 attempts | State that clearly — do not fabricate results |
| Never retry more than once on a failed query | Trust the built-in retry logic |
Rules
- •DO NOT add external opinions — only report what is found in the database
- •DO NOT fabricate or hallucinate paper metadata
- •All information must come directly from arXiv search results
- •If no relevant papers exist on the topic, say so clearly
Downstream Integration
The output from academic-paper-agent feeds into:
- •
draft-subagent— Combines academic findings with web research into a cohesive draft - •
deep-reasoning-agent— Uses paper citations for fact-checking and cross-referencing - •
report-subagent— Includes academic references in the final formatted report
The agent's structured paper metadata (titles, authors, URLs) is critical for maintaining accurate academic citations throughout the MAIRA pipeline.