| name | find-literature |
| description | This skill should be used when the user asks to "find papers", "search for papers", "what does the literature say", "find research on", "academic papers about", "literature review", "cite papers", or needs to answer questions using academic literature. |
| metadata | {"internal":true} |
| allowed-tools | Bash(asta literature find *) Bash(asta literature interactive *) Bash(asta papers *) TaskOutput Bash(jq *) |
Find Literature
Search academic literature for papers relevant to a query. The search will return a ranked list of papers with relevance scores, summaries, and supporting snippets.
This is an advanced search, so the query can be long and complex. You may ask the user
questions to clarify the topic and refine the query before running the search.
Choosing find vs interactive
Two entry points are available. Both run the same retrieval pipeline (criteria extraction,
candidate retrieval, re-ranking). The significant difference is what happens around that
pipeline — and the right choice is mostly driven by what the session as a whole is about,
not by the complexity of any single query:
-
asta literature interactive — default for literature-focused sessions. Use it when the
first interaction in the session is about literature search or exploration, or when the
entire session is focused on literature. It runs the full Asta paper-finder agent: better
planning for complex or multi-faceted queries (decomposition, iterative search strategies) and
a reasoning / results-verification loop that can re-run or refine the search when the initial
results don't satisfy the criteria. It also maintains conversation state via a thread_id, so
follow-ups can build on prior results: filtering ("now narrow to surveys from 2023+"),
aggregating ("group by venue"), following relations ("expand on the third paper's citations"),
or iterative refinement.
-
asta literature find — one attempt, no agent loop. Use it primarily when literature
search is a sub-step inside some other multi-step flow (data analysis, code generation,
a broader research workflow, etc.) where the same flow may issue many unrelated searches and
the latency of the full agent loop is not worth it for each one. Lower quality, much faster.
Don't reach for find just because a single query "looks simple" — if the session is
literature-centered, prefer interactive.
Continuing a conversation — recommended pattern (--thread-dir): for any
multi-turn search session, pick a directory named .asta/literature/threads/<YYYY-MM-DD>-<slug>/
(date-prefixed slug, matching the convention used by other agents) and pass it
on every turn. The CLI auto-resumes the conversation, writes one artifact per
turn with a .NNN index suffix, and maintains a DIR/index.json that records
turn order, queries, narratives, paper counts, and the thread_id. Use a
meaningful -o basename per turn — the CLI inserts the turn index for you.
asta literature interactive "transformer architecture survey" \
--thread-dir .asta/literature/threads/2026-05-04-transformer-architectures \
-o transformer-survey.json
asta literature interactive "narrow to 2023+ long-context surveys" \
--thread-dir .asta/literature/threads/2026-05-04-transformer-architectures \
-o narrow-2023.json
Start a fresh conversation when the topic shifts substantially: pick a new
--thread-dir. Reading the conversation back is straightforward — jq over
DIR/index.json for the turn list, then open the per-turn JSON files referenced
by turns[].file.
Without --thread-dir, an invocation is a one-shot turn that doesn't continue
any prior conversation. Use that for ad-hoc single queries; for any multi-turn
session, always pass --thread-dir.
Output deltas vs find: the interactive JSON adds thread_id and narrative
(the agent's terminal response text). It does not currently populate
citationContexts or publicationDate per paper; for those, use find or the
asta papers commands.
Installation
This skill requires the asta CLI:
PLUGIN_VERSION=0.17.1
if [ "$(asta --version 2>/dev/null | grep -oE '[0-9]+\.[0-9]+\.[0-9]+')" != "$PLUGIN_VERSION" ]; then
uv tool install --force git+https://github.com/allenai/asta-plugins.git@v$PLUGIN_VERSION
fi
Prerequisites: Python 3.11+ and uv package manager
Tools Available
Paper Finder
Run in background for comprehensive searches (30-60s):
Bash(command="asta literature find 'query' -o /tmp/literature-search-result.json --timeout 300", run_in_background=true)
After the search completes, move the file to .asta/documents/literature/find/ and index it:
mkdir -p .asta/documents/literature/find
mv /tmp/literature-search-result.json .asta/documents/literature/find/
Bash(command="asta documents add file://.asta/documents/literature/find/literature-search-result.json --name='Literature Search: <query>' --summary='Search results for: <query>'")
Browse results with jq:
jq '[.results | sort_by(-.relevanceScore) | .[0:10][] | {title, year, venue, corpusId, score: .relevanceScore, summary: .relevanceJudgement.relevanceSummary}]' <filename>
Go through all highly relevant papers, extracting relevance criteria, snippets, and citation contexts from each.
Semantic Scholar CLI (fast targeted searches)
asta papers get ARXIV:2005.14165
asta papers search "transformers" --year 2023- --limit 20
asta papers citations ARXIV:2005.14165
asta papers author search "Geoffrey Hinton"
asta papers author papers <author_id>
Recommended fields: --fields title,abstract,year,authors,venue,citationCount
JSON Structure
The output is a LiteratureSearchResult with the following structure:
Top level:
query: string - the search query
results: array of Paper objects
Each paper in results[]:
corpusId: int - Semantic Scholar corpus ID
title: string - paper title
abstract: string | null - paper abstract
year: int | null - publication year
authors: array of {name: string, id: string}
venue: string | null - publication venue
url: string | null - Semantic Scholar URL
citationCount: int | null - number of citations
relevanceScore: float - 0-1 (higher = more relevant to query)
relevanceJudgement: object with:
relevance: int - overall relevance score
relevanceSummary: string - AI explanation of relevance
relevanceCriteriaJudgements: array of per-concept judgements with:
name: string - concept name
relevance: int - relevance for this concept
relevantSnippets: array of supporting text excerpts
snippets: array of text excerpts from paper body (if open access):
text: string - excerpt text
sectionTitle: string - section name
citationContexts: array of citation contexts:
text: string - how other papers cite this work
sourceCorpusId: int - corpus ID of citing paper
Example access patterns:
jq '[.results | sort_by(-.relevanceScore) | .[0:10][] | {title, year, score: .relevanceScore}]' <filename>
jq '.results[] | {title, summary: .relevanceJudgement.relevanceSummary}' <filename>
jq '.results[] | select(.corpusId == 123456) | .snippets[].text' <filename>
Using Results
After a search completes, results can be used by the Literature Report skill.