arXiv Source Reader OpenClaw Skill
Read and analyze arXiv papers by fetching LaTeX source, listing sections, or extracting abstracts
Installation
clawhub install arxiv-source
Requires npm i -g clawhub
379
Downloads
0
Stars
5
current installs
5 all-time
6
Versions
arxiv-reader
Read and analyze arXiv papers by fetching their public LaTeX source. Converts LaTeX into clean text suitable for LLM analysis.
Description
This skill fetches arXiv papers from the public arXiv API (arxiv.org), flattens LaTeX includes, and returns clean text. No local file access is required — all content is fetched over HTTPS from arXiv's public endpoints and cached in memory for the session.
Network access: Only connects to arxiv.org and export.arxiv.org to download publicly available paper source tarballs and metadata. No other network connections are made. No data is sent to external services — this is read-only.
Caching: Results are cached in memory (process-scoped) for fast repeat access within the same session. No files are written to disk.
Usage Examples
- "Read the paper 2301.00001 from arXiv"
- "What sections does paper 2405.12345 have?"
- "Get the abstract of 2312.09876"
- "Fetch paper 2301.00001 without the appendix"
Process
- Quick look — Use
arxiv_abstractto get a paper's abstract before committing to a full read - Survey structure — Use
arxiv_sectionsto understand the paper's outline - Deep read — Use
arxiv_fetchto get the full flattened LaTeX for analysis
Tools
arxiv_fetch
Fetch the full flattened LaTeX source of an arXiv paper.
Parameters:
arxiv_id(string, required): arXiv paper ID (e.g.2301.00001or2301.00001v2)remove_comments(boolean, optional): Strip LaTeX comments (default: true)remove_appendix(boolean, optional): Remove appendix sections (default: false)figure_paths(boolean, optional): Replace figures with file paths only (default: false)
Returns: { content: string, arxiv_id: string, cached: boolean }
Example:
{ "arxiv_id": "2301.00001", "remove_appendix": true }
arxiv_sections
List all sections and subsections of an arXiv paper.
Parameters:
arxiv_id(string, required): arXiv paper ID
Returns: { arxiv_id: string, sections: string[] }
Example:
{ "arxiv_id": "2301.00001" }
arxiv_abstract
Extract just the abstract from an arXiv paper.
Parameters:
arxiv_id(string, required): arXiv paper ID
Returns: { arxiv_id: string, abstract: string }
Example:
{ "arxiv_id": "2301.00001" }
Notes
- Results are cached in memory — repeat requests within the same session are instant
- Paper IDs support version suffixes (e.g.
2301.00001v2) - Very large papers may take 10-30 seconds on first fetch
arxiv_abstractuses the public arXiv Atom API for fast metadata retrieval- No filesystem writes — all caching is in-memory only
- Only connects to arxiv.org (read-only, public data)
Statistics
Author
willamhou
@willamhou
Latest Changes
v1.0.5 · Mar 16, 2026
Declare runtime and network access, clarify read-only public API access, in-memory cache only
Quick Install
clawhub install arxiv-source Related Skills
Other popular skills you might find useful.
Chat with 100+ AI Models in one App.
Use Claude, ChatGPT, Gemini alongside with EU-Hosted Models like Deepseek, GLM-5, Kimi K2.5 and many more.