ScholarAIO is designed to work best through an AI coding agent, but the CLI remains useful for scripting, inspection, and quick queries.
The authoritative source is always:
scholaraio --help
scholaraio <command> --help

The command groups below are aligned with the current codebase.
scholaraio setup
scholaraio setup check
scholaraio setup agent
scholaraio setup agent --apply
scholaraio setup agent check
- `setup` runs the interactive installation and configuration wizard.
- `setup check` reports dependency, parser, API-key, optional service, and runtime directory status.
- `setup agent` previews cross-project agent integration for supported coding agents. It covers shell runtime wiring, Codex/OpenClaw skill discovery, project-local wrappers for supported hosts, and Claude Code plugin instructions.
- `setup agent --apply` performs the automatic steps. Restart the target agent session afterward so it reloads newly registered skills and wrapper files.
- `setup agent check` reports the current agent-integration state without changing files.
- `--target-project` writes project-local wrappers with absolute local paths. Review the managed block before committing those files to a shared repository.
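The check, preview, apply, re-check order described above can be scripted. The following is a minimal sketch: the `run` helper and the `RUN` variable are hypothetical conveniences, not part of ScholarAIO, and by default the script only prints the commands it would execute.

```shell
#!/bin/sh
set -eu

# Hypothetical helper (not part of ScholarAIO): print each command,
# and execute it only when RUN=1 is set in the environment.
run() {
  printf '+ %s\n' "$*"
  if [ "${RUN:-0}" = "1" ]; then
    "$@"
  fi
}

# Check the environment, preview the agent integration, apply it,
# then confirm the resulting state without changing files.
agent_setup() {
  run scholaraio setup check
  run scholaraio setup agent
  run scholaraio setup agent --apply
  run scholaraio setup agent check
}

agent_setup
```

Run it once as-is to review the plan, then again with `RUN=1` to execute it. Restart the target agent session after the apply step.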
scholaraio index
scholaraio index --chunks
scholaraio search
scholaraio search --chunk
scholaraio search-author
scholaraio show
scholaraio embed
scholaraio vsearch
scholaraio usearch
scholaraio fsearch
scholaraio top-cited
- `search` performs paper-level keyword search.
- `index --chunks` builds a line-addressable evidence chunk index from `paper.md` and `meta.json["toc"]`.
- `search --chunk` searches evidence chunks and returns the source paper, section, line range, and snippet. It supports the normal `search` filters such as `--year`, `--journal`, and `--type`. This is evidence retrieval, not a knowledge graph.
- `vsearch` performs semantic vector search.
- `usearch` performs fused keyword + semantic retrieval.
- `fsearch` searches across the main library, proceedings, explore databases, and arXiv.
- `show` supports layered reading from metadata to full text.
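As a rough illustration of chunk-level evidence retrieval, the sketch below builds the chunk index and then runs a filtered chunk search. It assumes that `search` takes the query as a positional argument and that `--year` accepts a single year value; neither is confirmed here, so check `scholaraio search --help`. The `run` helper, the `RUN` variable, and the sample query are hypothetical.

```shell
#!/bin/sh
set -eu

# Hypothetical helper (not part of ScholarAIO): print each command,
# and execute it only when RUN=1 is set in the environment.
# The printed form is for display only; it does not preserve quoting.
run() {
  printf '+ %s\n' "$*"
  if [ "${RUN:-0}" = "1" ]; then
    "$@"
  fi
}

# Build the chunk index, then search evidence chunks with a year filter.
# Assumption: positional query and a single-year --year value.
chunk_search() {
  query=$1
  year=$2
  run scholaraio index --chunks
  run scholaraio search "$query" --chunk --year "$year"
}

chunk_search "turbulence closure" 2023
```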
scholaraio pipeline [preset]
scholaraio ingest-link <url> [<url> ...]
scholaraio websearch <query> [--count N]
scholaraio webextract <url> [--pdf] [--full] [--max-chars N]
scholaraio paper2any setup [--install-runtime]
scholaraio paper2any mcp-serve
scholaraio paper2any backend-serve
scholaraio paper2any status|tools|call
scholaraio patent-search <query> [--count N]
scholaraio patent-fetch <publication-number-or-url>
scholaraio enrich-toc
scholaraio enrich-l3
scholaraio backfill-abstract
scholaraio refetch
scholaraio translate
scholaraio attach-pdf [--dry-run] [--force]
scholaraio fetch-pdf <doi-or-url-or-title> [--direct] [--out-dir <dir>] [--ingest]
scholaraio fetch-pdf --paper <paper-id> [<paper-id> ...] [--direct] [--force]
scholaraio fetch-pdf --all [--direct] [--force]
- `pipeline` is the main composable ingest entrypoint.
- `ingest-link` pulls one or more rendered web URLs or online PDFs through an external `qt-web-extractor` service and routes them into the existing document ingest flow.
- `fetch-pdf` downloads publisher PDFs through the current user network and access context. It does not bypass access controls; use `--direct` to ignore proxy environment variables (for example, ones set by Clash) when the campus network itself has access.
- `fetch-pdf --ingest` sends only the fetched PDF into the ingest pipeline. Without `--out-dir`, the PDF is staged temporarily and is not left in the configured inbox; use `--out-dir` to keep a separate downloaded copy. If `--out-dir` is supplied, the PDF is saved there but ingested through an isolated temporary single-file inbox, so unrelated PDFs in that directory are not processed.
- `websearch` performs live web search through an external `GUILessBingSearch` service; prefer `websearch.transport: mcp` with the `search_bing` tool when available, while the legacy HTTP `/search` transport remains supported.
- `webextract` extracts rendered web content through `qt-web-extractor`; prefer `webextract.transport: mcp` with the `fetch_url` tool for agent workflows, while the legacy HTTP `/extract` transport remains supported. By default it prints a preview, and `--full` expands to the full body.
- `paper2any` starts and calls the lightweight MCP sidecar for an external OpenDCAI/Paper2Any checkout. Use it for real Paper2Any paper-to-figure, PPT, poster, video, citation, rebuttal, DrawIO, mindmap, PDF-to-PPT, image-to-PPT, and KB workflows without vendoring Paper2Any into ScholarAIO.
- `patent-search` discovers patent candidates through USPTO PPUBS by default, with optional ODP API support.
- `patent-fetch` downloads a patent PDF into the configured patent inbox for the normal patent ingest flow.
- `refetch` refreshes citation counts, bibliographic metadata, and structured `references` for already ingested papers.
- `refetch --references-only` / `--refs-only` limits the run to DOI papers whose `references` field is still empty; in single-paper mode it only updates `references`.
- `attach-pdf` attaches a source PDF to an existing paper directory, stores it beside `paper.md` using the paper directory stem, and regenerates Markdown. It refuses to replace an existing canonical PDF unless `--force` is supplied.
- `fetch-pdf --paper <id> [<id> ...]` re-downloads canonical PDFs for selected existing library papers using `source_url` or DOI; `fetch-pdf --all` applies the same logic to the whole library and reports downloaded/skipped/failed counts. Refetching PDFs does not regenerate `paper.md`; use `attach-pdf` or the ingest conversion path when Markdown needs to be rebuilt.
- Current preset values are `full`, `ingest`, `enrich`, and `reindex`.
- Run `scholaraio pipeline --help` for pipeline options such as `--steps`, `--dry-run`, `--no-api`, and `--rebuild`.
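The two `fetch-pdf --ingest` modes described above (temporary staging versus keeping a copy with `--out-dir`) can be wrapped in one small function. This is a sketch only: the `run` helper, the `RUN` variable, the DOI `10.1000/example`, and the `./downloads` directory are hypothetical placeholders, and nothing is executed unless `RUN=1` is set.

```shell
#!/bin/sh
set -eu

# Hypothetical helper (not part of ScholarAIO): print each command,
# and execute it only when RUN=1 is set in the environment.
run() {
  printf '+ %s\n' "$*"
  if [ "${RUN:-0}" = "1" ]; then
    "$@"
  fi
}

# Fetch one PDF and ingest it.
#   $1: DOI, URL, or title
#   $2: optional directory in which to keep a downloaded copy
fetch_and_ingest() {
  target=$1
  out_dir=${2:-}
  if [ -n "$out_dir" ]; then
    # Keep a copy in $out_dir; only the fetched file is ingested.
    run scholaraio fetch-pdf "$target" --out-dir "$out_dir" --ingest
  else
    # Stage the PDF temporarily; no copy is kept.
    run scholaraio fetch-pdf "$target" --ingest
  fi
}

fetch_and_ingest "10.1000/example"
fetch_and_ingest "10.1000/example" "./downloads"
```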
scholaraio refs
scholaraio citing
scholaraio shared-refs
scholaraio topics
scholaraio explore
- Use `refs`, `citing`, and `shared-refs` for citation-graph analysis.
- Use `topics` for BERTopic-based topic modeling and exploration.
- Use `explore` for OpenAlex-backed literature exploration outside the main library.
scholaraio import-endnote
scholaraio import-zotero
scholaraio export
scholaraio publish-site
scholaraio gui
scholaraio ws
scholaraio migrate status
scholaraio migrate upgrade --migration-id <id> --confirm
scholaraio migrate verify --migration-id <id>
scholaraio migrate finalize --migration-id <id> --confirm
- `migrate upgrade` is the one-command path from supported legacy layout roots to the current fresh layout. It runs needed store moves, verification, cleanup archival, and final verification in one journal.
- Supported legacy signals are the layout-version-0 / implicit pre-cleanup roots: `data/papers/`, `data/citation_styles/`, `data/toolref/`, `data/explore/`, `data/proceedings/`, `data/inbox*`, `data/pending/`, `workspace/<name>/papers.json`, and legacy workspace outputs.
- Empty legacy roots are cleanup candidates too, so finalized upgrades do not leave historical empty directories behind.
- `migrate finalize` remains available when a user or operator has already run store-level migration steps manually and only needs final cleanup and verification.
- `import-endnote` and `import-zotero` bring existing libraries into ScholarAIO.
- `export` handles BibTeX, RIS, Markdown, and DOCX export.
- `publish-site` generates a static site from audited `published/*/metadata.json` archives, copying PDF/source assets by default and supporting `--symlink` for local preview.
- `gui` starts a local read-only WebUI for browsing the main paper library and proceedings child papers with live refresh, audit status, Markdown-rendered abstracts/conclusions, and local PDF preview. The WebUI serves only packaged local assets and does not load remote runtime scripts.
- `ws` manages paper subsets for focused projects and writing workflows.
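For the `migrate` commands listed above, one plausible sequence is to inspect the status, run the one-command upgrade, and then re-verify. The sketch below assumes the operator already has a migration id; how ids are chosen or reported is not covered here, so `example-migration-id` is a placeholder. The `run` helper and the `RUN` variable are hypothetical, and the script only prints commands unless `RUN=1` is set.

```shell
#!/bin/sh
set -eu

# Hypothetical helper (not part of ScholarAIO): print each command,
# and execute it only when RUN=1 is set in the environment.
run() {
  printf '+ %s\n' "$*"
  if [ "${RUN:-0}" = "1" ]; then
    "$@"
  fi
}

# Inspect the current layout, upgrade under one migration id,
# then verify that same migration.
migrate_layout() {
  migration_id=$1
  run scholaraio migrate status
  run scholaraio migrate upgrade --migration-id "$migration_id" --confirm
  run scholaraio migrate verify --migration-id "$migration_id"
}

migrate_layout "example-migration-id"
```

Keeping the default print-only mode for a first pass is a cheap way to review the exact commands before anything touches the library layout.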
scholaraio toolref
scholaraio arxiv
scholaraio document
scholaraio diagram
scholaraio style
scholaraio backup
- `toolref` provides versioned scientific tool documentation lookup.
- Current `toolref` subcommands are `fetch`, `show`, `search`, `list`, and `use`.
- `arxiv` supports arXiv search and PDF fetch.
- `document` provides Office-document utilities such as inspection.
- `diagram` generates editable scientific diagrams from paper content or structured text. See the Graphviz Diagram Guide for DOT/SVG workflows.
- `style` manages citation styles.
- `backup` lists configured rsync targets and runs a named backup plan.
- `backup run` is intentionally non-interactive: SSH is launched with `BatchMode=yes`, so key-based auth and host trust must already be prepared.
- If a target stores `password` in `config.local.yaml`, ScholarAIO switches to an internal non-interactive askpass path instead of waiting for a terminal prompt.
- A good first-run sequence is `ssh-keyscan ... >> ~/.ssh/known_hosts`, then `ssh -i <key> -p <port> <user>@<host> true`, then `scholaraio backup run <target> --dry-run`.
scholaraio audit
scholaraio repair
scholaraio rename
scholaraio setup
scholaraio insights
scholaraio metrics
scholaraio proceedings
scholaraio citation-check
- `audit` checks missing metadata, duplicate DOIs, filename issues, and title/content mismatches.
- `audit` uses paper-type-aware skips so documents, patents, dissertations, and similar front matter do not create spurious `title_mismatch` warnings.
- `setup` is the environment check and setup wizard entrypoint.
- `insights` analyzes research behavior such as hot keywords and reading trends.
- `metrics` shows LLM token and runtime usage.
- `proceedings` provides dedicated proceedings helpers.
- `citation-check` verifies whether citations in text are backed by the local library.
Use the agent for the full workflow, and fall back to CLI commands when you want:
- fast scripted access
- a precise diagnostic check
- direct inspection of intermediate results
- reproducible command-line automation