io.github.ArkNill/docpick

MCPcommunity
v0.1.2io.github.ArkNillUnknownUpdated 3mo agoGitHub

Schema-driven document extraction with local OCR + LLM. Document in, Structured JSON out.

Document in, Structured JSON out. Locally. With your schema. docpick is a lightweight, schema-driven document extraction pipeline that combines local OCR engines with local LLMs to extract structured JSON from any document — invoices, receipts, bills of lading, tax forms, and more. Zero cloud dependency — runs entirely on your machine (CPU or GPU) Custom schemas — define your own Pydantic models…

Automatically indexed from public sources. Not yet verified by the developer on Forge.Claim this listing →
3mo agoLast update
Package
Authorio.github.ArkNill
LicenseUnknown
Version0.1.2
Sourcemcp-registry
Trust Status
B
60/100Good
Listed in Forge index+10/10
Publisher identity verified+0/25
Publisher: run `forge publish` from the package repo to claim ownership
Ed25519 publish signature+0/10
Included automatically when the publisher runs `forge publish`
Domain verification+0/5
Publisher: host /.well-known/forge.json on the package homepage with { "publisher": "<github-login>" }
CVE scan · clean+30/30
Static analysis · clean+20/20
npm provenance (Sigstore)+0/5
Publish from GitHub Actions with the --provenance flag
Paste into Claude Code, Cursor, or any AI assistant to fix all gaps
StatusCommunity-indexed
PublisherUnverified
SignatureUnsigned
Domain
Provenance
DependenciesNot audited
Tool surface
Security scan✓ Cleanv0.1.3 · 19d ago
EvalsNone
IndexedJun 13, 2026

Verification confirms publisher identity (repo ownership), not code safety. The security scan covers known CVEs and suspicious install scripts — it cannot prove the absence of malicious code.

About

Document in, Structured JSON out. Locally. With your schema. docpick is a lightweight, schema-driven document extraction pipeline that combines local OCR engines with local LLMs to extract structured JSON from any document — invoices, receipts, bills of lading, tax forms, and more. Zero cloud dependency — runs entirely on your machine (CPU or GPU) Custom schemas — define your own Pydantic models or use 8 built-in document schemas Validation built-in — checkdigit verification, cross-field rules,…

Keywords
mcp