Generate QA datasets & evaluate RAG systems with failure diagnosis. Any LLM.
Generate QA datasets & evaluate RAG systems in 2 commands 🔒 Privacy-First • ⚡ Lightning Fast • 🤖 Any LLM • 🏠 Local or Cloud • 🌍 Multilingual English | 中文 | 日本語 | Deutsch That's it. Get accuracy scores and incorrect QA pairs instantly. Already installed? Keep up to date — new versions add features like failure diagnosis and retrieved context capture: Perfect for Jupyter, Colab, and rapid…
Verification confirms publisher identity (repo ownership), not code safety. The security scan covers known CVEs and suspicious install scripts — it cannot prove the absence of malicious code.
Generate QA datasets & evaluate RAG systems in 2 commands 🔒 Privacy-First • ⚡ Lightning Fast • 🤖 Any LLM • 🏠 Local or Cloud • 🌍 Multilingual English | 中文 | 日本語 | Deutsch That's it. Get accuracy scores and incorrect QA pairs instantly. Already installed? Keep up to date — new versions add features like failure diagnosis and retrieved context capture: Perfect for Jupyter, Colab, and rapid iteration. Get instant visualizations. ragscore generate confidentialdocs/*.pdf…