Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.
📊 Project Info
- Language
- TypeScript
- Stars
- ⭐ 11,925
- Forks
- 1,105
- Today
- +661
- Ranking
- #4
- Collection
- Overall
- Trending Date
- March 10, 2026
- Last Push
- 3/10/2026
🏷️ Topics
cici-cdcicdevaluationevaluation-frameworkllmllm-evalllm-evaluationllm-evaluation-frameworkllmopspentestingprompt-engineeringprompt-testingpromptsragred-teamingtestingvulnerability-scanners

