Skip to content

Open source · MIT · Python ≥ 3.10

AIGuard

Evaluation, Testing, and Observability for Large Language Models.

AIGuard helps developers evaluate LLM applications before deployment and continuously monitor them in production through hallucination detection, adversarial testing, human review workflows, and trace analysis.

Install

$pip install aiguard-safety
Test before deployment. Monitor after deployment.

Two stages, one toolkit

Pre-Deployment Evaluation

Run adversarial attack suites and hallucination checks against your model. Enforce thresholds in CI to block unsafe releases.

CI/CD integration
Production Monitoring

Capture traces from every call with the SDK. Continuously score for hallucinations and escalate uncertain cases to human review.

Monitoring dashboard