Usage Guide
Install, configure, run evaluations, compare models, and use Python API.
Open Usage GuideLLM evaluation, cost monitoring, regression detection, and model comparison.
Use this page as the documentation homepage for the library.
Install, configure, run evaluations, compare models, and use Python API.
Open Usage GuideComplete project reference across architecture, workflows, and operations.
Open Full DocumentationDetailed system design, module boundaries, extension points, and data flow.
Open ArchitectureCreate and test custom provider adapters with implementation guidance.
Open Adapter DevelopmentProject standards, PR process, testing expectations, and setup steps.
Open ContributingCommunity behavior expectations and reporting process.
Open Code of Conductpip install aegis-ai
# evaluate a dataset
aegis eval run --dataset examples/datasets/qa_sample.yaml --model gpt-4
# compare multiple models
aegis compare --dataset examples/datasets/qa_sample.yaml --models gpt-4,gpt-3.5-turbo,claude-3-opus
simple_eval.py · model_compare.py · cost_tracking_demo.py · sentiment_classification.py · qa_evaluation.py