Skip to main content
Hipocap documentation home page
Search...
⌘K
Contact support
hipocap/hipocap
hipocap/hipocap
Search...
Navigation
Evaluations
Documentation
SDK
Community
Introduction
Introduction
Getting Started
Architecture
Hosting Options
AI Security
AI Security Introduction
Setting up the Shield
Keyword Detection
Prompt Injection Protection
Threat Categories
Governance & RBAC
Governance & RBAC Introduction
Policies
Roles & Permissions
Function Access Control
Function Chaining
Observability
Tracing
Evaluations
Datasets
Platform
Evaluations
Evaluations score traces so you can quantify improvements and catch regressions as models, prompts, and code change.
Introduction
What evals are and when to use them.
Quickstart
Run your first evaluation.
Using a Dataset
Evaluate against curated data.
Online Evaluators
Score production traffic continuously.
Next: turn failures into reusable test data with
Datasets
and
Queues
.
⌘I