Skip to content
⠀⠶⠀Nodes

Explore

LearnMapResourcesStacks

Updates

Changelog
⠀⠶⠀NodesResources
← Resources

Braintrust

Evals, datasets, and logging for iterating on agent behavior in production-like conditions.

Visit site (opens in new tab)
ToolsharnessstableReviewed 2026-05-19

Hosting

Cloud

Explained in Learn

  • What is an Agent Harness?
  • What is Machine Learning?

Used in stacks

  • Support agent basics

Pairs well with

  • Langfuse

Tags

evalsdatasetslogging