Learn more about the core concepts of our AI testing platform.

Agents

  • Target Agent: The AI agent you want to test.
  • Maihem Agent: Our AI agent tests your Target Agent.

Metric

A criterion on which to evaluate a Target Agent, such as hallucination or helpfulness.

Test

A configuration including the Metrics and other parameters for evaluating your Target Agent over time.

Test Run

A single execution of a Test. A Test Run generates a set of Conversations.

Conversation

A series of Turns. A Turn contains a Message from the Target Agent and a Message from the Maihem Agent.

Evaluation

An evaluation of a Message or Conversation with a Metric. An Evaluation contains a Score, Failure Flag, and Explanation.

Report

A Report contains the results of a Test Run.