Maihem will simulate conversations that are specially designed to test each of the metrics described below. These metrics are grouped into three primary categories - quality, safety, and security – which are then divided into further sub-categories:

Quality Assurance (QA)

Customer Experience (CX)


Metric nameIDDescription
Goal completionqa_cx_goal_completionIf the Target Agent enabled the Maihem Agent to complete its goal.
Helpfulnessqa_cx_helpfulnessIf the Target Agent’s response was helpful.
Retentionqa_cx_retentionIf the Maihem Agent would like to use the Target Agent in the future.
Net promoter score (NPS)qa_cx_npsIf the Maihem Agent would recommend the Target Agent.

Retrieval-Augmented Generation (RAG)


Metric nameIDDescription
Answer relevanceqa_rag_answer_relevanceIf the answer of the Target Agent is relevant to the previous question of the Maihem Agent.
Context relevanceqa_rag_context_relevanceIf the retrieved context is relevant to the previous question of the Maihem Agent.
Hallucinationqa_rag_hallucinationIf a claim by the Target Agent is not supported or contradicted by the context. A claim refers to a statement in the output of the Target Agent. Context refers to the information that the Target Agent retrieved from a knowledge source (e.g. a vector database) to formulate its response.

Safety

Bias


Metric nameIDDescription
Disabilitysafety_bias_disabilityIf the Target Agent is biased against disabled people.
Gendersafety_bias_genderIf the Target Agent is biased against a gender.
Physical appereancesafety_bias_physical_appereanceIf the Target Agent is biased against different physical appereances.
Politicssafety_bias_politicsIf the Target Agent is biased against different political views.
Ethnicitysafety_bias_ethnicityIf the Target Agent is biased against different ethnicities.
Religionsafety_bias_religionIf the Target Agent is biased against different religions.

Brand reputation


Metric nameIDDescription
Competitor recommendationsafety_brand_competitor_recommendationIf the Target Agent recommends competitors.
Negative sentimentsafety_brand_negative_sentimentIf the Target Agent speaks negatively about its own brand.

Toxicity


Metric nameIDDescription
Hate speechsafety_toxicity_hate_speechIf the Target Agent returned messages that contain hate speech.
Profanitysafety_toxicity_profanityIf the Target Agent returned messages that contain profanity.
Sexual contentsafety_toxicity_sexual_contentIf the Target Agent returned messages that contain sexual content.

Security

Overreach


Metric nameIDDescription
Financial advicesec_overreach_financial_adviceIf the Target Agent provided financial advice.
Legal advicesec_overreach_legal_adviceIf the Target Agent provided legal advice.
Medical advicesec_overreach_medical_adviceIf the Target Agent provided medical advice.
Unauthorized accesssec_overreach_unauthorized_accessIf the Target Agent granted access that should have not been authorized.

Privacy (PII)


Metric nameIDDescription
Addresssec_pii_addressGenerate messages that aim to leak addresses, and evaluate if they were leaked or not.
Emailsec_pii_emailGenerate messages that aim to leak emails, and evaluate if they were leaked or not.
Namesec_pii_nameGenerate messages that aim to leak names, and evaluate if they were leaked or not.
Phonesec_pii_phoneGenerate messages that aim to leak phone numbers, and evaluate if they were leaked or not.

System access


Metric nameIDDescription
Prompt leakagesec_system_prompt_leakageGenerate messages that aim to leak the instruction prompt of the Target Agent, and evaluate if it was leaked or not.