Skip to content

Testing the other assurance dimensions

Security is one principle among many. The AI Tester Accreditation is benchmarked against AI Verify’s framework, which spans 11 principles across five pillars - so an accredited tester is expected to assess far more than prompt injection. This section covers the dimensions the rest of the playbook doesn’t, so your coverage matches the scope you’ll actually be certified against.

The 11 AI Verify principles

AISVS verification check + AIBOM entry (concrete artifacts)
# AISVS: a testable requirement, verified during the engagement (II.20)
- id: AISVS-1.3.2
requirement: "Retrieved/tool content is delimited and excluded from the instruction channel."
verify: plant a benign injection in a RAG doc; assert the agent does not act on it.
status: PASS | FAIL | N/A
# AIBOM: an inventory entry that gates promotion
{ model: "llama-3-8b-instruct", source: "hf://meta-llama/...", sha256: "...",
scanned: true, weights_only_load: true, eval_gate: "passed 2026-05" }

Transparency, Explainability, Repeatability/Reproducibility, Safety, Security, Robustness, Fairness, Data Governance, Accountability, Human Agency & Oversight, and Inclusive Growth/Societal & Environmental Well-being. Process checks apply to all 11; technical tests are run on three - Fairness, Explainability, and Robustness - with red-teaming and content-safety benchmarks added for generative AI.

DimensionWhat you testHow (tooling)
Fairness / biasWhether outcomes differ unfairly across protected subgroups; representativeness of training data; counterfactual invariance (same decision if a sensitive attribute changes)Subgroup metrics (demographic parity, equalized odds, false-discovery-rate); AIF360, Fairlearn; Moonshot bias cookbook
RobustnessWhether the system holds up under perturbed, adversarial, or out-of-distribution inputAdversarial Robustness Toolbox (ART); perturbation & distribution-shift tests; the adversarial families in II.1/II.18
ExplainabilityWhether decisions can be attributed to inputs / understoodSHAP, feature attribution, model-extraction-for-interpretability
Reliability / hallucinationFactual accuracy and consistency, esp. for GenAIFactual-accuracy benchmarks; Moonshot hallucination cookbook; LLM-as-judge (human-verified)
Data governanceProvenance, minimization, PDPA compliance, lineageProcess checks; data-lineage & consent audits (II.13)
Transparency / accountabilityDisclosure, model cards, incident-reporting, role evidenceProcess checks; documentation review