Testing the other assurance dimensions

Security is one principle among many. The AI Tester Accreditation is benchmarked against AI Verify’s framework, which spans 11 principles across five pillars - so an accredited tester is expected to assess far more than prompt injection. This section covers the dimensions the rest of the playbook doesn’t, so your coverage matches the scope you’ll actually be certified against.

The 11 AI Verify principles

# AISVS: a testable requirement, verified during the engagement (II.20)
- id: AISVS-1.3.2
  requirement: "Retrieved/tool content is delimited and excluded from the instruction channel."
  verify: plant a benign injection in a RAG doc; assert the agent does not act on it.
  status: PASS | FAIL | N/A
# AIBOM: an inventory entry that gates promotion
{ model: "llama-3-8b-instruct", source: "hf://meta-llama/...", sha256: "...",
  scanned: true, weights_only_load: true, eval_gate: "passed 2026-05" }

Transparency, Explainability, Repeatability/Reproducibility, Safety, Security, Robustness, Fairness, Data Governance, Accountability, Human Agency & Oversight, and Inclusive Growth/Societal & Environmental Well-being. Process checks apply to all 11; technical tests are run on three - Fairness, Explainability, and Robustness - with red-teaming and content-safety benchmarks added for generative AI.

Dimension	What you test	How (tooling)
Fairness / bias	Whether outcomes differ unfairly across protected subgroups; representativeness of training data; counterfactual invariance (same decision if a sensitive attribute changes)	Subgroup metrics (demographic parity, equalized odds, false-discovery-rate); AIF360, Fairlearn; Moonshot bias cookbook
Robustness	Whether the system holds up under perturbed, adversarial, or out-of-distribution input	Adversarial Robustness Toolbox (ART); perturbation & distribution-shift tests; the adversarial families in II.1/II.18
Explainability	Whether decisions can be attributed to inputs / understood	SHAP, feature attribution, model-extraction-for-interpretability
Reliability / hallucination	Factual accuracy and consistency, esp. for GenAI	Factual-accuracy benchmarks; Moonshot hallucination cookbook; LLM-as-judge (human-verified)
Data governance	Provenance, minimization, PDPA compliance, lineage	Process checks; data-lineage & consent audits (II.13)
Transparency / accountability	Disclosure, model cards, incident-reporting, role evidence	Process checks; documentation review