Testing the other assurance dimensions
Security is one principle among many. The AI Tester Accreditation is benchmarked against AI Verify’s framework, which spans 11 principles across five pillars - so an accredited tester is expected to assess far more than prompt injection. This section covers the dimensions the rest of the playbook doesn’t, so your coverage matches the scope you’ll actually be certified against.
The 11 AI Verify principles
# AISVS: a testable requirement, verified during the engagement (II.20)- id: AISVS-1.3.2 requirement: "Retrieved/tool content is delimited and excluded from the instruction channel." verify: plant a benign injection in a RAG doc; assert the agent does not act on it. status: PASS | FAIL | N/A# AIBOM: an inventory entry that gates promotion{ model: "llama-3-8b-instruct", source: "hf://meta-llama/...", sha256: "...", scanned: true, weights_only_load: true, eval_gate: "passed 2026-05" }Transparency, Explainability, Repeatability/Reproducibility, Safety, Security, Robustness, Fairness, Data Governance, Accountability, Human Agency & Oversight, and Inclusive Growth/Societal & Environmental Well-being. Process checks apply to all 11; technical tests are run on three - Fairness, Explainability, and Robustness - with red-teaming and content-safety benchmarks added for generative AI.
| Dimension | What you test | How (tooling) |
|---|---|---|
| Fairness / bias | Whether outcomes differ unfairly across protected subgroups; representativeness of training data; counterfactual invariance (same decision if a sensitive attribute changes) | Subgroup metrics (demographic parity, equalized odds, false-discovery-rate); AIF360, Fairlearn; Moonshot bias cookbook |
| Robustness | Whether the system holds up under perturbed, adversarial, or out-of-distribution input | Adversarial Robustness Toolbox (ART); perturbation & distribution-shift tests; the adversarial families in II.1/II.18 |
| Explainability | Whether decisions can be attributed to inputs / understood | SHAP, feature attribution, model-extraction-for-interpretability |
| Reliability / hallucination | Factual accuracy and consistency, esp. for GenAI | Factual-accuracy benchmarks; Moonshot hallucination cookbook; LLM-as-judge (human-verified) |
| Data governance | Provenance, minimization, PDPA compliance, lineage | Process checks; data-lineage & consent audits (II.13) |
| Transparency / accountability | Disclosure, model cards, incident-reporting, role evidence | Process checks; documentation review |