← Back to Enterprise.AI
Accelerator · Checklist

Model Risk Management Checklist for AI

An SR 11-7 / SS1/23-aligned checklist extended to cover machine learning and generative AI models. Use it as a self-assessment, a model intake gate or a validation scope document.

44
Checkpoint Items
6
Lifecycle Stages
2
Regulatory Frameworks
This checklist extends SR 11-7 and SS1/23 to cover ML and GenAI models. Use it as an intake gate before model development begins, and again as a pre-deployment validation scope document. Tick items as you confirm them with your model risk team.

1 · Intake & classification

2 · Data & documentation

3 · Development & testing

4 · Independent validation

5 · Deployment & monitoring

6 · Generative AI extensions

Coverage 0%

Tick items as you confirm them with your model risk team.

SR 11-7 & SS1/23 section references

This checklist is aligned with the Federal Reserve's SR 11-7 and Bank of England's SS1/23 for model risk management. Below are key cross-references:

SR 11-7 / SS1/23 PrincipleChecklist Sections AddressingKey Requirements
Governanceg1: Risk tier, owner, approvalBoard & executive accountability; independent oversight; risk appetite defined
Model Developmentg3: Methodology, testing, explainabilityDocumented process; rigorous testing; version control; reproducibility
Model Validationg4: Independent validation, sign-offPre-deployment validation; conceptual, data, performance soundness; exception tracking
Data Governanceg2: Data quality, PII, lineage, biasData quality standards; PII protection; representativeness; fairness testing
Monitoring & Reportingg5: SLOs, drift detection, alerting, revalidationReal-time monitoring; performance tracking; drift detection; escalation procedures; periodic revalidation
DocumentationAll sections; esp. g1, g3, g5Model inventory, purpose, methodology, testing results, deployment plan, monitoring metrics
GenAI-specific (SS1/23)g6: Foundation model, prompts, hallucination, human-in-the-loopModel & vendor assessment; prompt governance; grounding; hallucination controls; human oversight

Validation scope template (for Level 1 & 2 models)

For Tier 1 & 2 models, use this template to scope independent validation before deployment:

Validation AreaScope for Tier 1Scope for Tier 2Owner / Timeline
Conceptual SoundnessFull review of business case, model choice, designReview of model design & business rationaleModel Risk / 2–3 weeks
Data ReviewFull data audit: sources, quality, bias, representativenessSampling of data quality & bias testingModel Risk / 1–2 weeks
Performance TestingBacktesting on holdout set; stress testing; fairness testing all cohortsBacktesting; fairness testing on main cohortsModel Risk / 1–2 weeks
Robustness & StressAdversarial testing, edge cases, degraded data scenariosKey edge case testingModel Risk / 1 week
ImplementationCode review, integration testing, production readinessCode review, basic integration testingTech Risk / 1 week
ExplainabilityFull explainability testing; SHAP/LIME plots; comparison modelsExplainability for top features; simpler challengerModel Risk / 1 week
GovernanceFull validation report; sign-off; condition documentationValidation report; sign-offModel Risk / 1 week
Timeline Total6–8 weeks (typical for Tier 1)3–4 weeks (typical for Tier 2)Plan accordingly in roadmap

Model inventory template (minimum fields)

Your central AI/ML inventory should track at minimum these fields for each model:

FieldDescriptionExampleOwner
Model IDUnique identifier for the modelLOAN_SCORING_V2.1Tech/Data
Business NameNon-technical name for the modelMortgage Pre-Approval ScorerBusiness
Business OwnerExecutive accountable for the modelHead of Retail LendingBusiness
Model Owner (Tech)Person responsible for day-to-day operation[model-owner@yourbank.com]Data Science
Risk OwnerExecutive accountable for risk managementChief Risk Officer / Head of Model RiskRisk
Risk Tier1 (Critical), 2 (High), 3 (Moderate), or 4 (Low)1Risk
Regulatory ClassificationEU AI Act (prohibited/high-risk/limited/minimal), SR 11-7, GDPR ADM, Fair LendingHigh-risk (EU AI Act), SR 11-7, GDPR ADMCompliance
Model TypeTraditional ML, DL, GenAI, ensemble, prompt engineeringLogistic Regression + XGBoost ensembleData Science
StatusDevelopment, Validation, Production, RetiredProductionData Science
Production DateWhen model went live2025-06-15Data Science
Last Revalidation DateMost recent validation/revalidation completed2026-03-20Risk
Next Revalidation DueScheduled revalidation date by risk tier2026-06-20 (quarterly for Tier 1)Risk
Monitoring StatusActive, Alert, Suspended, DecommissioningActiveData Science
Fairness Testing CompletedY/N and date of last fairness auditY (2026-03-15)Risk
Explainability AvailableY/N; explanation methodY (SHAP, LIME)Data Science
Data Governance OwnerTeam responsible for training data qualityData Platform teamData
Foundation Model (if GenAI)OpenAI GPT-4, Anthropic Claude, etc.OpenAI GPT-4 TurboData Science
Vendor/Third-partyIf model or foundation model from external vendorOpenAI / NAVendor Risk
Audit Trail / DocumentationLink to model card, validation report, incident logSharePoint link to MRM documentation folderRisk
The most common failure: models deployed once and never reassessed. Automate revalidation triggers, schedule cadence in your inventory, and track compliance monthly. Silent degradation is the biggest risk in production AI.

Common Model Risk Management gaps

Watch out for these common failures in AI/ML model risk:

Ready to implement this in your organisation?

Get in touch to discuss how this accelerator fits your institution.

Book a Consultation →