AIIRS provides a consistent and defensible basis for assessing the inherent risk associated with GenAI-assisted tasks.
The AI Inherent Risk Scale (AIIRS) provides a structured approach for classifying tasks that use generative artificial intelligence (GenAI) into LOW, MEDIUM, or HIGH inherent-risk bands.
Classification is determined via three criteria—epistemic dependence, verifiability, and consequences of error—that define the nature and significance of a task's reliance on GenAI.
These criteria consider the extent to which GenAI is expected to supply information, the degree to which the output can be independently verified, and the seriousness of any potential errors.
AIIRS evaluates tasks across three independent dimensions.
Epistemic dependence captures whether a task requires the system's representations of the world to be correct in order for the task outcome to be usable.
Tasks with lower epistemic dependence rely only on user-provided material, without requiring the system's representations of the world to be correct for the task outcome to be usable.
Tasks with higher epistemic dependence require the system's representations of the world to be correct for the task outcome to be usable.

Verifiability captures the basis on which the correctness of a GenAI system's output can be verified for the task.
Verifiability is assessed independently of consequences. A task may be high risk due to unsourced verifiability, even where the immediate consequences of error are limited.
Tasks with embedded verifiability enable quick, reliable verification by the user or the surrounding process, without requiring specific domain expertise.
Tasks requiring expert verifiability depend on specialised expertise or external investigation that requires evaluative judgement.

The consequences of error reflect the extent to which incorrect, misleading, or incomplete GenAI outputs affect decisions, records, or outcomes related to the task.
Tasks with minimal consequences of error are those in which errors have minimal impact on understanding or outputs and do not affect decisions, records, or outcomes relating to people beyond the task.
Tasks with significant consequences of error are those in which errors affect decisions about people, alter records relating to them, or compromise outputs that have consequences for individuals or groups beyond the task.

Select one option for each criterion. AIIRS calculates the risk classification using max-dominant scoring where the single highest criterion determines the overall result.
AIIRS uses a max-dominant classification model that supports proportionate risk management by ensuring that any single high-risk feature of a task is not offset by lower-risk features elsewhere.
Tasks classified as HIGH must not proceed in their current form. One or more of the following interventions are required:

Tasks classified as MEDIUM require proportionate controls to manage identified risk. The following controls and conditions apply:

Tasks classified as LOW require routine care appropriate to the task and context. The following routine practices apply:

The complete AI Inherent Risk Scale is freely available and released under a Creative Commons BY-NC-SA 4.0 license.
Download AIIRS PDF
Scan to download