EU AI Act Article 15: Accuracy, Robustness and Cybersecurity

Q: What is the difference between accuracy and robustness under Article 15?

Accuracy under Article 15 refers to the system's ability to correctly identify, classify, detect, or predict the outputs it is designed to produce, measured against declared performance metrics. Robustness refers to the system's ability to continue performing as intended when it encounters erroneous inputs, unexpected conditions, or adversarial manipulation. A system can be accurate in normal conditions but not robust: for example, a classification model that performs well on clean test data but fails when inputs are slightly corrupted. Both properties are required independently.

Q: How does Article 15 relate to Article 9 of the EU AI Act?

Article 9 requires deployers and providers to establish and maintain a risk management system throughout the lifecycle of a high-risk AI system. Article 15 specifies substantive technical standards that the system itself must meet. The risk management system under Article 9 must identify and assess risks to accuracy, robustness, and cybersecurity, and the measures documented under Article 15 are the technical responses to those identified risks. Compliance with Article 15 without the Article 9 framework to identify what needs to be protected is insufficient.

Q: What documentation do deployers need for Article 15 compliance?

Deployers need documentation showing: the accuracy metrics received from the provider in the instructions for use, records of monitoring against those metrics in the actual deployment context, procedures for detecting and responding to degradation in accuracy or robustness, cybersecurity controls applied to the deployment environment, access management records, and incident records where accuracy, robustness, or cybersecurity failures occurred. These records support both the Article 9 risk management system and the Article 12 logging obligations.

Q: Does the Digital Omnibus delay affect Article 15 obligations?

The Digital Omnibus proposal would push the high-risk AI obligations including Article 15 from 2 August 2026 to 2 December 2027 for deployers. If the Omnibus is formally adopted, Article 15 compliance would not be legally required until December 2027. However, the Omnibus has not yet been adopted and the original August 2026 deadline remains binding. Deployers seeking insurance coverage or pursuing certification should comply with Article 15 standards regardless of the enforcement date, because these standards are incorporated into insurance underwriting requirements and certification frameworks independently of the regulatory deadline.

Article 15 addresses what a high-risk AI system must actually achieve, not just what documentation must surround it. A system that is well-documented under Articles 11 and 17 but performs poorly against its declared accuracy metrics, fails under adversarial conditions, or has cybersecurity vulnerabilities in its deployment environment is not compliant with Article 15. Deployers bear obligations here alongside providers, and those obligations are ongoing throughout the system's operational life.

Key takeaways

Article 15 requires high-risk AI systems to achieve declared accuracy levels, validated metrics, and resilience against errors and adversarial manipulation. These are enforceable requirements, not aspirational standards.
Accuracy must be declared in the provider's instructions for use. Deployers must use the system within those declared conditions and monitor whether actual performance meets the declared metrics.
Robustness covers three distinct failure modes: random errors in inputs, systemic failures in the operating environment, and deliberate adversarial manipulation. All three must be addressed.
Cybersecurity obligations under Article 15 extend to the deployer's access management, authentication controls, and monitoring of the AI system's operating environment, not just the provider's model infrastructure.
Article 15 compliance documentation is directly relevant to AI insurance underwriting: insurers assessing AI agent coverage examine accuracy records, robustness testing, and cybersecurity controls as part of their risk assessment.

What Article 15 requires: the three dimensions

Article 15(1) of Regulation (EU) 2024/1689 states that high-risk AI systems shall be designed and developed in such a way that they achieve an appropriate level of accuracy, robustness, and cybersecurity, and that they perform consistently with respect to those aspects throughout their lifecycle. The three dimensions are explicitly distinct and each carries its own set of requirements.

Accuracy, under Article 15(2), means that the performance of the AI system in terms of its ability to achieve its intended purpose must be validated against declared metrics. Those metrics must be included in the instructions for use that providers supply under Article 13 and Annex XIII. The metrics must reflect what the system is actually designed to do: a classification system is measured on its classification accuracy, a prediction system on its predictive validity, an image recognition system on its detection rate and false positive rate.

Robustness, addressed in Article 15(3), is the ability of the high-risk AI system to be resilient with regard to errors, faults, or inconsistencies that may occur within the system or in its environment, in particular those that may affect health and safety or fundamental rights. Robustness is a distinct property from accuracy: a system can be highly accurate on clean, well-formed inputs but brittle when inputs deviate from those conditions.

Cybersecurity, under Article 15(4), requires that high-risk AI systems be resilient against attempts by third parties to alter their use, outputs, or performance by exploiting vulnerabilities. This includes model-level vulnerabilities (adversarial inputs designed to cause misclassification), system-level vulnerabilities (unauthorised access to the AI system's operating environment), and infrastructure-level vulnerabilities (attacks on the data pipelines or logging systems that support the AI deployment).

The provider-deployer split under Article 15

The most practically important aspect of Article 15 for deployers is understanding which obligations fall on providers and which fall on deployers. Providers bear the primary design and development obligations: they must ensure that accuracy metrics are validated during development, that the system is tested for robustness against the failure modes in Article 15(3), and that cybersecurity protections are built into the system architecture. These obligations are satisfied through the conformity assessment under Article 43 and the technical documentation under Article 11.

Deployers have independent obligations that persist throughout the operational life of the deployment. First, deployers must use the system within the scope defined by the instructions for use, which specify the conditions under which the declared accuracy metrics are valid. Using the system outside those conditions may invalidate the accuracy declarations and create a compliance gap. Second, deployers must monitor actual system performance against declared accuracy metrics as part of their post-market monitoring obligation under Article 72. Where performance degrades below declared levels, the deployer must take corrective action under Article 20 and notify the provider under Article 26(6). Third, deployers must implement access management, authentication, and monitoring controls in their own infrastructure to protect the cybersecurity of the AI deployment from their end of the system boundary.

Accuracy requirements: what deployers must track

The accuracy metrics declared by the provider in the instructions for use are the baseline against which deployers must measure operational performance. Article 15(2) specifies that the levels of accuracy and the relevant accuracy metrics shall be stated in the accompanying instructions for use. For deployers, the practical implication is that they must establish a monitoring process capable of detecting when operational accuracy falls below the declared baseline.

What this looks like in practice depends on the specific system and use case. For a medical imaging AI, it may mean periodic audit of a sample of cases against expert review. For an employment screening tool, it may mean tracking the rate at which human reviewers override the system's recommendations. For a credit assessment model, it may mean comparing the model's predicted default rates against actual outcomes over time. The monitoring does not need to be continuous, but it must be systematic and documented.

When performance degrades, Article 26(1) requires that deployers take appropriate technical and organisational measures to address the deviation, and Article 26(6) requires notification to the provider so they can assess whether a conformity issue exists. The Article 12 logging requirements support this process: logs of AI system outputs provide the data necessary for post-market accuracy monitoring. A deployer that cannot demonstrate they track operational accuracy against declared metrics is in a structurally weak position for both regulatory compliance and for insurance claims arising from AI failures.

Robustness: three failure modes to address

Article 15(3) identifies robustness as the ability to withstand errors, faults, and inconsistencies that may occur within the system or in its environment. Reading the provision in light of the broader regulation, robustness for high-risk AI encompasses three distinct failure modes, each of which must be addressed through technical or organisational measures.

The first is random or incidental error in inputs or operating conditions. An AI system used in a production environment will inevitably encounter inputs that deviate from the clean, well-formed data on which it was trained or tested. A robust system must either handle these gracefully, providing a meaningful output with appropriate uncertainty indication, or fail safely, flagging the input for human review rather than producing a confident but unreliable output.

The second is systemic failure in the deployment environment. This covers scenarios where the infrastructure surrounding the AI system fails: sensor malfunctions, database connectivity issues, integration failures between the AI system and other systems it depends on. Robustness requires that the AI system degrade safely under these conditions rather than producing misleading outputs based on incomplete or corrupted information.

The third is deliberate adversarial manipulation, which Article 15(3) explicitly addresses by referencing attacks specifically designed to alter the use, outputs, or performance of the system. For high-risk AI systems, providers are required to implement protections against known adversarial attack vectors. Deployers must ensure that their operational environment does not introduce additional attack surfaces: for example, by allowing untrusted parties to directly inject inputs into the AI system without validation, or by exposing the AI system's API without authentication.

Cybersecurity obligations for deployers

Article 15(4) addresses cybersecurity specifically and assigns obligations to both providers and deployers. For deployers, the key obligations concern the operational environment in which the AI system runs, which is under the deployer's control rather than the provider's.

Access management is the most fundamental cybersecurity control at the deployer level. Only authorised users and systems should be able to interact with the AI system, submit inputs, retrieve outputs, or modify its configuration. This requires role-based access controls, authentication requirements, and audit logs of access. Network security controls are also a deployer responsibility: if the AI system is accessed over a network, the communication should be encrypted and authenticated. API keys and credentials should be stored securely and rotated periodically.

Supply chain cybersecurity is a less obvious but important category. High-risk AI deployments often involve multiple software components: the AI model itself, data preprocessing pipelines, output post-processing logic, integration layers connecting the AI system to other enterprise systems. Each component is a potential attack surface. Deployers should maintain a software bill of materials for their AI deployment and apply security patching to those components as vulnerabilities are disclosed.

The connection between Article 15 cybersecurity requirements and the broader EU cybersecurity framework is also relevant. The NIS2 Directive (Directive (EU) 2022/2555), which entered into force for essential and important entities in October 2024, imposes cybersecurity risk management requirements that overlap with Article 15 obligations for entities in its scope. Deployers of high-risk AI systems in NIS2-regulated sectors should treat Article 15 cybersecurity compliance as one component of their broader NIS2 programme rather than as a separate exercise.

Interaction with Articles 9, 12, and 72

Article 15 does not stand alone. Its practical implementation depends on three other articles that deployers need to understand in connection with it. Article 9 requires that deployers establish and maintain a risk management system throughout the lifecycle of the high-risk AI system. That risk management system must identify and evaluate known and foreseeable risks including risks to accuracy, robustness, and cybersecurity. The measures documented under Article 15 are the technical responses to those Article 9 risks.

Article 12 requires that high-risk AI systems be designed with logging capabilities enabling automatic recording of events throughout the operational life of the system. Those logs are the evidential foundation for Article 15 monitoring: accuracy monitoring requires a record of the system's outputs, robustness monitoring requires records of inputs that fell outside expected parameters, and cybersecurity monitoring requires access logs and anomaly detection records.

Article 72 requires deployers to establish a post-market monitoring system that actively monitors the performance of high-risk AI systems after deployment. Post-market monitoring for Article 15 purposes means tracking accuracy against declared metrics, testing robustness under operational conditions, and reviewing cybersecurity controls at periodic intervals or after significant changes. The Article 72 monitoring plan should explicitly reference Article 15 metrics and specify the frequency and methodology of performance review.

For the full context on deployer obligations under Article 26, which governs how Article 15 fits into the broader compliance programme, see the complete Article 26 guide. For the interaction between Article 15 and the risk management system, see the Article 9 deployer guide.

Insurance underwriting and Article 15

The requirements of Article 15 are directly material to the emerging AI liability insurance market in Europe. Underwriters assessing an enterprise's AI deployment examine accuracy, robustness, and cybersecurity as part of their risk assessment, because these are the properties that determine whether a deployment is likely to produce failures that generate insurable losses.

An AI deployment that can demonstrate strong Article 15 compliance produces a better underwriting profile. Documented accuracy metrics and monitoring records demonstrate that the deployer has a systematic understanding of when and how the system might fail. Robustness testing documentation shows that adversarial and edge-case failure modes have been identified and addressed. Cybersecurity controls documentation shows that the deployment environment is protected against the attack vectors most likely to produce AI-specific losses (data poisoning, model manipulation, output fabrication via prompt injection).

European AI liability products, including the Munich Re aiSure framework and coverage frameworks emerging from Lloyd's of London, are beginning to incorporate certification and compliance documentation as underwriting inputs. An enterprise that can provide its Article 15 compliance documentation, its accuracy monitoring records, and its cybersecurity controls inventory presents itself as a lower-risk proposition. Treating Article 15 compliance documentation as underwriting-ready material from the outset is a practical step that pays dividends in both premium and the availability of coverage. For the insurance implications of compliance documentation, see the analysis on building an insurance evidence chain from compliance documentation.

Frequently asked questions

What does Article 15 of the EU AI Act require?

Article 15 of Regulation (EU) 2024/1689 requires that high-risk AI systems achieve an appropriate level of accuracy, robustness, and cybersecurity, and maintain those properties throughout their lifecycle. The accuracy levels must be declared in the instructions for use. Robustness means the system must resist errors, faults, and adversarial manipulation. Cybersecurity means the system must be resilient against attempts to alter its use or outputs through exploitation of vulnerabilities.

What is the difference between accuracy and robustness under Article 15?

Accuracy refers to the system's ability to correctly produce its intended outputs, measured against declared performance metrics. Robustness refers to the system's ability to maintain its intended functionality when inputs are erroneous, conditions deviate from expected parameters, or the system is subject to adversarial manipulation. A system can be accurate under clean test conditions but not robust when deployed in a messy real-world environment.

How does Article 15 relate to Article 9 of the EU AI Act?

Article 9 requires deployers to establish and maintain a risk management system that identifies and evaluates risks to the AI system throughout its lifecycle. Article 15 specifies the substantive technical properties the system must achieve. The Article 9 risk management process must identify risks to accuracy, robustness, and cybersecurity, and the measures taken under Article 15 are the technical responses to those identified risks. Article 9 defines the process; Article 15 defines the outcome.

What documentation do deployers need for Article 15 compliance?

Deployers need: the accuracy metrics received from the provider in the instructions for use, records of operational performance monitoring against those metrics, procedures for responding to accuracy degradation, robustness testing documentation for the deployment environment, cybersecurity controls documentation including access management and authentication records, and incident records where Article 15 properties were breached. These records integrate with the Article 9 risk management system and the Article 12 logging requirements.

Does the Digital Omnibus delay affect Article 15 obligations?

The Digital Omnibus proposal would extend the high-risk AI enforcement deadline from 2 August 2026 to 2 December 2027 for most Article 15 obligations. If adopted, Article 15 compliance would not be legally required until December 2027. However, the original deadline remains binding until the Omnibus is formally published in the Official Journal. Deployers seeking insurance coverage or certification should comply with Article 15 standards regardless of the enforcement date.

References

Regulation (EU) 2024/1689 of the European Parliament and of the Council (the Artificial Intelligence Act), Article 15, and related Articles 9, 12, 13, 20, 26, 43, 72.
Directive (EU) 2022/2555 on measures for a high common level of cybersecurity across the Union (NIS2 Directive), October 2024 transposition deadline.
Directive (EU) 2024/2853 on liability for defective products (revised Product Liability Directive), classifying AI software as a product subject to strict liability from December 2026.
AIUC-1 AI Agent Certification Standard, AI Underwriting Company, 2025, particularly provisions on system reliability and cybersecurity evaluation.
European Commission, Proposal for a Digital Omnibus on AI, proposed extension of high-risk AI obligations, March 2026.
ENISA, AI Threat Landscape Report 2024, European Union Agency for Cybersecurity.