www.microsoft.com 6/4/2026, 8:01:21 PM · external

Updating the taxonomy of failure modes in agentic AI systems: What a year of red teaming taught us

Updating the taxonomy of failure modes in agentic AI systems: What a year of red teaming taught us
CyberSIXT Evidence Panel Source marked as original reporting

THE Microsoft AI Red Team has updated its Taxonomy of Failure Modes in Agentic AI Systems to v2.0, reflecting new insights gained from a year of red teaming. The update introduces seven new failure modes, such as Agentic Supply Chain Compromise and Goal Hijacking, addressing new vulnerabilities discovered in open-source frameworks and computer-use agents.

The v2.0 taxonomy emphasizes the need for supply chain security, a zero-trust framework for inter-agent communication, and improved consent architecture to protect against these vulnerabilities. Key operational findings include the frequency of HitL bypass exploits and the necessity of system-level testing for emerging attack patterns.

Recommendations for organizations involve comprehensive inventorying of their AI system supply chains, verifying agent identities, and auditing human-in-the-loop processes systematically.

View full article

Article by CyberSIXT