Runtime Truth

Thought Leadership

Agentic AI Threats: An OWASP-Aligned Threat Modeling Framework for Security Teams

STRIDE was built for systems that do what they are told.

Obsidian Editorial Team

Security Research

Obsidian Security

May 15, 2026

June 1, 2026

Key Takeaways

STRIDE was built for systems that do what they are told.
When a Salesforce Agentforce workflow can invoke tools, chain actions across connected apps, and operate under a creator's admin credentials without the invoker knowing, the entire premise of configuration-time threat modeling collapses.
Understanding the actual agentic AI threats your environment faces requires a different methodology entirely.---

Why Traditional Threat Modeling Frameworks Fail for AI Agents

Security teams have relied on STRIDE and DREAD for decades because those frameworks match how software behaves. A service receives input, processes it according to fixed logic, and produces output. The threat model asks: where can an attacker spoof, tamper, repudiate, disclose, deny, or escalate? The answers are bounded because the system's behavior is bounded.

AI agents break every one of those assumptions.

What is an agentic AI system, exactly? It is an autonomous system that takes multi-step actions on behalf of users using tools, APIs, and SaaS integrations. It does not follow a fixed execution path. It reasons about goals, selects tools, chains actions, and adapts based on intermediate results. Two identical prompts to the same agent can produce two entirely different sequences of tool calls. That is not a bug. It is the design.

STRIDE asks: "What can an attacker do to this system?" For an autonomous agent, the harder question is: "What can this system do, to whom, under what conditions, and who authorized any of it?" Those are identity and authority questions, not just attack surface questions.

The configuration-time analysis problem is equally serious. Traditional threat modeling happens at design time. Security teams review architecture diagrams, data flow diagrams, and trust boundaries. For AI agents, configuration-time analysis shows theoretical configuration: what the agent is set up to do on paper. It does not show effective authority: what the agent can actually execute inside each SaaS application it touches, given how entitlements resolve at runtime.

One enterprise discovered 377 Microsoft Copilot agents through a security assessment. None appeared in any architecture diagram. None had been through a threat modeling exercise. They existed, they were running, and they had access. That is the shadow agent gap that configuration-time frameworks cannot close.

The result is ghost chasing. Security teams review policy documents and connector configurations, flag theoretical risks, and have no runtime evidence to confirm whether any of it matters. The agents keep running. The blast radius keeps growing.

The OWASP-Aligned Agentic AI Threat Model

The OWASP Top 10 for Agentic AI and the companion OWASP Low-Code/No-Code Security (LCNC-SEC) project are the most operationally useful starting points for agentic AI threat modeling. They map directly to the attack patterns that enterprise security teams are already encountering: privilege escalation via maker mode credentials, confused deputy attacks, excessive data access, and agent impersonation.

The table below maps each relevant OWASP category to the threat modeling question it answers and the runtime signal that confirms the risk is real.

OWASP CategoryThreat Modeling QuestionSignal to Look ForLCNC-SEC-01: Account ImpersonationCan a lower-privilege user invoke this agent and receive data they are not authorized to see?Runner identity does not match the entitlement level of the agent's embedded credentialsLCNC-SEC-05: Security MisconfigurationIs this agent publicly accessible or org-wide without access controls?Agent URL accessible without authentication; no invoker identity requirementLCNC-SEC-06: Excessive Data AccessDoes this agent access more data than its workflow actually requires?Agent queries records, files, or objects outside the scope of its stated purposeOWASP Agentic: Privilege EscalationCan this agent acquire permissions beyond its initial grant through action chaining?Agent calls a tool that returns a token or role elevation used in a subsequent callOWASP Agentic: Confused DeputyCan an unauthorized user manipulate this agent into using its elevated permissions on their behalf?Invoker identity has no direct access to the resource the agent retrievesOWASP Agentic: Excessive AgencyDoes this agent have more tool access than its workflow requires?Agent has write or delete permissions on systems it only needs to readOWASP Agentic: Credential ExposureAre credentials embedded in the agent configuration accessible to unintended parties?Hardcoded secrets, shared connectors, or maker mode connections with admin-level credentialsOWASP Agentic: Insecure Agent CommunicationDoes this agent share full context with sub-agents or downstream agents?Supervisor agent forwards entire conversation context including sensitive fields to sub-agents

"The question for security teams should not just be: which agents exist. It is what they can actually execute, on whose behalf, with what downstream reach, and whether any of that is policy-aligned."

OWASP provides the vocabulary. The methodology below provides the process for answering each of these questions systematically, for every agent in your environment. For a deeper look at the attack surface these categories represent, see AI agent attack vectors security teams need to know.

The Four-Phase Threat Modeling Methodology

Agentic AI threat modeling requires four sequential phases. Each phase builds on the last. Skipping Phase 1 makes Phase 2 impossible. Skipping Phase 2 makes Phase 3 produce false confidence.

Phase 1: Inventory and Identity Mapping

The first question is always: what agents exist? Security teams cannot threat model what they cannot see. This phase produces a complete agent inventory across every platform in scope, including the agents nobody registered.

For each agent, capture:

Creator identity: Who built this agent? Is that account still active?
Credential type: Does this agent use the creator's credentials (maker mode), a service account, or delegated user credentials?
Platform: Where does this agent run? Copilot Studio, Agentforce, Bedrock, Azure AI Foundry, n8n, or another builder?
Connected tools and MCP servers: What external systems can this agent reach?
Invoker scope: Who can trigger this agent? One user, the whole organization, or anyone with a URL?

An orphaned agent whose creator account is disabled but whose credentials are still live is a critical finding at this phase. The agent continues running with inherited permissions that no active owner can review or revoke.

Phase 2: Effective Authority Analysis

Theoretical configuration tells you what the agent is set up to do. Effective authority tells you what it can actually do inside each connected application after all entitlements resolve.

These two numbers are rarely the same. Ninety percent of agents hold excessive privileges relative to what their workflows actually require. A connector configured for "read access" may resolve to admin-level entitlements inside Salesforce because the creator holds an admin role and the connector inherits it.

For each agent, map:

What does the configuration claim this agent can access?
What can the agent's credentials actually execute inside each connected SaaS application?
Does the invoker's identity constrain what the agent can retrieve, or does the agent operate entirely on its own credentials?

This phase surfaces the maker mode risk directly. When a user without Salesforce access invokes an agent built with an admin's credentials, the user receives Salesforce data at admin resolution. No access control was bypassed technically. Your IAM was simply invisible to the interaction. Securing Salesforce Agentforce workflows requires this analysis before any other control can be meaningful.

Phase 3: Action Chain Decomposition

Action chaining is the mechanism by which agents compound access across multiple tool calls. A single agent interaction can involve five, ten, or twenty sequential tool calls, each one building on the output of the last.

Decompose each agent's possible action chains by asking:

What is the first tool call this agent can make?
What data or tokens does that call return?
What does the agent do with that output in the next call?
At what point in the chain does the agent touch sensitive data, modify records, or acquire new permissions?
What is the maximum blast radius if the chain runs to completion under the agent's current effective authority?

This phase frequently reveals that agents with individually acceptable permissions can reach critical data through multi-step sequences that no single permission review would flag. For a structured look at these attack patterns, see the full AI agent attack surface analysis.

Phase 4: Toxic Combination Identification

Individual risk factors often score as medium severity. Two medium-severity factors on the same agent frequently produce a critical-severity finding. This is the toxic combination problem.

Review each agent's risk factor profile and flag combinations such as:

Shadow agent (unmanaged, no security review) plus org-wide accessible
What this combination enforces as a finding: an unvetted agent with maximum exposure scope
Why individual factors understate the risk: an unmanaged agent is concerning; an unmanaged agent accessible to the entire organization is a critical blast radius
Maker mode connector with admin credentials plus invoker has no direct platform access
What this combination enforces as a finding: a de facto privilege escalation path available to any invoker
Why individual factors understate the risk: maker mode is a design choice; the absence of invoker-level entitlement checks makes it an access control bypass
Orphaned agent (creator account disabled) plus sensitive data access
What this combination enforces as a finding: active data access by credentials with no accountable human owner
Why individual factors understate the risk: a disabled account is an IAM hygiene issue; a disabled account still running an agent against sensitive data is an active security incident
Public webhook endpoint (n8n) plus hardcoded credentials in the workflow
What this combination enforces as a finding: externally accessible execution with embedded secrets that cannot be rotated without code changes
Why individual factors understate the risk: a public endpoint is a scope concern; hardcoded credentials in a publicly invokable workflow is a credential exposure at the perimeter

Each combination multiplies the blast radius of the other factors. A publicly accessible agent with maker mode admin credentials is not two medium-severity findings. It is a single critical-severity finding that warrants immediate remediation. For a deeper framework on how these combinations stack, see AI agent toxic combinations and risk scoring.

A Worked Example: Threat Modeling a Salesforce Agentforce Workflow

A financial services security team is asked to threat model a new Salesforce Agentforce workflow. The workflow is described as: "Summarize open opportunities and surface renewal risk for account executives."

Here is how the four phases apply.

Phase 1 findings: The agent was built by a Salesforce administrator. The administrator's account is active. The agent uses a connector in maker mode, meaning it runs on the administrator's credentials for every invocation. The agent is accessible to all sales users in the organization. No MCP server connections are present.

Phase 2 findings: The theoretical configuration says the agent reads opportunity records. Effective authority analysis shows the administrator's credentials resolve to read access across all Salesforce objects, including contracts, pricing rules, and HR-linked compensation records that the sales team is not supposed to see. The agent's effective authority is significantly broader than its stated purpose.

Phase 3 findings: Action chain decomposition shows the agent's first call retrieves opportunity records. A follow-up call, triggered by the agent's reasoning about "renewal risk," queries contract terms. A third call pulls account contact records including fields tagged as sensitive. The chain runs in under two seconds. No individual call appears alarming in isolation.

Phase 4 findings: Two toxic combinations are present. First: maker mode connector with admin-level effective authority plus org-wide invoker access. Any sales user can now retrieve contract and compensation-adjacent data through the agent. Second: the agent accesses files with sensitivity labels through its connected SharePoint integration, which was not documented in the original workflow description.

The overall risk rating for this agent moves from medium (based on individual factors) to critical (based on toxic combinations). The remediation path requires restricting the connector to a least-privilege service account scoped to opportunity and contact objects only, and limiting agent invocation to the account executive role rather than all sales users.

This is what AI agent governance looks like in practice: not a policy document, but a structured finding with a specific remediation action tied to a specific agent.

Operationalizing Threat Modeling Continuously

A one-time threat modeling exercise produces a snapshot. Agents are not static. New agents appear daily. Creators change roles. Connectors are added. MCP server connections are established without security review. The snapshot is outdated before the report is written.

Continuous threat modeling requires runtime signals feeding back into the model. The four-phase methodology above describes the initial analysis. The continuous loop adds a fifth input: what the agent actually did.

Runtime signals that update the threat model include:

A previously low-usage agent suddenly querying large volumes of records
An agent invoking a tool it has never called before
An invoker triggering an agent from an unusual location or at an unusual time
An agent accessing data objects outside its documented workflow scope

Each of these signals is a reason to re-run Phase 2 (effective authority) and Phase 3 (action chain decomposition) for that specific agent. The threat model is not a document. It is a living picture of every agent's current blast radius.

The operationalization gap is real. Security teams across enterprises describe the same problem: they know agents exist, they know some are risky, but they have no systematic way to prioritize which ones to investigate first. The answer is toxic combination scoring. Agents with multiple simultaneous risk factors get investigated first, regardless of how any individual factor scores in isolation.

Where to start this quarter:

Run Phase 1 across your highest-priority AI platforms. Copilot Studio and Salesforce Agentforce are the most common sources of shadow agent sprawl in 2026.
Flag every agent in maker mode with org-wide or public access. That is your critical-priority list.
For each critical agent, run Phase 2 to confirm effective authority. Configuration will not tell you the answer.
Build a repeatable cadence. Threat modeling for agentic AI threats is a quarterly minimum, with runtime signal review happening continuously.

Request an AI agent risk assessment to surface the agents and toxic combinations already present in your environment before the next quarterly review.

Conclusion

Traditional threat modeling frameworks were built for systems that behave as designed. Autonomous AI agents do not. They chain actions, inherit credentials, resolve entitlements at runtime, and operate across platforms that no single security tool was built to see simultaneously.

The OWASP-aligned four-phase methodology in this article gives security teams a structured, repeatable process for answering the questions that matter: what agents exist, what they can actually do, how far their action chains reach, and which combinations of risk factors produce critical-severity findings.

The work does not end with a report. Runtime signals must feed back into the model. Effective authority changes when creators change roles. New connectors expand blast radius without any configuration review. The threat model must evolve with the agents it covers.

Security teams that build this capability now will have a defensible, audit-ready picture of their agentic AI risk. Teams that wait will continue ghost chasing, reviewing theoretical configurations while probabilistic agents operate without deterministic guardrails.

Start with inventory. Confirm effective authority. Decompose the action chains. Find the toxic combinations. Then build the runtime loop that keeps the model current.

Frequently Asked Questions

What is an agent in cybersecurity?

An agent in cybersecurity is an autonomous AI system that takes multi-step actions on behalf of users using tools, APIs, and SaaS integrations. Unlike traditional software, agents reason about goals and select actions dynamically. This makes them a distinct threat modeling challenge because their behavior is probabilistic, not fixed.

What is agentic AI and why does it create new security risks?

Agentic AI refers to AI systems capable of autonomous, goal-directed action across multiple tools and platforms. The security risk comes from three factors: agents inherit credentials that may exceed the invoking user's own permissions, they chain actions across connected systems faster than any human review process, and they operate in a visibility gap between SaaS applications where traditional security tools have no sight lines.

Why does STRIDE fail for autonomous AI agent security?

STRIDE assumes deterministic system behavior. It asks what an attacker can do to a system with fixed logic. AI agents do not have fixed logic. Their action sequences vary based on runtime reasoning. STRIDE cannot model the privilege escalation that occurs when an agent chains tool calls to reach data its initial permission grant does not obviously cover, because that chain is not predictable at design time.

What is a toxic combination in AI agent threat modeling?

A toxic combination is the presence of multiple medium-severity risk factors on the same agent simultaneously. Each factor alone might not justify urgent remediation. Together, they produce a critical-severity blast radius. A common example: an agent in maker mode with admin credentials that is also accessible to all users in the organization. Neither factor alone is critical. Together, they mean any user in the organization can retrieve admin-level data through the agent.

How often should security teams run agentic AI threat modeling?

The four-phase methodology should run on a quarterly basis at minimum for all production agents. However, toxic combination scoring and effective authority analysis should update continuously as runtime signals surface new behavior. New agents, new connectors, and changes to creator account status are all triggers for an immediate re-run of the relevant phases.

What is the difference between theoretical configuration and effective authority?

Theoretical configuration is what an agent's policy or setup file says it can do. Effective authority is what the agent can actually execute inside each connected SaaS application after all entitlements resolve. These two values are frequently different. Ninety percent of agents hold excessive privileges relative to their stated workflow scope, which means theoretical configuration consistently understates the real blast radius.

Which frameworks apply to agentic AI threat modeling?

The OWASP Top 10 for Agentic AI and OWASP LCNC-SEC provide the most operationally specific coverage for the attack patterns enterprises face today. MITRE ATLAS maps adversarial tactics to AI systems. NIST AI RMF and ISO 42001 provide governance frameworks for managing AI risk across the organization. The four-phase methodology in this article operationalizes these frameworks into a repeatable process.