AI Agent Privilege Escalation: How Agents Inherit Dangerous Permissions

Obsidian Editorial Team

Security Research

Obsidian Security

May 16, 2026

June 1, 2026

Key Takeaways

AI privilege escalation does not require an exploit.
It requires invoking the right agent.
A user without Salesforce access opens a chat interface, asks an AI agent a question about pipeline data, and receives a detailed answer.
The agent ran the query using its creator's admin credentials.
The IAM policy that was supposed to prevent exactly this did nothing.
That is AI privilege escalation in its current, operational form.

What AI Agent Privilege Escalation Is

Security teams understand traditional privilege escalation well. An attacker exploits a vulnerability, abuses a misconfigured permission rule, or cracks a credential to move from a low-privilege account to a high-privilege one. The escalation requires an active attack step. Something breaks or something is abused, and forensics can usually find the seam.

AI privilege escalation works differently. Nothing breaks. The agent is doing exactly what it was configured to do. The escalation happens because the agent holds credentials that exceed what the invoking user is authorized to access, and no existing control checks whether the invoker's permissions align with the agent's permissions before execution begins.

The core mechanism: an AI agent is not a passive tool. It holds tokens, OAuth grants, service account credentials, and embedded API keys. When a user invokes that agent, the agent acts using its own stored credentials, not the user's. If the agent's credentials grant access to data the user is not authorized to see, the user gets that data anyway. The IAM boundary that was supposed to stop them never had a chance to fire.

This is the visibility gap that security teams describe as ghost chasing. The configuration looks correct. The user's access policy is correctly scoped. But the agent operates in a layer between the user and the SaaS application where no traditional access control applies.

The distinction from traditional privilege escalation matters for how security teams respond. Traditional privilege escalation leaves exploit artifacts. AI privilege escalation leaves only normal agent activity logs, which look identical to authorized usage.

The Five Privilege Escalation Patterns in Agentic Systems

AI agent privilege escalation is not a single attack pattern. It is a family of five related mechanisms, each with a different trigger and a different remediation path.

Maker Mode

When an agent is built using the creator's credentials and those credentials are embedded into the agent's connector configuration, every user who invokes the agent operates at the creator's privilege level. A business analyst invokes a Copilot Studio agent built by a Salesforce administrator. The agent queries CRM records using the administrator's embedded credentials. The analyst retrieves data they were never provisioned to see. No access policy was violated from the agent's perspective. The IAM boundary was bypassed entirely.

This is maker mode privilege escalation. It is the most common pattern in enterprise environments and the most difficult to detect without correlating the invoker's identity against the agent's credential source.
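
A minimal sketch of the mechanism, under stated assumptions: `Connector`, `invoke`, and the scope strings are hypothetical names chosen for illustration, not the API of Copilot Studio or any other platform. It shows the one property that matters: the access decision consults the stored credential, and the invoker's own scopes never enter it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Connector:
    """Hypothetical connector whose credential was embedded at build time."""
    owner: str               # identity whose credential is stored in the agent
    scopes: frozenset[str]   # what that stored credential can reach


def invoke(connector: Connector, invoker: str,
           invoker_scopes: frozenset[str], needed_scope: str) -> dict:
    """Run a request the way a maker-mode agent does: with the stored credential.

    The invoker's scopes are not part of the access decision. They are used
    here only to label whether the call amounted to an escalation.
    """
    allowed = needed_scope in connector.scopes            # decided by the owner's scopes
    escalated = allowed and needed_scope not in invoker_scopes
    return {
        "invoker": invoker,
        "executed_as": connector.owner,
        "allowed": allowed,
        "escalated": escalated,
    }


# An analyst with no CRM access invokes an agent built by a Salesforce admin.
crm = Connector(owner="sf_admin", scopes=frozenset({"crm:read", "crm:write"}))
result = invoke(crm, invoker="analyst",
                invoker_scopes=frozenset(), needed_scope="crm:read")
```

In this sketch, `result` records a request that was allowed, executed as `sf_admin`, and escalated. Detecting that last field is only possible because the invoker's scopes were available alongside the credential owner's.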

Action Chaining

A single agent call triggers a sequence of downstream tool calls across multiple applications. Each step in the chain may appear authorized in isolation. The combined sequence accesses data that no single step should have reached. Action chaining is how an agent with moderate permissions produces an outcome that would otherwise require elevated permissions: it assembles access across multiple steps that are each authorized and individually limited.

Confused Deputy

An agent with elevated permissions is manipulated into performing an action on behalf of a user who lacks the authority to request that action directly. The agent is the deputy. The user is the unauthorized principal. The agent cannot verify whether the invoker has the right to request the specific action being executed. It trusts the invocation. This is the confused deputy problem applied to agentic systems, and it is a direct consequence of agents lacking invoker-permission verification at runtime.

Credential Inheritance

Agents inherit OAuth tokens, service account roles, and API keys from the applications they connect to. When those credentials are over-scoped, the agent inherits excess authority. When the agent is then shared across teams or made org-wide accessible, every user in the organization effectively inherits that excess authority through the agent. Credential inheritance is not a configuration error in isolation. It becomes a privilege escalation vector when combined with broad agent accessibility.
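
One way to make the inherited excess concrete is a set difference per user. The sketch below is illustrative only: `inherited_excess`, the user names, and the scope strings are assumptions, and a real deployment would pull both sides from its own identity and connector inventories.

```python
def inherited_excess(agent_scopes: set[str],
                     audience: dict[str, set[str]]) -> dict[str, set[str]]:
    """Return, per user, the scopes that user gains only through the agent.

    `audience` maps each user who can invoke the agent to the scopes that
    user holds directly. Users who gain nothing are omitted from the result.
    """
    excess = {}
    for user, own_scopes in audience.items():
        gained = agent_scopes - own_scopes
        if gained:
            excess[user] = gained
    return excess


# Hypothetical over-scoped agent shared with three users.
agent_scopes = {"crm:read", "files:read", "hr:read"}
audience = {
    "alice": {"crm:read", "files:read", "hr:read"},  # already holds everything
    "bob": {"crm:read"},
    "carol": set(),
}
excess = inherited_excess(agent_scopes, audience)
```

Widening the audience only ever adds entries to `excess`, which is why the same over-scoped credential becomes more dangerous as the agent is shared more broadly.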

Transitive Trust

Agent A has limited permissions. Agent B has broad permissions. Agent A is configured to call Agent B as part of its workflow. A user invokes Agent A, which invokes Agent B, which accesses data the user and Agent A were never supposed to reach. The trust relationship between agents creates a transitive escalation path. No single agent in the chain violated its own permission boundary. The chain as a whole crossed a boundary that no individual component was supposed to cross.
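
The transitive path can be computed as reachability over the agent call graph. This is a sketch under assumptions: the agent names, the scope strings, and the `effective_scopes` helper are hypothetical, and it treats every configured agent-to-agent call as a path that can be exercised.

```python
from collections import deque


def effective_scopes(start: str,
                     scopes: dict[str, set[str]],
                     calls: dict[str, list[str]]) -> set[str]:
    """Union of the scopes held by `start` and by every agent reachable from it."""
    seen = {start}
    queue = deque([start])
    reachable: set[str] = set()
    while queue:
        agent = queue.popleft()
        reachable |= scopes.get(agent, set())
        for callee in calls.get(agent, []):
            if callee not in seen:      # guards against cycles in the call graph
                seen.add(callee)
                queue.append(callee)
    return reachable


scopes = {
    "agent_a": {"tickets:read"},
    "agent_b": {"crm:read", "finance:read"},
}
calls = {"agent_a": ["agent_b"]}

# Scopes Agent A can reach only by calling other agents.
transitive_gain = effective_scopes("agent_a", scopes, calls) - scopes["agent_a"]
```

A non-empty `transitive_gain` is the boundary crossing described above: neither agent exceeded its own permissions, yet the caller's reach grew.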

These five patterns frequently appear together. An agent operating in maker mode that also performs action chaining and calls a second agent via transitive trust represents a toxic combination: multiple risk factors on a single agent creating compounding, critical-priority risk.

How AI Action Chaining Compounds Escalation

Action chaining deserves a concrete walkthrough because it is the mechanism that turns a moderate escalation event into a high-severity incident.

Consider this scenario: a sales operations user invokes an n8n workflow agent to pull a summary of recent deal activity. The agent is authorized to read Salesforce opportunity records. That is the intended scope.

Step one: The agent queries Salesforce and retrieves opportunity data. Authorized.

Step two: The agent calls a connected document tool to retrieve the contract templates associated with those opportunities. The document tool connection was added by the workflow builder for a different use case. The agent's connector has read access to the entire contracts folder, not just the templates. Authorized by the connector scope, but broader than the task requires.

Step three: The agent calls a summarization tool and passes the full document folder contents as context. The summarization tool logs inputs to an external service for model improvement. The contracts folder contains compensation data, legal terms, and client PII. That data just left the organization.

No single step was an exploit. Each tool call was within the agent's configured permissions. The chain as a whole produced a data exposure that no individual step would have caused. This is the multiplicative blast radius of action chaining: each additional tool call in the sequence expands the potential exposure surface, and the final outcome bears no resemblance to what the agent was supposed to do.

The challenge for security teams is that action chaining is invisible to tools that only examine agent configuration. The configuration shows what tools the agent can call. It does not show what sequence of calls the agent actually executed, what data moved between steps, or what the cumulative access footprint of the full chain was. Runtime truth, not theoretical configuration, is the only way to see the chain as it executes.
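
The three-step scenario can be replayed as a runtime audit that accumulates the chain's footprint. This is a sketch, not a product feature: `ToolCall`, `audit_chain`, the tool names, and the resource labels are hypothetical, the intended scope is assumed to cover opportunities and contract templates, and the audit conservatively assumes anything read earlier in the chain may be passed forward as context.

```python
from dataclasses import dataclass


@dataclass
class ToolCall:
    tool: str
    resources_read: set[str]
    sends_externally: bool = False   # True if inputs leave the organization


def audit_chain(chain: list[ToolCall], intended: set[str]) -> dict:
    """Accumulate what the chain has read and flag external sends of excess data."""
    footprint: set[str] = set()
    findings = []
    for step, call in enumerate(chain, start=1):
        footprint |= call.resources_read
        out_of_scope = footprint - intended
        # Conservative assumption: earlier reads may be forwarded as context.
        if call.sends_externally and out_of_scope:
            findings.append((step, call.tool, sorted(out_of_scope)))
    return {
        "footprint": footprint,
        "out_of_scope": footprint - intended,
        "findings": findings,
    }


chain = [
    ToolCall("salesforce.query", {"salesforce:opportunities"}),
    ToolCall("docs.read_folder", {"docs:contract_templates", "docs:contracts_folder"}),
    ToolCall("summarizer", set(), sends_externally=True),
]
intended = {"salesforce:opportunities", "docs:contract_templates"}
report = audit_chain(chain, intended)
```

A configuration review would see three permitted tools. Only the ordered execution record shows that the third call could carry the contracts folder out of the organization.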

These risk factors map to categories in the OWASP Low-Code/No-Code Top 10, particularly LCNC-SEC-01 (Account Impersonation), LCNC-SEC-02 (Authorization Misuse), and LCNC-SEC-03 (Data Leakage and Unexpected Consequences).

AI Agent Data Exfiltration as the Outcome

Privilege escalation is not the end goal. It is the precondition. AI agent data exfiltration is what privilege escalation makes possible, and it happens at a speed and scale that human-paced incident response cannot match.

AI agents move 16 times more data than human users. When an agent with escalated privileges begins querying records, the volume of data accessed in a single session can exceed what a human insider would access over months. By the time a security team identifies the anomaly, the data has already moved.

The detection challenge is specific. Agent activity logs, where they exist at all, record the agent's actions against the agent's own credentials. They do not record who invoked the agent, whether the invoker was authorized to receive the data the agent retrieved, or what the invoker did with the output. The log shows a normal API call from a known agent identity. Nothing in that log signals that a user without the required access permissions just received restricted CRM data, contract terms, or financial records.

This is the machine insider risk problem stated precisely. The agent acts like an insider because it holds credentials and accesses data. But no insider risk program covers it. The agent has no behavioral baseline, no manager who reviews its access during quarterly certifications, and no MFA challenge that would interrupt an unusual access pattern. It operates continuously, at machine speed, with the full authority of whoever provisioned its credentials.

Detecting AI agent data exfiltration requires correlating the invoker's identity with the agent's effective authority, not just logging the agent's API calls. Without that correlation, the log is noise.
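
The correlation amounts to a join between two records that normally live apart. The sketch below assumes a shared `session_id` links the agent's API log to the platform's invocation record. That key, the field names, and the `correlate` helper are all hypothetical, and many environments will not expose such a key directly.

```python
def correlate(agent_events: list[dict],
              invocations: list[dict],
              entitlements: dict[str, set[str]]) -> list[dict]:
    """Attach the invoker to each agent event and flag access beyond their entitlements."""
    invoker_by_session = {inv["session_id"]: inv["user"] for inv in invocations}
    flagged = []
    for event in agent_events:
        user = invoker_by_session.get(event["session_id"])
        if user is None:
            # The agent acted, but nothing records on whose behalf.
            flagged.append({**event, "invoker": None, "reason": "no invoker recorded"})
        elif event["resource"] not in entitlements.get(user, set()):
            flagged.append({**event, "invoker": user, "reason": "invoker not entitled"})
    return flagged


agent_events = [
    {"session_id": "s1", "agent": "pipeline-bot", "resource": "crm:opportunities"},
    {"session_id": "s2", "agent": "pipeline-bot", "resource": "crm:opportunities"},
    {"session_id": "s3", "agent": "pipeline-bot", "resource": "crm:opportunities"},
]
invocations = [
    {"session_id": "s1", "user": "sf_admin"},
    {"session_id": "s2", "user": "analyst"},
]
entitlements = {"sf_admin": {"crm:opportunities"}, "analyst": set()}
flagged = correlate(agent_events, invocations, entitlements)
```

The three agent events are identical on their own. Only after the join do sessions `s2` and `s3` stand apart from `s1`.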

AI Privilege Escalation Prevention Framework

Privilege escalation prevention in agentic systems requires three layers working together. Each layer addresses a different failure mode. None of the three is sufficient alone.

Deterministic Guardrails

Probabilistic agents require deterministic guardrails. An AI agent's behavior is non-deterministic by design. It can deviate from its intended workflow based on input, context, and model state. A deterministic guardrail does not try to predict what the agent will do. It enforces fixed rules about what the agent is allowed to do, regardless of what the agent decides.

Effective deterministic guardrails for AI privilege escalation include:

Invoker-permission verification: before an agent executes a sensitive action, verify that the invoking user's permissions include the right to request that action. If the invoker cannot perform the action directly, the agent should not perform it on their behalf.
Maker mode detection and blocking: flag any agent connector operating in maker mode with sensitive data access. Require explicit approval before that agent is made accessible to users who do not hold the same permissions as the creator.
Action chain scope limits: enforce maximum tool call depth per session. Require re-authorization for tool calls that access data outside the originally requested scope.
Org-wide accessibility controls: block agents from being made org-wide accessible if they hold connectors with sensitive data access, until a least-privilege review is completed.
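
Two of the guardrails above, invoker-permission verification and action chain scope limits, can be sketched as a per-session check that runs before every tool call. `SessionGuard`, `GuardrailViolation`, and the scope strings are hypothetical names, and the default depth limit of 3 is an arbitrary illustrative value rather than a recommendation.

```python
class GuardrailViolation(Exception):
    """Raised when a tool call breaks a fixed rule, whatever the agent intended."""


class SessionGuard:
    def __init__(self, invoker_scopes: set[str],
                 approved_scopes: set[str], max_depth: int = 3):
        self.invoker_scopes = set(invoker_scopes)
        self.approved = set(approved_scopes)   # scope of the original request
        self.max_depth = max_depth
        self.depth = 0

    def check(self, required_scope: str) -> None:
        """Call before each tool call. Every attempt counts toward the depth limit."""
        self.depth += 1
        if self.depth > self.max_depth:
            raise GuardrailViolation("tool call depth limit exceeded")
        if required_scope not in self.invoker_scopes:
            raise GuardrailViolation(f"invoker lacks {required_scope}")
        if required_scope not in self.approved:
            raise GuardrailViolation(f"re-authorization required for {required_scope}")

    def reauthorize(self, scope: str) -> None:
        """Record explicit approval to go beyond the originally requested scope."""
        if scope not in self.invoker_scopes:
            raise GuardrailViolation(f"invoker lacks {scope}")
        self.approved.add(scope)


guard = SessionGuard(invoker_scopes={"crm:read", "docs:read"},
                     approved_scopes={"crm:read"})
guard.check("crm:read")   # passes: invoker holds the scope and it was requested
```

The checks compare sets and counters only. Nothing in them depends on the model's output, which is what makes the guardrail deterministic.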

Identity Correlation

The visibility gap in AI privilege escalation is an identity correlation problem. Security teams need to know three things simultaneously: who invoked the agent, what credentials the agent used to execute, and whether the invoker's permissions align with the agent's effective authority. No single data source provides all three.

Closing this gap requires an identity graph that maps the relationship between human identities, agent identities, and the SaaS entitlements those agents hold. Effective authority, not theoretical configuration, is the output. When a user invokes an agent, the system compares that user's entitlements against the maker-mode credentials the agent will execute with and flags the delta. That is the correlation event that makes privilege escalation visible.

Runtime Monitoring

Configuration review catches what agents are set up to do. Runtime monitoring catches what they actually do. For AI privilege escalation prevention, the monitoring layer must capture:

Every tool call an agent executes, in sequence, with the invoker's identity attached.
Data access events that cross permission boundaries relative to the invoker's own entitlements.
Agent-to-agent communication that creates transitive trust escalation paths.
Orphaned agent activity: agents whose creator accounts are disabled but whose credentials remain active and operational.
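
The last item, orphaned agent activity, reduces to a simple inventory check once creator status and credential status sit side by side. The record fields and the `orphaned_agents` helper below are assumptions made for illustration. Real inventories would come from the agent platform and the identity provider.

```python
def orphaned_agents(agents: list[dict], disabled_users: set[str]) -> list[str]:
    """Names of agents whose creator is disabled but whose credential still works."""
    return [
        agent["name"]
        for agent in agents
        if agent["creator"] in disabled_users and agent["credential_active"]
    ]


agents = [
    {"name": "pipeline-bot", "creator": "sf_admin", "credential_active": True},
    {"name": "hr-helper", "creator": "former_employee", "credential_active": True},
    {"name": "old-report", "creator": "former_employee", "credential_active": False},
]
orphans = orphaned_agents(agents, disabled_users={"former_employee"})
```

Here only `hr-helper` qualifies: its creator is gone and its credential still works. `old-report` has the same creator but no live credential.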

Frequently Asked Questions

What is AI privilege escalation and how does it differ from traditional privilege escalation?

AI privilege escalation occurs when an AI agent executes actions using credentials that exceed what the invoking user is authorized to access, without any exploit or attack step. Traditional privilege escalation requires an active attack, such as exploiting a vulnerability or abusing a misconfigured permission. AI privilege escalation requires only invoking an agent that holds credentials the invoker was never supposed to use.

What is maker mode and why is it a privilege escalation risk?

Maker mode refers to an agent configuration where the creator's credentials are embedded into the agent's connector. Any user who invokes the agent executes actions at the creator's privilege level, regardless of the invoker's own permissions. A user without Salesforce access can invoke a maker mode agent built by a Salesforce administrator and retrieve data they were never provisioned to see.

How does action chaining amplify the blast radius of an escalation event?

Action chaining is the sequential execution of multiple tool calls across applications. Each step may be individually authorized, but the combined sequence can access data that no single step should have reached. The blast radius is multiplicative because each additional tool call in the chain expands the potential exposure surface, and the final outcome can far exceed what the agent's individual permissions would suggest.

What is a confused deputy attack in the context of AI agents?

A confused deputy attack occurs when an agent with elevated permissions is manipulated into performing an action on behalf of a user who lacks the authority to request that action directly. The agent trusts the invocation and executes using its own elevated credentials, bypassing the access controls that would have blocked the user from making the same request directly.

Why do traditional IAM controls fail to prevent AI privilege escalation?

Traditional IAM controls are designed to govern human identity lifecycle events: provisioning, de-provisioning, quarterly access reviews, and MFA-based authentication. AI agents bypass these controls because they hold their own credentials, operate continuously without interactive authentication, and are not covered by any existing insider risk program. The IAM policy governing the human user never intercepts the agent's credential-based execution.

What are deterministic guardrails and why do probabilistic agents require them?

Deterministic guardrails are fixed, predictable enforcement rules applied to AI agents at runtime. They do not predict agent behavior. They enforce absolute boundaries on what actions an agent is permitted to take, regardless of what the agent's probabilistic reasoning produces. Because AI agents are non-deterministic by design and can deviate from intended workflows, the controls governing them must be deterministic to be reliable.

- [What Are Agentic Guardrails? Deterministic Controls for Probabilistic Systems](/agentic-guardrails)
- [Maker Mode Security: Why Fixed-Credential Agent Connections Are a Critical Risk](/blog/maker-mode-security)
- [Agent-to-Agent Communication Security: The Multi-Agent Blind Spot](/ai-agent-runtime-security)
- [Shadow AI vs. Shadow AI Agents: Two Different Risk Classes](/shadow-ai-security)