Agentic AI and the New Attack Surface

Sindhu Vissamsetti

Intern - Policy & Advocacy, CyberPeace

PUBLISHED ON

Apr 10, 2026

Introduction
‍

Agentic AI systems are autonomous systems that can plan, make decisions, and take actions by interacting with external tools and environments. But they shift the nature of risk by blurring the lines among input, decision, and execution. A conventional model generates an output and stops. An agent takes input, makes plans, invokes tools, updates its state and repeats the cycle. This creates a system where decisions are continuously revised through interaction with external tools and environments, rather than being fixed at the point of input.

This means the attack surface expands in size and becomes more dynamic. Instead of remaining confined to components as in traditional computational systems, they spread in layers and can continue to grow through time. To understand this shift, the system can be analysed through functional layers such as inputs, memory, reasoning, and execution, while recognising that risk does not remain isolated within these layers but emerges through their interaction.

‍

‍

Agentic AI Attack Surface

‍

A layered view of how risks emerge across input, memory, reasoning, execution, and system integration, including feedback loops and cross-system dependencies that amplify vulnerabilities.
‍

Input Layer: Where Untrusted Data Becomes Control
‍

The entry point of an agent is no longer one prompt. The documents, APIs, files, system logs and the outputs of other agents can now be considered input. This diversity is significant due to the fact that every source of input carries its own trust assumptions, and in the majority of cases, they are weak.

The most obvious threat is prompt injection, where inputs are treated as instructions rather than data. Since inputs are treated as instructions, a virus, a malicious webpage, or a document can contain instructions that override system goals without necessarily being detected as something harmful.

Indirect prompt injection extends this risk beyond direct user interaction. Instead of targeting the interface, attackers compromise the retrieval process by embedding malicious instructions within external data sources. When the agent retrieves and processes the data, it treats the embedded content as legitimate input. As a result, the attack is executed through normal reasoning processes, allowing the system to act on untrusted data without recognising the manipulation.

Data poisoning also occurs at runtime. In contrast to classical poisoning (where training data is manipulated), runtime poisoning distorts the agent’s perception of its environment as it runs. This can change decisions without causing apparent failures.

Obfuscation introduces another indirect attacker vector. Encoded instructions or complicated forms may bypass human review but remain readable to the model. This creates asymmetry whereby the system knows more about the attack than those operating it. Once compromised at this layer, the agent implements compromised instructions which affect downstream operations.
‍

Context and Memory: Persistence of Influence
‍

Agentic systems depend on memory to operate efficiently. They often retain context across sessions and frequently store information between sessions.

This introduces a different type of risk: persistence. Through memory poisoning, attackers can insert false or adversarial information into sorted context, which then influences future decisions. Unlike prompt injection, which is often limited to a single interaction, this effect carries forward. Over time, the agent begins to operate on a distorted internal state, shaping decisions in ways that may not be immediately visible.

Another issue is cross-session leakage. Information in a particular context may be replayed in a different context when memory is being shared or there is insufficient memory separation. This is specifically dangerous in those systems that combine retrieval and long-term storage. The context management in itself becomes a weakness. Agents are required to make decisions on what to retain and what to discard. This is susceptible to attackers who can flood the context or manipulate what is still visible and indirectly affect reasoning.

The underlying problem is structural. Memory turns data into a state. Once state is corrupted, the system cannot easily distinguish valid knowledge from adversarial influence.

The issue is structural. Memory converts temporary data into a persistent state. Once this state is weakened, the system cannot reliably separate valid information from adversarial influence, making recovery significantly more difficult.
‍

Reasoning and Planning: Manipulating Intent Without Breaking Logic
‍

The reasoning layer is where agentic AI stands apart from traditional systems. The model no longer reacts to inputs alone. It actively breaks down objectives, analyses alternatives, and ranks actions.

At the reasoning stage, the nature of risk shifts. The concern is no longer limited to injecting instructions, but to influencing how decisions are made. One example is goal manipulation, where the agent subtly reinterprets its objective and produces outcomes that are technically correct but strategically harmful. Reasoning hijacking operates within intermediate steps, altering how constraints are evaluated or how trade-offs are prioritised. The system may remain internally consistent, which makes such deviations difficult to detect.

Tool selection becomes a critical control point. Agents decide which tools to use and when, so influencing these choices can redirect execution without directly accessing the tools themselves. Hallucinations also take on a different role here. In static systems, they remain errors. In agentic systems, they can trigger actions. A perceived need or incorrect judgement can translate into real-world consequences.

This layer introduces probabilistic failure. The system is not fully weakened, but it is nudged towards decisions that appear reasonable yet are incorrect. The risk lies in how those decisions are justified.

‍
‍Tool and Execution: When Decisions Gain Reach
‍

Once an agent begins interacting with tools, its behaviour extends beyond the model into external systems. APIs, databases, and services become part of the execution path.

One key risk is the use of unauthorised tools. When agents operate with broad permissions, any manipulation of the upstream can be converted into real-world actions. This makes access control a central security concern. Command injection also takes a different form here. The agent generates commands based on its reasoning, so if that reasoning is compromised, the resulting actions may still appear valid despite being harmful.

External tool outputs introduce another risk. If these systems return corrupted or misleading data, the agent may accept it without verification and incorporate it into its decisions. It is also becoming increasingly reliant on third-part tools and plugins adds to this exposure. If these components are compromised, they can affect behaviour without directly attacking the core system, creating a supply-side risk.

At this stage, the agent effectively operates as an insider. It holds legitimate credentials and interacts with systems in expected ways, making misuse harder to identify.
‍

Application and Integration: System-Level Exposure
‍

Agentic systems rarely operate in isolation. They are embedded in larger environments, interacting with identity systems, business logic, and operational workflows.

Access control becomes a major vulnerability. Agents tend to operate across multiple systems with various permission models, creating irregularities that can be exploited. Risks also arise from identity and delegation. In case an agent is operating on behalf of a user, then any vulnerabilities in authentication or session management can allow attackers to assume that authority.

Workflow execution amplifies these risks. Agents can initiate multi-step processes such as transactions, updates, or approvals. Manipulating a single step can change the result of the entire workflow. As integrations increase, so do the number of interaction points, making cumulative risk harder to track.

At this layer, failures are not isolated. They propagate into business operations, making consequences harder to contain.
‍

Output and Action: Where Failures Become Visible
‍

The output layer is where failures become visible, though they rarely originate there.

Data leakage has been a key concern. Agents may disclose information they are allowed to access, especially when tasks boundaries are not clearly defined. Misinformation and unsafe outputs are also important, particularly when outputs directly influence actions or decisions.

Generated code and commands introduce execution risk. If outputs are used without validation, errors or manipulations can have system-level effects. The shift towards autonomous action increases this risk, as small upstream deviations can lead to significant consequences without human intervention. This layer reflects symptoms rather than root causes. Addressing it alone does not reduce the underlying risk.
‍

Beyond Layers: The Missing Dimension
‍

A layered view helps, but it does not capture the full picture. Agentic systems are defined by continuous interaction across layers.

The key missing dimension is the runtime loop. Inputs shape reasoning, reasoning drives action, and actions feed back into both reasoning and memory. These cycles create feedback loops, where small manipulations may escalate over time. This also reduces observability. With multiple interacting components, it becomes difficult to trace cause and effect or identify where failures originate.

Supply chain dependencies add another layer of risk. Models, datasets, APIs, and plugins each introduce their own points of failure. A compromise at any of these points can propagate across the system. The attack surface also includes governance. Weak supervision, unclear responsibility, or excessive autonomy increase overall risk. Human control is not external to the system; it is part of its security.
‍

Conclusion: Structuring the Attack Surface
‍

Agentic AI expands the attack surface beyond traditional systems. It is both recursive and stateful. Risk does not just accumulate across layers; it moves and changes as the system operates.

Any useful representation must go beyond a linear stack. It should capture feedback loops, persistent state, and cross-layer dependencies that characterise the way these systems actually behave. The system is not a pipeline but a cycle. That is where both its capability and its risk emerge.

‍

PUBLISHED ON

Apr 10, 2026

Related Blogs

Google's Full Stack, India's Fine Print | Reading the AI Data Centre Boom Beyond the Headline

Google's Full Stack, India's Fine Print │Reading the AI Data Centre Boom Beyond the Headline

July 13, 2026

Introduction

As AI becomes more deeply integrated into everyday life and industries, Google Cloud is increasing its investment in AI-ready data centres worldwide, with India emerging as a key part of its expansion plans. Thomas Kurian’s latest India visit highlighted Google Cloud’s expanding ambitions in the country. Beyond the $15 billion, 1GW Visakhapatnam data centre announced in October 2025, Google is planning a larger multi-year AI infrastructure push, backed by partnerships with major enterprises across banking, healthcare, and digital services. This reflects a shift where countries are not only competing to create advanced AI technologies but also to build the infrastructure needed to support and lead the future AI economy. But it's worth being precise about what "building infrastructure" actually means here because it is private, foreign-headquartered capital constructing facilities on Indian soil, under terms that remain largely opaque to the public that will depend on them. That distinction matters more than the investment headline suggests.

‍

The Promise and Pressure of Google’s Full-Stack AI Strategy
‍

For decades, data centres were mainly built to store information, host websites, and support cloud applications. The rise of generative AI has completely changed that role. Today's systems need massive computing power both to train models on huge datasets and to run them every time someone generates content or automates a task. It is distinguished from traditional workloads mainly due to relying on proprietary technologies like GPU or TPU, alongside advanced networking and dynamic storage systems that complement each other and work in unison. The efforts of Google to create its own TPUs are understandable as they played a vital role in a number of achievements made by Google DeepMind. Today, the companies, government entities, and people turning to AI solutions put enormous pressure on the processing of data.

‍

The companies that are building this infrastructure are shaping ecosystems on which others will depend on. Google’s “full stack” approach that infers controlling everything from chips and AI models to cloud platforms and applications which may improve efficiency and reduce costs, but it also creates deeper dependence on a single provider. Like a hospital adopting an AI platform is not just purchasing software; over time, its data systems, workflows, and operations can become closely tied to the underlying cloud ecosystem.

‍

This concern when viewed against the concentration of the global cloud market: Amazon Web Services, Microsoft Azure, and Google Cloud together control roughly two-thirds of global cloud infrastructure, making them the dominant gatekeepers of enterprise computing. As these same companies move upward into AI models and applications while controlling the compute layer beneath them, the debate is no longer only about market share, it is about control over the entire AI value chain.
‍

Why Location Matters and Why It Isn't Enough
‍

In traditional internet services, a delay of a few milliseconds rarely mattered. However, future AI applications like autonomous vehicles, AI-assisted diagnostics, automated factory robotics will demand near-instant decision-making and cannot always depend on servers thousands of kilometres away. Regional data centres reduce that latency, which matters especially for India, where hundreds of millions are expected to interact with AI-powered services in the coming years. There is also the question of data sovereignty, and this is where the infrastructure narrative gets ahead of the regulatory reality. Governments worldwide are increasingly concerned about where citizens' and companies' data is stored and processed and local data centres are presented as the answer, but physical proximity does not automatically translate into legal accountability. Google has acknowledged that it bills cloud revenue through whichever global entity corresponds to the data centre being accessed which means an Indian client's spending on Google Cloud infrastructure inside India may still not be booked, taxed, or contractually governed as an Indian transaction. Google Cloud India Pvt. Ltd reported just ₹2,065.4 crore in FY25 revenue, strikingly disconnected from the scale of a $15 billion facility and its roster of major Indian clients. Servers on Indian soil do not by themselves guarantee that India captures the tax base, the leverage, or the oversight that "data sovereignty" implies.

This gap is widened by where India's own data protection framework stands. The Digital Personal Data Protection (DPDP) Act, 2023 leaves retention periods and purpose limitation loosely specified under Sections 8(7) and 12, and its enforcement rules are still being finalised. When hospitals or banks process data through a foundation-model platform like Gemini Enterprise, questions like where processing occurs and what audit trail exists for cross-border flows are not resolved by a local data centre's presence. At present, they rely mostly on vendor assurance rather than independent verification.
‍

Economic Opportunities: More Than Just Servers
‍

AI data centres are often imagined as buildings filled with computers, but their economic impact extends further, into energy systems, construction, engineering, semiconductor supply chains, and skilled technical work. Countries hosting these facilities can benefit from investment and job creation, while local businesses gain access to AI tools without building expensive infrastructure of their own.

For India, expanded AI infrastructure could support ambitions to become a global technology hub, and could narrow the gap in access to high-performance computing that has historically disadvantaged smaller companies and researchers. That potential is real. But it should be weighed against the terms on which it arrives, whether the economic value generated is captured domestically through tax revenue and enforceable local accountability, or whether India functions primarily as a hosting site while value accrues elsewhere. The current revenue-booking structure suggests the latter is, at minimum, a live risk rather than a settled question.

‍

The Environmental Challenge of AI Expansion
‍

However, what remains less discussed is the environmental cost behind this expansion from its impact on the power grid and water required for cooling to clearing use of renewable energy. A 1GW facility, the scale for the Visakhapatnam project is comparable to the output of a mid-sized power plant dedicated entirely to compute demand. As models grow larger and adoption accelerates, this level of energy and water consumption has become one of the central concerns of the global AI infra. As much attention as the investment figures receive, the sustainability issue behind such large-scale infrastructure deserves equal visibility.

The Future: AI Infrastructure as National Infrastructure
‍

The expansion of Google Cloud's AI data centres show a change in how the world views computing. Data centres are no longer invisible facilities operating in the background; they are becoming strategic infrastructure comparable to power grids and telecom networks. That comparison should prompt that infrastructure this consequential is usually made subject to public oversight, licensing conditions, and accountability mechanisms proportionate to its importance which is missing so far. Google Cloud's investment and the compute capacity it brings will lower barriers for Indian enterprises and researchers who have long lacked access to frontier-scale infrastructure. Against this backdrop, India needs to develop the regulatory, tax, and competition frameworks to ensure that the foundation serves the country hosting it, rather than the company that owns it.
‍

Beyond Compute: The Emerging Question of AI Sovereignty
‍

The next phase of the AI race may not be defined only by who builds the most capable models, but by who governs the infrastructure, standards, and decision making systems that those models depend upon. As advances in artificial general intelligence and discussions around superintelligence move from research laboratories into policy circles, control over compute resources is becoming a matter of strategic importance comparable to control over energy reserves or communication networks. Nations that rely entirely on external providers for advanced AI infrastructure may eventually find themselves dependent not merely for technology services, but for economic productivity, public administration, healthcare delivery, and national security capabilities. For India, the challenge is therefore larger than attracting investment. It is about ensuring meaningful domestic participation in ownership, governance, talent development, and oversight so that the intelligence systems shaping the future remain aligned with national priorities and public interest.

‍

References
‍

https://www.livemint.com/companies/news/google-will-set-up-more-ai-data-centres-in-india-as-demand-grows-says-cloud-business-ceo-thomas-kurian-11783422758078.html

‍

Empowering India's AI Vision: A Global Leap with the Paris Accelerator Programme

June 4, 2025

Introduction

In a landmark move for India’s growing artificial intelligence (AI) ecosystem, ten cutting-edge Indian startups have been selected to participate in the prestigious Global AI Accelerator Programme in Paris. This initiative, jointly facilitated by the Ministry of Electronics and Information Technology (MeitY) under the IndiaAI mission, aims to project India’s AI innovation on the global stage, empower startups to scale impactful solutions while fostering cross-border collaboration.

Launched in alignment with the vision of India as a global AI powerhouse, the IndiaAI initiative has been working on strengthening domestic AI capabilities. Participation in the Paris Accelerator Programme is a direct extension of this mission, offering Indian startups access to world-class mentorship, investor networks, and a thriving innovation ecosystem in France, one of Europe’s AI capitals.

Global Acceleration for Local Impact

The ten selected startups represent diverse verticals, from conversational AI to cybersecurity, edtech and surveillance intelligence. This selection was made after a rigorous evaluation of innovation potential, scalability, and societal impact. Each of these ventures represents India's technological ambition and capacity to solve real-world problems through AI.

The significance of this opportunity goes beyond business growth. It sets the foundation for collaborative policy dialogues, ethical AI development, and bilateral innovation frameworks. With rising global scrutiny on issues such as AI safety, bias, and misinformation, the need for making efforts for a more responsible innovation takes centre stage.

CyberPeace Outlook

India’s participation opens up a pivotal chapter in India's AI diplomacy. Through such initiatives, the importance of AI is not confined just to commercial tools but also as a cornerstone of national security, citizen safety, and digital sovereignty can be explored. As AI systems increasingly integrate with critical infrastructure from health to law enforcement, the role of cyber resilience becomes significant. With the increasing engagement of AI in several sensitive sectors like audio-video surveillance and digital edtech, there is an urgent need for secure-by-design innovation. Including parameters such as security, ethics, and accountability into the development lifecycle becomes important, aligning with its broader goal of harmonising with digital progress.

Conclusion

India’s participation in the Paris Accelerator Programme signifies its commitment to shaping global AI norms and innovation diplomacy. As Indian startups interact with European regulators, investors, and technologists, they carry the responsibility of representing not just business acumen but the values of an open, inclusive, and secure digital future.

This global exposure also feeds directly into India’s domestic AI strategies, a global platform informing policy evolution, enhancing research and development networks, and building a robust talent pipeline. Programmes like these act as bridges, ensuring India remains adaptive in the ever-evolving AI landscape. Encouraging such global engagements while actively working with stakeholders to build frameworks safeguarding national interests, protecting civil liberties, and fostering innovation becomes paramount. As India takes this global leap, the journey ahead must be shaped by innovation, collaboration, and vigilance.

References

#FactCheck-Mosque fire in India? False, it's from Indonesia

December 18, 2024

Agentic AI and the New Attack Surface

Introduction
‍

Agentic AI Attack Surface

Input Layer: Where Untrusted Data Becomes Control
‍

Context and Memory: Persistence of Influence
‍

Reasoning and Planning: Manipulating Intent Without Breaking Logic
‍

‍
‍Tool and Execution: When Decisions Gain Reach
‍

Application and Integration: System-Level Exposure
‍

Output and Action: Where Failures Become Visible
‍

Beyond Layers: The Missing Dimension
‍

Conclusion: Structuring the Attack Surface
‍

Related Blogs

Introduction

The Promise and Pressure of Google’s Full-Stack AI Strategy
‍

Why Location Matters and Why It Isn't Enough
‍

Economic Opportunities: More Than Just Servers
‍

The Environmental Challenge of AI Expansion
‍

The Future: AI Infrastructure as National Infrastructure
‍

Beyond Compute: The Emerging Question of AI Sovereignty
‍

References
‍

Introduction

Global Acceleration for Local Impact

CyberPeace Outlook

Conclusion

References

Executive Summary:

Claim:

Fact Check

Become a part of our vision to make the digital world safe for all!

Awareness

Engagement

Play your part for CyberPeace

Introduction‍

Agentic AI Attack Surface

Input Layer: Where Untrusted Data Becomes Control‍

Context and Memory: Persistence of Influence‍

Reasoning and Planning: Manipulating Intent Without Breaking Logic‍

‍‍Tool and Execution: When Decisions Gain Reach‍

Application and Integration: System-Level Exposure‍

Output and Action: Where Failures Become Visible‍

Beyond Layers: The Missing Dimension‍

Conclusion: Structuring the Attack Surface‍

Related Blogs

Introduction

The Promise and Pressure of Google’s Full-Stack AI Strategy‍

Why Location Matters and Why It Isn't Enough‍

Economic Opportunities: More Than Just Servers‍

The Environmental Challenge of AI Expansion‍

The Future: AI Infrastructure as National Infrastructure‍

Beyond Compute: The Emerging Question of AI Sovereignty‍

References‍

Introduction

Global Acceleration for Local Impact

CyberPeace Outlook

Conclusion

References

Executive Summary:

Claim:

Fact Check

Become a part of our vision to make the digital world safe for all!

Awareness

Engagement

Play your part for CyberPeace

Introduction
‍

Input Layer: Where Untrusted Data Becomes Control
‍

Context and Memory: Persistence of Influence
‍

Reasoning and Planning: Manipulating Intent Without Breaking Logic
‍

‍
‍Tool and Execution: When Decisions Gain Reach
‍

Application and Integration: System-Level Exposure
‍

Output and Action: Where Failures Become Visible
‍

Beyond Layers: The Missing Dimension
‍

Conclusion: Structuring the Attack Surface
‍

The Promise and Pressure of Google’s Full-Stack AI Strategy
‍

Why Location Matters and Why It Isn't Enough
‍

Economic Opportunities: More Than Just Servers
‍

The Environmental Challenge of AI Expansion
‍

The Future: AI Infrastructure as National Infrastructure
‍

Beyond Compute: The Emerging Question of AI Sovereignty
‍

References
‍