Mr. Aayush Bhatt
June 24, 2026 · 12 min read
Intel's Kevork Kechichian Said It Best: AI Doesn't Scale as Parts, It Scales as a System — Here Is What That Means for Every Business
Intel's data center chief said AI scales as a coordinated system, not a collection of parts. Most businesses are still buying parts. That is their mistake.
Introduction
Somewhere in the gap between what businesses expect from AI and what they actually get, there is a structural problem that almost nobody is naming correctly. Companies subscribe to AI tools. They add AI features to their software stack. They run pilots, build chatbots, generate reports, and automate individual tasks. And then they report, with a mixture of frustration and confusion, that the productivity gains they were promised are not arriving at the scale they expected. The tools work. The system does not exist.
On June 1, 2026, at Computex in Taipei, Kevork Kechichian — Executive Vice President and General Manager of Intel's Data Center Group — gave that problem its most precise public formulation. "AI doesn't scale as a collection of parts," he said. "It scales as a coordinated system." The statement was made in the context of Intel's data center announcements, including the Xeon 6+ processor and expanded networking portfolio. But the principle it articulates extends far beyond server architecture. It describes the exact mistake that most businesses, at every level of the market, are currently making with their AI investments — and it points toward what the correction needs to look like.
What Kechichian's Statement Actually Means
The instinct when companies deploy AI is to think in terms of components. You buy a language model subscription. You add a copilot to your productivity suite. You deploy a chatbot for customer support. You use a separate tool to generate marketing copy, another to summarise meeting transcripts, and a third to draft code. Each of these tools has measurable value in isolation. None of them knows the others exist. None of them shares context. None of them can hand a task to another. The work that requires moving between them falls, as it always has, on the human beings sitting in the middle.
This is the architecture of AI as a collection of parts. It is also the architecture that produced the phenomenon researchers describe as the "AI productivity paradox" — where individual workers report real efficiency gains from AI tools while organisations report negligible change in bottom-line performance. The gains are isolated. They do not compound. They do not multiply across functions. They stop at the boundary of each individual tool, because the tools were never designed to coordinate with each other, and the organisation never built the layer that would allow them to.
What Kechichian is describing as a "coordinated system" is the alternative architecture: one in which AI components share context, hand off work to each other, operate on the same underlying data layer, and are orchestrated by a unified control plane rather than navigated manually by individual users. The performance characteristics of this architecture are qualitatively different from the collection-of-parts model. Tasks that used to require a human to move information between six tools can be automated end-to-end. Processes that used to take days because each step waited for a human handoff can operate continuously. And the intelligence the system develops — about the business's data, rules, customers, and logic — accumulates in a shared layer rather than being isolated in each individual tool's context window.
The CPU as the Orchestration Hub
The hardware-level version of Kechichian's argument is specific and instructive. Intel's announcement at Computex was built around the idea that the CPU — which has spent several years playing a secondary role to GPUs in the AI infrastructure conversation — is re-emerging as the critical component in agentic AI deployments. Not because it does the heavy mathematical lifting, but because it does everything else.
Intel described the role in straightforward terms: with Xeon serving as the control plane, the company is taking a systems-level approach to AI performance. The Xeon 6+ processor is built on Intel's 18A process, designed for sustained performance under real-world power constraints, with an emphasis on orchestration, concurrency, and data movement. Up to 288 efficient-cores deliver high concurrency and strong responsiveness for the kinds of workloads that agentic AI actually generates: managing multiple simultaneous agent sessions, routing requests between models, maintaining state across long-running tasks, and moving data to wherever it needs to be for the next step in a workflow.
GPUs, in this architecture, are the inference engines — the components that do the compute-heavy work of generating model outputs. They are exceptional at what they do, but they are not designed to manage the complexity of coordinating multiple models, maintaining context across long conversations, or orchestrating the dozens of tool calls that a modern AI agent might make in the course of completing a single task. That work lands on the CPU and the network. As Intel argued at Computex, as AI systems shift from single models toward fleets of autonomous agents, the hard problems move to orchestration, concurrency, and data movement between components — and that is precisely where the CPU becomes the ceiling on how well the overall system performs.
For businesses, the practical implication is this: buying a more powerful GPU, or subscribing to a more capable AI model, does not solve the orchestration problem. The model's quality is only realised if the system around it — the control plane, the data routing, the context management, the tool coordination — can deliver the right information at the right moment and pass results between components without losing fidelity. A mid-market company that deploys Nvidia's most advanced inference accelerator but routes everything through a patchwork of manual API calls will consistently underperform a competitor running a more modest model inside a genuinely coordinated architecture.
What MCP Is Doing to Enable Coordination
The software layer equivalent of what Kechichian described in hardware terms is the Model Context Protocol, and its emergence as the dominant standard for AI agent coordination is the development that makes the systems-level architecture practically achievable for companies that are not building their own infrastructure from scratch.
MCP, originally developed by Anthropic and open-sourced in November 2024, was donated to the Linux Foundation's Agentic AI Foundation in December 2025, with OpenAI, Google, Microsoft, Amazon Web Services, and Cloudflare all joining as members or co-founders. That governance transition was the moment MCP stopped being one company's tool and became the industry's shared infrastructure. By early 2026, the protocol's TypeScript and Python SDKs had reached 97 million monthly downloads. The official registry counted over 9,600 public servers. Gartner projected that by the end of 2026, 40% of enterprise applications would include task-specific AI agents, and 75% of API gateway vendors would have MCP features built in.
What MCP actually does is straightforward in concept: it provides a standardised interface through which AI agents can access external tools and data sources — databases, CRMs, calendars, document management systems, code repositories, and any other enterprise application that exposes an MCP server. Before MCP, connecting an AI agent to an enterprise system required a custom integration for every pairing. An agent that needed to read from a CRM, write to a project management tool, and query a data warehouse needed three separate custom connections that had to be individually built and maintained. With MCP, any agent that supports the protocol can access any tool that supports the protocol through the same interface. The coordination layer is standardised. The custom integration problem largely disappears.
The business significance is direct. MCP is the protocol that allows a company to build a genuinely coordinated multi-agent system rather than a collection of individually integrated tools. A compliance agent can hand context to a reporting agent. A customer support agent can escalate a case to an operations agent that can actually query the relevant system and take action. A finance agent can pull verified data from the same governed workflow that the CFO's team has already certified, rather than querying raw tables and producing outputs that no one can audit. The shared protocol is what makes all of those handoffs possible without each one requiring a custom engineering project.
What Businesses Get Wrong When They Buy AI as Parts
The specific failure mode that Kechichian's framing identifies is one that most technology leaders can recognise in their own organisations, even if they have not previously put a name to it.
The mistake usually begins with the procurement process. AI tools are evaluated and purchased by individual departments or functions, each optimising for their own specific use case. Marketing buys an AI content tool. Sales buys an AI prospecting tool. Finance buys an AI analysis tool. Each tool is assessed on its own merit, approved on its own budget, and deployed in its own silo. The integrations between them are afterthoughts, if they exist at all. The shared data layer that would allow the marketing agent to know what the sales agent has been doing, or the finance agent to draw on the same customer data that the support team is working with, is never built. Each tool operates on its own local copy of partial information.
The result is that each AI tool improves the specific task it was purchased for, while the business processes that cross functional boundaries — which are, by definition, the ones that determine whether a company actually serves its customers well — remain just as fragmented as before. The McKinsey 2025 State of AI research found that while individual AI adoption was widespread, only a fraction of organisations had successfully scaled AI into genuine enterprise-wide transformation. The gap between the two is, in most cases, exactly this: the difference between AI as parts and AI as a system.
Companies that have successfully crossed that gap share a common architectural decision: they built or adopted a shared context layer before they scaled their deployment of individual tools. They defined how data would flow between AI components before they acquired the components. They decided who would govern the system, how agents would be monitored, and how outputs would be audited, before the system became complex enough that those decisions were difficult to make retroactively. In other words, they thought about the system first and the parts second. Most organisations do the opposite.
What a Systems-Level AI Architecture Looks Like for a Mid-Sized Company
For a company of several hundred to a few thousand employees operating in 2026, a systems-level AI architecture does not require the capital expenditure of a hyperscaler or the engineering capacity of a technology company. It does require a different way of thinking about what the goal of an AI deployment actually is.
The starting point is a shared data and business logic layer — the equivalent, in software terms, of what Kechichian described the CPU providing in hardware terms: a control plane that has access to the organisation's actual data, understands its actual rules, and can make that context available to any AI component that needs it. Practically, for a mid-sized company, this often means ensuring that the most important business logic — revenue rules, customer definitions, compliance constraints, approval workflows — is encoded in governed, auditable workflows rather than embedded in the prompt of a single AI chatbot that no one can inspect or maintain.
On top of that foundation, MCP-compatible agents can be deployed to perform specific tasks: one agent that manages customer support escalations, one that monitors and flags compliance anomalies, one that produces financial summaries on demand, one that assists with code review. These agents do not each need their own separate integration to the organisation's data. They access it through the same protocol, with the same governance, and with the same audit trail. When one agent's output is relevant to another's task — when a customer support escalation should trigger a review by the finance agent, for example — the context can be passed between them through the MCP layer without a human in the middle.
The orchestration layer — the equivalent of the CPU's role in Intel's architecture — is the component that manages which agent is doing what at any given moment, ensures that long-running tasks do not drop their state, and surfaces outputs to the humans who need to review or approve them. This can be a dedicated orchestration platform, an internal system built on an open-source agent framework, or an enterprise AI platform that includes orchestration as part of its offering. The specific tool matters less than the architectural decision that such a layer needs to exist and that it needs to be the first thing built, not the last.
Conclusion
Kevork Kechichian's statement at Computex was made in the context of hardware architecture, but the principle it contains applies equally to every boardroom, IT department, and operations team that is currently trying to figure out why their AI investment has not delivered what the vendor promised. AI does not scale as a collection of parts. It scales as a coordinated system. The model quality matters. The GPU performance matters. The choice of vendor matters. But none of those decisions produces compound organisational value until there is a shared context layer, a coherent orchestration layer, and a governance model that treats the system as a whole rather than a procurement catalogue of individually justified tools.
MCP has made the software coordination layer achievable. Intel's Xeon 6+ and the broader shift back toward CPU-centric infrastructure have made the hardware orchestration layer practical. The architectural thinking that connects them is the only remaining variable — and it is the one that no vendor can supply on your behalf. For business leaders in 2026, the question is not which AI tool to buy next. It is whether the thing you are building deserves to be called a system yet. For most organisations, the honest answer is not yet. The good news is that the path to it has never been more clearly marked.
Written by
Mr. Aayush Bhatt
Software Engineer interested in how models work and where they fail.