The Pincer: How Labs and Hyperscalers Are Taking Implementation In-House
Google Cloud and OpenAI are staffing the GenAI delivery layer themselves, squeezing the consultancy bench that has owned DAX40 rollouts..
A Forward Deployed Engineer, or FDE, is a software engineer who works inside a customer’s office rather than at the vendor’s headquarters. The role was invented by Palantir twenty years ago to make complex software actually work inside intelligence agencies and banks. In 2026, the AI labs and cloud providers are copying that model at scale. OpenAI has launched a separate company, backed by private equity, whose only job is to embed engineers with enterprise customers. Google Cloud is hiring hundreds of the same kind of engineer. The reason: most companies cannot get useful results out of a chatbot or an AI agent on their own. Until now, that translation work has been the business of Accenture, Capgemini, Deloitte, McKinsey and the rest. That business is now contested.
On a Monday morning in mid-May, Thomas Kurian, the chief executive of Google Cloud, posted to LinkedIn what looked, on first read, like a routine recruiting note. “Today we announced a new AI-focused organization,” he wrote, adding that Google would be “investing in hiring additional forward-deployed engineers to help us scale customer AI transformation.” Underneath the corporate phrasing sat a more pointed message: the hyperscaler intends to staff the GenAI delivery layer itself. Google Cloud listed 59 distinct FDE roles in the United States, London, Paris and Hong Kong, and briefed reporters at The Information that hundreds more would follow. Interview loops that used to run four to six rounds over several weeks have been compressed, in some pipelines, to two interviews in two days. The timing was not coincidental. Twenty-four hours earlier, OpenAI had launched The OpenAI Deployment Company, an entity it described, in deliberately careful language, as “a new company designed to help organizations build and deploy AI systems they can rely on every day.” The structure is unusual. OpenAI retains majority ownership and control, but the vehicle is separately capitalised: $4 billion of committed funding from a syndicate of nineteen investors at a $14 billion valuation, led by TPG with Advent International, Bain Capital and Brookfield as co-lead founding partners, and Goldman Sachs, SoftBank, Warburg Pincus and others alongside. The founding acquisition is Tomoro, an Edinburgh and London AI consultancy formed in 2023, whose client list already includes Mattel, Red Bull, Tesco and Virgin Atlantic. Tomoro brings roughly 150 experienced Forward Deployed Engineers and Deployment Specialists into the new company from day one. Not by accident, the investor list also names Bain & Company, Capgemini and McKinsey & Company as participants. The labs are inviting the consultancies into the cap table while building the operation that will, in effect, compete with their delivery arms. Gergely Orosz, who covers the engineering labour market in his Pragmatic Engineer newsletter, captured the asymmetry bluntly: building the best model, he wrote, is no longer enough; “what matters now is who can actually operationalize AI inside the world’s most complex organizations.” Orosz also reported that public-company transcripts mentioning the FDE role jumped to roughly fifty in 2025, up from eight the year before, and that Indeed postings for the title grew more than tenfold over the same period. The catch, in his telling, is that there are not enough engineers who actually want the job: it has historically been seen as less prestigious than core platform work, closer in feel to a high-end sales engineer than a Staff Engineer at a product company. The labs are betting that money, model access and customer scope solve that recruiting problem. They may be right, but only if the customers show up.
The pattern under the headlines is more interesting than either announcement on its own. For most of 2023 and 2024, the consultancies were the obvious winners of the GenAI buildout. Accenture, in particular, used its scale and its existing DAX40 and Fortune 500 relationships to convert pilot enthusiasm into paid programmes. By the close of its first fiscal quarter of 2026, Accenture reported $2.2 billion of Advanced AI bookings in a single quarter, nearly double the year-earlier figure, and roughly $1.1 billion of recognised AI revenue in the same period. Cumulative bookings since the category was created stood at $11.5 billion across more than 11,000 projects, with $4.8 billion of revenue booked. Capgemini, the most exposed European pure-play, posted Q1 2026 revenues of EUR 5.94 billion, up 11 percent at constant currency, with management explicitly attributing the print to its “cloud and AI strategy.” Both firms have been redesigning managed-services deals to embed GenAI and agentic AI into delivery. That is the business that the labs and hyperscalers are now reaching into. The historical comparison is Palantir. Its FDE bench, never more than a few thousand engineers, helped generate roughly 640 percent returns for early investors by turning bespoke engagements into a productised platform: the engineers walked into the customer, found the workflow, wrote the code, and then folded the patterns back into Foundry. McKinsey’s $300 million acquisition of QuantumBlack a decade ago tried the same logic from the consultancy side, grafting a data-science studio onto a partnership model. BCG has more recently introduced what it calls “forward deployed consultants,” a near-direct lift of the Palantir vocabulary. What is new in 2026 is who is doing the embedding. When a Bosch or a Mercedes-Benz or a Siemens runs a GenAI rollout today, the engineer in the room has historically worn an Accenture or a Capgemini lanyard, with a hyperscaler partner-manager on a call once a fortnight. In the emerging shape, the engineer wears a Google or an OpenAI badge, sits inside the customer’s data perimeter, and reports back into a lab roadmap. The model vendor sees production usage telemetry directly. The integrator becomes, at best, the second pair of hands. In DACH, where Accenture and Capgemini have built their largest European AI practices around exactly this DAX40 work, the structural exposure is concentrated. Julie Sweet, Accenture’s chief executive, signalled the threat indirectly on the Q1 2026 earnings call by announcing that the firm would stop reporting Advanced AI bookings as a separate line item from next quarter; she framed the change as a sign that AI was now “integrated and enterprise-wide.” The more cynical read is that the moment your model vendor has its own delivery arm, you stop wanting to publish the size of the prize.
The strategic logic for OpenAI and Google Cloud has three layers. First, margin: implementation services are, in pure unit-economic terms, far less attractive than software, but they are the gating factor for software revenue. Every dollar of GPT-5 or Gemini consumption inside a Fortune 500 sits behind months of integration work that someone has to do. If that work is done by an integrator the lab cannot direct, the lab cannot guarantee that its own model wins the workload, and switching costs accrue to the integrator’s playbook rather than to the model. Second, signal: an FDE inside the customer is a continuous, high-bandwidth source of product feedback. Palantir’s FDEs were, in effect, the company’s primary product-discovery mechanism, and the labs have read that lesson. Third, defensibility: the consultancies have spent two years building model-agnostic delivery layers explicitly so that they can swap GPT for Gemini for Claude without disturbing the customer. The labs would prefer that not to be true. Embedding their own engineers makes it materially less true. The counter-argument, voiced most carefully by Orosz and more bluntly by critics like Ed Zitron and Gary Marcus, is that the labs are about to learn why services businesses trade at single-digit multiples. Recruiting senior engineers into customer-site roles is hard. Managing a partnership-style P&L inside a research-driven culture is harder. And the MIT NANDA finding, widely circulated this spring, that 95 percent of enterprise GenAI pilots show little or no measurable return, suggests that the demand the labs are staffing for may be softer than the headcount plans assume.
For a DAX40 CIO, the immediate question is contractual. If Google or OpenAI engineers sit inside the data perimeter, where does liability for a model error land, and how does that flow through the existing managed-services contract with Accenture or Capgemini? The second-order question is governance: an integrator can be told to swap one model for another. A lab-employed FDE cannot. CIOs who have spent the last eighteen months building model-agnostic reference architectures should expect a procurement conversation in which the lab offers free or near-free engineering hours in return for a multi-year commitment to one model family. That is a discount worth taking only if the architecture genuinely allows reversal.
European supervisors will read this through the AI Act and DORA lenses. An FDE embedded with a bank or insurer is, in effect, a critical third-party ICT provider with privileged access to production systems. Under DORA, that triggers concentration-risk reporting and the right of supervisors to inspect. Under the AI Act, the lab moves closer to being a deployer rather than purely a provider, with the obligations that follow. BaFin in Germany and the ACPR in France have already signalled discomfort with hyperscaler concentration in the cloud layer. Lab-employed FDEs sitting inside regulated entities sharpen that question rather than blunting it, and may push some institutions back towards integrator-mediated delivery on pure compliance grounds.
For the AI-services startup landscape, the OpenAI Deployment Company is both validation and an existential threat. Tomoro, founded in 2023, was acquired before completing a Series A; that is the bull case, an early and lucrative exit path. The bear case is that every applied-AI consultancy with under 300 engineers now faces a buyer of last resort that distorts pricing for everyone else. Expect a near-term wave of M&A in the segment, particularly in the UK and DACH, and a corresponding slowdown in growth-stage funding for category-pure GenAI services plays. Founders pitching the next Tomoro will need a defensible vertical or a sovereign-cloud angle that the labs cannot replicate.
Sources 10 references
- [1]The Pulse: Forward deployed engineering heats up again
- [2]OpenAI launches the OpenAI Deployment Company to help businesses build around intelligence
- [3]Thomas Kurian on LinkedIn: Today we announced a new AI-focused organization
- [4]Google to Hire Hundreds of Engineers to Help Customers Adopt Its AI
- [5]OpenAI spins up standalone consulting business
- [6]OpenAI acquires Tomoro as founding piece of $14 billion Deployment Company
- [7]Accenture (ACN) Q1 2026 Earnings Call Transcript
- [8]Capgemini Q1 2026 revenues press release
- [9]Is the FDE role becoming less desirable?
- [10]Palantir is the world’s most successful forward-deployed engineering company