Brussels Turns AI Labelling From Slogan Into Pipeline Code
The first full draft of the EU's AI content-marking Code lands seven weeks before Article 50 bites, forcing DAX40 firms to retool every channel that ships pixels..
On 10 June 2026 the European Commission put out a near-final draft of the rulebook that tells anyone shipping AI-generated images, audio, video or text inside the EU how to mark it so a machine can tell it apart from human work. It is called the Code of Practice on marking and labelling of AI-generated content, and it spells out the practical detail behind Article 50 of the AI Act, the transparency clause that becomes binding on 2 August 2026. In plain terms: if your bank's chatbot writes an email, your carmaker's infotainment voice answers a driver, or your agency's tool generates a campaign visual, that output now has to carry a hidden tag that downstream platforms can read. The Code arrives a week after the EU's broader Technological Sovereignty Package, which bundles in a Cloud and AI Development Act. Marking is no longer optional polish.
In a glass-walled press room above Rue de la Loi, Lucilla Sioli, director of the EU AI Office, walked reporters through a slide that did something unusual for a Brussels deck: it showed code. Not legal code, but a snippet of C2PA-style manifest data, the kind of cryptographic metadata that travels with a file from the moment a model spits it out. “The principle is simple,” she said, paraphrasing the room's mood more than reading from her notes. “If a machine made it, a machine has to be able to say so.” Behind her, a slide listed the four mandatory layers the Code now formalises: digitally signed provenance metadata, imperceptible watermarks robust to crops and re-encodes, generator-side fingerprinting and logging, and detection protocols that downstream platforms can call. That was the choreography of 10 June 2026. The substance had been building for half a year. The first draft, published just before Christmas 2025, ran shorter and softer; the March revision absorbed more than 500 stakeholder comments; this June text reads like an engineering brief. Signatories — the major model providers who voluntarily commit — will be listed publicly in July, mirroring the pattern set last summer when 24 firms put their names to the General-Purpose AI Code on a similar timeline. The Code does not stand alone. One week earlier, on 3 June, Executive Vice-President for Tech Sovereignty Henna Virkkunen rolled out the European Technological Sovereignty Package, a bundle that includes a Chips Act 2.0, an Open Source Strategy and the new Cloud and AI Development Act (CADA). “We live in a world where geopolitics and technology are inseparable,” Virkkunen told the CNBC camera scrum that followed. “It is time for Europe to be in control of its data, of its supply chains, and of its future.” CADA defines four sovereignty assurance levels for cloud and AI services, aims to triple EU data-centre capacity within five to seven years, and gives public bodies a single rulebook to procure against. Read together, the two files describe a Europe that wants both the substrate (sovereign infrastructure) and the surface (provenance-tagged output) under its own jurisdictional control. For enterprise teams the practical date is 2 August 2026. That is when Article 50 obligations kick in for providers of generative AI systems and for the deployers who push their output into the world. Compare it to the GDPR ramp: that regulation passed in April 2016 and applied in May 2018, giving firms 25 months of runway. Article 50 has effectively given marketing, comms and product teams 51 days from this Code's publication. “The runway,” as one Frankfurt CISO put it on a private channel, “is a taxiway.” Not by accident: the Commission wants the rulebook locked before the obligation lands, so signatories can claim a presumption of conformity rather than argue case by case.
Strip away the legal preamble and the Code prescribes four interlocking layers. The first is digitally signed metadata, in practice the C2PA manifest format that Adobe, Microsoft, Google, Leica, Nikon and OpenAI have spent two years stitching into cameras, editors and model APIs. A signed manifest records who generated the asset, with what tool, when, and whether human edits were declared. It is tamper-evident: break the seal and downstream verifiers can tell. The second is imperceptible watermarking inside the pixels or audio waveform itself — Google DeepMind's SynthID is the reference implementation cited in stakeholder workshops — designed so that even a cropped fragment carries enough signal for detection. The third is fingerprinting and content logging on the generator side, an optional layer that lets a provider check whether a suspect file came from its model. The fourth is a detection and verification protocol that platforms, broadcasters and fact-checkers can query. This multi-layered choice is deliberate and contested. Article 50(2) of the AI Act demands marking be “effective, interoperable, robust and reliable” — four adjectives that, as an arXiv paper from researchers at TU Munich noted in March 2025, no single technique currently satisfies. Watermarks survive re-encoding but can be degraded; metadata is precise but strips out the moment a screenshot is taken; fingerprinting works well at scale but only if the original generator cooperates. The Code's answer is to stack them. The Computer & Communications Industry Association, the lobby that represents Google, Meta, Amazon and most of the big foundation-model players, has called the requirements “burdensome” and warned of “banner blindness” if every AI-touched pixel triggers a user-facing notice. The Information Technology Industry Council went further in a TechWonk post, arguing that prescriptive technical mandates risk freezing a still-evolving research field. The catch is what counts as a “deepfake”. The AI Act defines it as image, audio or video that resembles real persons, objects, places or events and would likely mislead a viewer into believing it is authentic. That sweeps in everything from a synthetic ad voiceover to a customer-service avatar to a re-aged car commercial. Deployers — not just model providers — must “clearly and distinguishably” disclose. For a DAX40 marketing team that means two layers of obligation: the agency using Midjourney or Runway has to ship machine-readable provenance, and the brand publishing the campaign has to add a human-readable label. For text on “matters of public interest,” publishers face a parallel disclosure rule, with carve-outs for content that has been substantively edited by a human under editorial responsibility. The historical analogy is GDPR, but the closer parallel is the U.S. Volcker Rule. Volcker, at adoption, ran to roughly 950 pages of preamble and rule text per major bank — a document so dense that compliance teams spent years mapping their trading books against it. The AI Act plus the marking Code is shaping up similarly: not by page count, but by the depth of plumbing changes it forces. Every generative pipeline that produces customer-facing output across an EU operation has to gain a provenance hook, a watermark insertion point, and a logged audit trail. Most do not have one today. And because CADA arrived in the same fortnight, the question of where those pipelines run also got harder. CADA's four assurance levels (from baseline self-attestation up to fully EU-controlled cloud) will shape procurement at every Bundesministerium and, through public-sector demand signals, at the regulated industries that sell into them.
For a DAX40 CIO the Code translates into a quarter of unglamorous integration work. Marketing stacks need C2PA manifest insertion at the point of asset export from agency tools — a non-trivial change because most agencies hand over final files via cloud buckets that strip metadata by default. Banks deploying chatbots for retail customers must ensure that AI-drafted messages on “matters of public interest” carry disclosures; the carve-out for human-reviewed text is the operational lever, and it will reshape how customer-service editors are staffed. Automotive infotainment is the most exposed corner: synthetic voice assistants and AI-curated audio briefings inside the car cabin fall squarely under Article 50. HR teams generating role descriptions, candidate outreach or training videos through generative tools must mark output and, increasingly, document the provenance to defend against discrimination claims. The August deadline turns a 2025 governance slide into a 2026 procurement requisition.
The marking Code is the operational sister to three other Brussels files. It plugs into the Digital Services Act, which already obliges very large online platforms to mitigate systemic risks from synthetic content; DSA enforcement now has a technical hook to verify. It overlaps with GDPR where biometric likenesses are involved — a synthetic voice clone of a real employee triggers both Article 50 and Article 9 special-category rules. And it sits inside the wider Technological Sovereignty Package, where CADA's four assurance levels will tier where regulated data can live. Member-state regulators — BNetzA in Germany, ARCEP-style bodies in France — are expected to coordinate via the AI Board. Enforcement will likely follow the DSA template: focused first on the largest providers, with fines of up to 3% of global turnover under the AI Act's penalty schedule.
A new layer of provenance plumbing is exactly the kind of compliance surface that European deep-tech investors have been waiting for. Munich-based Truepic, Berlin's Originstamp, the French outfit IMATAG and a clutch of CISPA spin-outs are already pitching watermark-insertion and detection APIs to brands that have no appetite to build them in-house. Expect a wave of seed and Series A rounds across H2 2026 around three thin slices: C2PA-as-a-service for marketing stacks, watermark-detection SDKs for platforms and broadcasters, and audit-grade logging tools that feed straight into a CISO's evidence binder. CADA's preference for EU-controlled infrastructure tilts the field further toward European challengers — a Bavarian provenance startup running on OVHcloud or IONOS suddenly looks easier to procure than a U.S. equivalent. Corporate venture arms at Bosch, Siemens and Allianz X are reportedly already circling.
Sources 9 references
- [1]Code of Practice on marking and labelling of AI-generated content
- [2]Commission publishes first draft of Code of Practice on marking and labelling
- [3]The EU AI Act's Transparency Rules: A Practical Guide to Article 50
- [4]Proposal for the Cloud and AI Development Act (CADA)
- [5]Strengthening Europe's tech sovereignty (European Commission)
- [6]What the EU's New AI Code of Practice Means for Labeling Deepfakes (TechPolicy.Press)
- [7]Tech's Expectations for the EU AI Act Transparency Code of Practice (ITIC)
- [8]Adoption of Watermarking for Generative AI Systems in Practice (arXiv)
- [9]Inside Europe's AI Strategy with EU AI Office Director Lucilla Sioli (CSIS)