The consumer internet industry spent two decades building a huge, complex, chaotic pile of tools and systems to track and analyse what people do on the internet, and we’ve spent the last half-decade arguing about that, sometimes for very good reasons, and sometimes with strong doses of panic and opportunism. Now that’s mostly going to change, between unilateral decisions by some big tech platforms and waves of regulation from all around the world. But we don’t have any clarity on what that will mean, or even quite what we’re trying to achieve, and there are lots of unresolved questions. We are confused.
First, can we achieve the underlying economic aims of online advertising in a private way? Advertisers don’t necessarily want (or at least need) to know who you are as an individual. As Tim O’Reilly put it, data is sand, not oil - all this personal data actually only has value in the aggregate of millions. Advertisers don’t really want to know who you are - they want to show diaper ads to people who have babies, not to show them to people who don’t, and to have some sense of which ads drove half a million sales and which ads drove a million sales. Targeting ads per se doesn’t seem fundamentally evil, unless you think putting car ads in car magazines is also evil. But the internet became able to show car ads to people who read about cars yesterday, somewhere else - to target based on the user rather than the context. This is both exactly the same and completely different.
In practice, ‘showing car ads to people who read about cars’ led the adtech industry to build vast piles of semi-random personal data, aggregated, disaggregated, traded, passed around and sometimes just lost, partly because it could and partly because that appeared to be the only way to do it. After half a decade of backlash, there are now a bunch of projects trying to get to the same underlying advertiser aims - to show ads that are relevant, and get some measure of ad effectiveness - while keeping the private data private. This is the theory behind Google’s FLoC and Apple’s rather similar tracking and ad-targeting system - do the analysis and tracking on the device, show relevant ads but don’t give advertisers or publishers the underlying personal data. However, even if the tech works and the industry can get to some kind of consensus behind any such project (both very big questions), would this really be private? And what does it do to competition?
This takes me to a second question - what counts as ‘private’, and how can you build ‘private’ systems if we don’t know?
Apple has pursued a very clear theory that analysis and tracking is private if it happens on your device and is not private if it leaves your device or happens in the cloud. Hence, it’s built a complex system of tracking and analysis on your iPhone, but is adamant that this is private because the data stays on the device. People have seemed to accept this (so far), but acting on the same theory Apple also created a CSAM scanning system that it thought was entirely private - ‘it only happens on your device!’ - that created a huge privacy backlash, because a bunch of other people think that if your phone is scanning your photos, that isn’t ‘private’ at all. So is ‘on device’ private or not? What’s the rule? What if Apple tried the same model for ‘private’ ads in Safari? How will the public take FLoC? I don’t think we know.
On / off device is one test, but another and much broader one is the first party / third party test: that it’s OK for a website to track what you do on that website but not OK for adtech companies to track you across many different websites. This is the core of the cookie question, and sounds sensible, and indeed one might think that we do have a pretty good consensus on ‘third party cookies’ - after all, Google and Apple are getting rid of them. However, I’m puzzled by some of the implications. “1p good / 3p bad” means that it’s OK for the New York Times to know that you read ten New York Times travel pieces and show you a travel ad, but not OK for the New Yorker to know that and show you the same ad. Why, exactly, is that a policy objective? Indeed, is it ‘private’ for the New York Times to record and analyse everything a logged-in user read on that site for the last decade? What would happen to its ad revenue if it dumped your history after 24 hours? (Cynically, the answer might be ‘not much’.) Is that different to Facebook recording and analysing everything you read on Facebook?
At this point one answer is to cut across all these questions and say that what really matters is whether you disclose whatever you’re doing and get consent. Steve Jobs liked this argument. But in practice, as we've discovered, ‘get consent’ means endless cookie pop-ups full of endless incomprehensible questions that no normal consumer should be expected to understand, and that just train people to click ‘stop bothering me’. Meanwhile, Apple’s on-device tracking doesn't ask for permission, and opts you in by default, because, of course, Apple thinks that if it's on the device it's private. Perhaps ‘consent’ is not a complete solution after all.
But the bigger issue with consent is that it’s a walled garden, which takes me to a third question - competition. Most of the privacy proposals on the table are in absolute, direct conflict with most of the competition proposals on the table. If you can only analyse behaviour within one site but not across many sites, or make it much harder to do that, companies that have a big site where people spend lots of time have better targeting information and make more money from advertising. If you can only track behaviour across lots of different sites if you do it ‘privately’ on the device or in the browser, then the companies that control the device or the browser have much more control over that advertising (which is why the UK CMA is investigating FLoC). And, as an aside, if you can only target on context, not the user, then Hodinkee is fine but the Guardian’s next landmark piece on Kabul has no ad revenue. Is that what we want? What else might happen?
These are all unresolved questions, and the more questions you ask, the less clear things become. I’ve barely touched on a whole other line of enquiry - where all the world’s $600bn of annual ad spending would be reallocated when all of this has happened (no, not to newspapers, sadly). Apple clearly thinks that scanning for CSAM on the device is more private than doing it in the cloud, but a lot of other people think the opposite. You can see the same confusion in terms like ‘Facebook sells your data’ (which, of course, it doesn’t) or ‘surveillance capitalism’ - these are really just attempts to avoid the discussion by reframing it, and moving it to a place where we do know what we think, rather than engaging with the challenge and trying to work out an answer. I don’t have an answer either, of course, but that’s rather my point - I don’t think we even agree on the questions.