Cory Doctorow: Mostly signs (some portents)

Copyright takedowns are a cautionary tale that few are heeding

ALT

On July 14, I’m giving the closing keynote for the fifteenth HACKERS ON PLANET EARTH, in QUEENS, NY. Happy Bastille Day! On July 20, I’m appearing in CHICAGO at Exile in Bookville.

We’re living through one of those moments when millions of people become suddenly and overwhelmingly interested in fair use, one of the subtlest and worst-understood aspects of copyright law. It’s not a subject you can master by skimming a Wikipedia article!

I’ve been talking about fair use with laypeople for more than 20 years. I’ve met so many people who possess the unshakable, serene confidence of the truly wrong, like the people who think fair use means you can take x words from a book, or y seconds from a song and it will always be fair, while anything more will never be.

Or the people who think that if you violate any of the four factors, your use can’t be fair – or the people who think that if you fail all of the four factors, you must be infringing (people, the Supreme Court is calling and they want to tell you about the Betamax!).

You might think that you can never quote a song lyric in a book without infringing copyright, or that you must clear every musical sample. You might be rock solid certain that scraping the web to train an AI is infringing. If you hold those beliefs, you do not understand the “fact intensive” nature of fair use.

But you can learn! It’s actually a really cool and interesting and gnarly subject, and it’s a favorite of copyright scholars, who have really fascinating disagreements and discussions about the subject. These discussions often key off of the controversies of the moment, but inevitably they implicate earlier fights about everything from the piano roll to 2 Live Crew to antiracist retellings of Gone With the Wind.

One of the most interesting discussions of fair use you can ask for took place in 2019, when the NYU Engelberg Center on Innovation Law & Policy held a symposium called “Proving IP.” One of the panels featured dueling musicologists debating the merits of the Blurred Lines case. That case marked a turning point in music copyright, with the Marvin Gaye estate successfully suing Robin Thicke and Pharrell Williams for copying the “vibe” of Gaye’s “Got to Give it Up.”

Naturally, this discussion featured clips from both songs as the experts – joined by some of America’s top copyright scholars – delved into the legal reasoning and future consequences of the case. It would be literally impossible to discuss this case without those clips.

And that’s where the problems start: as soon as the symposium was uploaded to Youtube, it was flagged and removed by Content ID, Google’s $100,000,000 copyright enforcement system. This initial takedown was fully automated, which is how Content ID works: rightsholders upload audio to claim it, and then Content ID removes other videos where that audio appears (rightsholders can also specify that videos with matching clips be demonetized, or that the ad revenue from those videos be diverted to the rightsholders).

But Content ID has a safety valve: an uploader whose video has been incorrectly flagged can challenge the takedown. The case is then punted to the rightsholder, who has to manually renew or drop their claim. In the case of this symposium, the rightsholder was Universal Music Group, the largest record company in the world. UMG’s personnel reviewed the video and did not drop the claim.

99.99% of the time, that’s where the story would end, for many reasons. First of all, most people don’t understand fair use well enough to contest the judgment of a cosmically vast, unimaginably rich monopolist who wants to censor their video. Just as importantly, though, is that Content ID is a Byzantine system that is nearly as complex as fair use, but it’s an entirely private affair, created and adjudicated by another galactic-scale monopolist (Google).

Google’s copyright enforcement system is a cod-legal regime with all the downsides of the law, and a few wrinkles of its own (for example, it’s a system without lawyers – just corporate experts doing battle with laypeople). And a single mis-step can result in your video being deleted or your account being permanently deleted, along with every video you’ve ever posted. For people who make their living on audiovisual content, losing your Youtube account is an extinction-level event:

https://www.eff.org/wp/unfiltered-how-youtubes-content-id-discourages-fair-use-and-dictates-what-we-see-online

So for the average Youtuber, Content ID is a kind of Kafka-as-a-Service system that is always avoided and never investigated. But the Engelbert Center isn’t your average Youtuber: they boast some of the country’s top copyright experts, specializing in exactly the questions Youtube’s Content ID is supposed to be adjudicating.

So naturally, they challenged the takedown – only to have UMG double down. This is par for the course with UMG: they are infamous for refusing to consider fair use in takedown requests. Their stance is so unreasonable that a court actually found them guilty of violating the DMCA’s provision against fraudulent takedowns:

https://www.eff.org/cases/lenz-v-universal

But the DMCA’s takedown system is part of the real law, while Content ID is a fake law, created and overseen by a tech monopolist, not a court. So the fate of the Blurred Lines discussion turned on the Engelberg Center’s ability to navigate both the law and the n-dimensional topology of Content ID’s takedown flowchart.

It took more than a year, but eventually, Engelberg prevailed.

Until they didn’t.

If Content ID was a person, it would be baby, specifically, a baby under 18 months old – that is, before the development of “object permanence.” Until our 18th month (or so), we lack the ability to reason about things we can’t see – this the period when small babies find peek-a-boo amazing. Object permanence is the ability to understand things that aren’t in your immediate field of vision.

Content ID has no object permanence. Despite the fact that the Engelberg Blurred Lines panel was the most involved fair use question the system was ever called upon to parse, it managed to repeatedly forget that it had decided that the panel could stay up. Over and over since that initial determination, Content ID has taken down the video of the panel, forcing Engelberg to go through the whole process again.

But that’s just for starters, because Youtube isn’t the only place where a copyright enforcement bot is making billions of unsupervised, unaccountable decisions about what audiovisual material you’re allowed to access.

Spotify is yet another monopolist, with a justifiable reputation for being extremely hostile to artists’ interests, thanks in large part to the role that UMG and the other major record labels played in designing its business rules:

https://pluralistic.net/2022/09/12/streaming-doesnt-pay/#stunt-publishing

Spotify has spent hundreds of millions of dollars trying to capture the podcasting market, in the hopes of converting one of the last truly open digital publishing systems into a product under its control:

https://pluralistic.net/2023/01/27/enshittification-resistance/#ummauerter-garten-nein

Thankfully, that campaign has failed – but millions of people have (unwisely) ditched their open podcatchers in favor of Spotify’s pre-enshittified app, so everyone with a podcast now must target Spotify for distribution if they hope to reach those captive users.

Guess who has a podcast? The Engelberg Center.

Naturally, Engelberg’s podcast includes the audio of that Blurred Lines panel, and that audio includes samples from both “Blurred Lines” and “Got To Give It Up.”

So – naturally – UMG keeps taking down the podcast.

Keep reading

Copyright takedowns are a cautionary tale that few are heeding

Solving the Moderator’s Trilemma with Federation

About

People I follow