Authors Grady Hendrix and Jennifer Roberson sued Apple on Friday for using pirated books to train its OpenELM models. The timing wasn't coincidental - it was the exact same day Anthropic agreed to pay $1.5 billion to settle a nearly identical lawsuit. Lawyers know an opportunity when they see one.
Apple getting caught using the same pirated book datasets is especially embarrassing because they've spent years lecturing everyone else about privacy and data ethics. Apple Intelligence marketing emphasizes "user consent" and "on-device processing" - but apparently that ethical stance didn't extend to acquiring training data legally.
The lawsuit specifically targets Apple's OpenELM models, which used "a known body of pirated books" for training. So while Apple was marketing privacy as a premium feature, they were torrenting books just like every other AI company.
Sue All the Things
Friday was apparently "fuck AI companies day" for copyright lawyers. Besides Apple, Warner Bros. also sued Midjourney for training on copyrighted images. The message is clear: Anthropic's $1.5B settlement proved these lawsuits can pay, so every content creator with lawyers is jumping in.
Authors have already sued OpenAI, Microsoft, Meta, and Google. Now entertainment companies are getting in on the action. This isn't random litigation - it's coordinated strategy to establish that AI companies need to pay for training data.
The Judge Alsup ruling in the Anthropic case created the perfect legal framework: using legitimately acquired books for AI training is fair use, but using pirated books is just copyright infringement. This makes the lawsuits much easier to win.
Apple's "Privacy" Hypocrisy
Apple's whole AI strategy is built on the premise that they're more ethical than competitors. Private Cloud Compute keeps user data on Apple servers instead of third-party clouds. Apple Silicon chips process AI requests on-device for privacy.
But all that privacy theater was for user data, not training data. When it came to actually building the AI models, Apple used the same pirated books as everyone else. They just did their piracy behind closed doors while publicly criticizing competitors.
This is a massive reputational risk for Apple. Their premium pricing depends partly on being "the ethical tech company" versus Meta and Google. If courts rule that Apple used identical training practices, what exactly are customers paying extra for? The privacy marketing falls apart if Apple was pirating content just like everyone else.