An AI firm won a lawsuit for copyright infringement — but may face a huge bill for piracy

To judge from the reaction among the AI crowd, a federal judge's Monday ruling in a copyright infringement case was a clear win for all the AI firms that use published material to 'train' their chatbots.
'We are pleased that the Court recognized that using works to train [large language models] was transformative — spectacularly so,' Anthropic, the defendant in the lawsuit, boasted after the ruling.
'Transformative' was a key word in the ruling by U.S. Judge William Alsup of San Francisco, because it's a test of whether using copyrighted works falls within the 'fair use' exemption from copyright infringement. Alsup ruled that using copyrighted works to train bots such as Anthropic's Claude is indeed fair use, and not a copyright breach.
Anthropic had to acknowledge a troubling qualification in Alsup's order, however. Although he found for the company on the copyright issue, he also noted that it had downloaded copies of more than 7 million books from online 'shadow libraries,' which included countless copyrighted works, without permission.
That action was 'inherently, irredeemably infringing,' Alsup concluded. 'We will have a trial on the pirated copies...and the resulting damages,' he advised Anthropic ominously: Piracy on that scale could expose the company to judgments worth untold millions of dollars.
What looked superficially like a clear win for AI companies in their long battle to use copyrighted material to feed their chatbots without paying for it now looks clear as mud.
That's especially true when Alsup's ruling is paired with a ruling issued Wednesday by U.S. Judge Vince Chhabria, who works out of the same San Francisco courthouse.
In that copyright infringement case, brought against Meta Platforms in 2023 by comedian Sarah Silverman and 12 other published authors, Chhabria also ruled that Meta's training its AI bots on copyrighted works is defensible as fair use. He granted Meta's motion for summary judgment.
But he provided plaintiffs in similar cases with a roadmap to winning their claims. He ruled in Meta's favor, he indicated, only because the plaintiffs' lawyers failed to raise a legal point that might have given them a victory. More on that in a moment.
'Neither case is going to be the last word' in the battle between copyright holders and AI developers, says Adam Moss, a Los Angeles attorney specializing in copyright law. With more than 40 lawsuits on court dockets around the country, he told me, 'it's too early to declare that either side is going to win the ultimate battle.'
With billions of dollars, even trillions, at stake for AI developers and the artistic community, no one expects the law to be resolved until the issue reaches the Supreme Court, presumably years from now. But it's worthwhile to look at these recent decisions, and at a copyright lawsuit filed earlier this month by Walt Disney Co., NBCUniversal and other studios against Midjourney, another AI developer, for a sense of how the war is shaping up.
To start, a few words about chatbot-making. Developers feed their chatbot models on a torrent of material, much of it scraped from the web — everything from distinguished literary works to random babbling — as well as collections holding millions of books, articles, scientific papers and the like, some of it copyrighted. (Three of my eight books are listed in one such collection, without my permission. I don't know if any have been 'scraped,' and I'm not a party to any copyright lawsuit, as far as I know.)
The goal is to 'train' the bots to extract facts and detect patterns in the written material that can then be used to answer AI users' queries in a semblance of conversational language. There are flaws in the process, of course, including the bots' tendency to make something up when they can't find an answer in their massive hoard of data.
In their lawsuits, writers and artists maintain that the use of their material without permission to train the bots is copyright infringement, unless they've been paid. The AI developers reply that training falls within the 'fair use' exemption in copyright law, which depends on several factors — if only limited material is drawn from a copyrighted work, if the resulting product is 'transformative,' and if it doesn't significantly cut into the market for the original work.
That brings us to the lawsuits at hand.
Three authors (novelist Andrea Bartz and nonfiction writers Charles Graeber and Kirk Wallace Johnson) sued Anthropic for using their works without permission. Their lawsuit, filed last year, revealed that Anthropic had spent millions of dollars to acquire millions of print books, new and used, to feed its bots.
'Anthropic purchased its print copies fair and square,' Alsup wrote. It's generally understood that the owners of books can do almost anything they wish with them, including reselling them.
But Anthropic also downloaded copies of more than 7 million books from online 'shadow libraries,' which include untold copyrighted works without permission.
Anthropic 'could have purchased books, but it preferred to steal them to avoid "legal/practice/business slog,"' Alsup wrote. (He was quoting Anthropic co-founder and CEO Dario Amodei.)
Anthropic told me by email that 'it's clear that we acquired books for one purpose only — building LLMs — and the court clearly held that use was fair.'
That's correct as far as it goes. But Alsup found that Anthropic's goal was not only to train LLMs, but to create a general library 'we could use for research' or to 'inform our products,' as an Anthropic executive said, according to legal papers.
Chhabria's ruling in the Meta case presented another wrinkle. He explicitly disagreed with Alsup about whether using copyrighted works without permission to train bots is fair use.
'Companies have been unable to resist the temptation to feed copyright-protected materials into their models, without getting permission from the copyright holders or paying them,' Chhabria wrote. He posed the question: Is that illegal? And answered, 'Although the devil is in the details, in most cases the answer will be yes.'
Chhabria's rationale was that a flood of AI-generated works will 'dramatically undermine the market' for the original works, and thus 'dramatically undermine the incentive for human beings to create things the old-fashioned way.'
Protecting the incentive for human creation is exactly the goal of copyright law, he wrote. 'While AI-generated books probably wouldn't have much of an effect on the market for the works of Agatha Christie, they could very well prevent the next Agatha Christie from getting noticed or selling enough books to keep writing.'
Artists and authors can win their copyright infringement cases if they produce evidence showing the bots are affecting their market. Chhabria all but pleaded for the plaintiffs to bring some such evidence before him:
'It's hard to imagine that it can be fair use to use copyrighted books...to make billions or trillions of dollars while enabling the creation of a potentially endless stream of competing works that could significantly harm the market for those books.'
But 'the plaintiffs never so much as mentioned it,' he lamented.
As a result, he said, he had no choice but to give Meta a major win against the authors.
I asked the six law firms representing the authors for their response to Chhabria's implicit criticism of their lawyering, but heard back from only one — Boies Schiller Flexner, which told me by email, 'despite the undisputed record of Meta's historically unprecedented pirating of copyrighted works, the court ruled in Meta's favor. We respectfully disagree with that conclusion.'
All this leaves the road ahead largely uncharted. 'Regardless of how the courts rule, I believe the end result will be some form of licensing agreement,' says Robin Feldman, director of the Center for Innovation at UC College of the Law. 'The question is where will the chips fall in the deal and will smaller authors be left out in the cold.'
Some AI firms have reached licensing agreements with publishers allowing them to use the latters' copyrighted works to train their bots. But the nature and size of those agreements may depend on how the underlying issues of copyright infringement play out in the courts.
Indeed, Chhabria noted that filings in his court documented that Meta was trying to negotiate such agreements until it realized that a shadow library it had downloaded already contained most of the works it was trying to license. At that point it 'abandoned its licensing efforts.' (I asked Meta to confirm Chhabria's version, but didn't get a reply.)
The truth is that the AI camp is just trying to get something for free instead of paying for it. Never mind the trillions of dollars in revenue they say they expect over the next decade; they claim that licensing will be so expensive it will stop the march of this supposedly historic technology dead in its tracks.
Chhabria aptly called this argument 'nonsense.' If using books for training is as valuable as the AI firms say it is, he noted, then surely a market for book licensing will emerge. That is, it will, if the courts don't give the firms the right to use stolen works without compensation.
