Copyrights in AI-Generated Content

Copyright registrations are being issued for works created with generative-AI tools, subject to some important qualifications. Also, Internet Archves revisited (briefly)

The long-awaited U.S. Copyright Office report on the copyrightability of works created using AI-generated output, is here. The legality of using copyrighted works to train generative-AI systems is a topic for another day.

Key takeaways:

  • Copyright protects the elements of a work that are created by a human, but does not protect elements that were AI-generated (probably the key take-away from the Report) The is the “human authorship” requirement that the Copyight Office invoked in denying registration of Stephen Thaler’s AI-generated output. I wrote about that a couple of years ago in “AI Can Create But Is It Art?” and also have commented on new AI copyright guidance from the Office before. 
  • The Copyright Office believes existing law is adequate to deal with AI copyright issues; it does not believe any new legislation is needed
  • Using AI to assist in the creative process does not affect copyrightability
  • Prompts do not provide sufficient control over the output to be considered creative works.
  • Protection exists for the following, if they involve sufficient human creativity:
    • Selection, coordination, and arrangement of AI-generated output (compilation)
      •  Modification of AI-generated content  (Derivative works)
        • Human-created elements distinguishable from AI-generated elements.

Prompts

A key question for the Copyright Office was whether a highly detailed prompt could suffice as human creative expression. The Office says no; “[P]rompts alone do not provide sufficient human control to make users of an AI system the authors of the output. Prompts essentially function as instructions that convey unprotectable ideas. While highly detailed prompts could contain the user’s desired expressive elements, at present they do not control how the AI system processes them in generating the output.”

How much control does a human need over the output-generation process to be considered an author? The answer, apparently, is “So much control that the AI mechanism’s contribution was purely rote or mechanical. “The fact that identical prompts can generate multiple different outputs further indicates a lack of human control.”

Expressive prompts

If the prompt itself is sufficiently creative and original, the expression contained in the prompt may qualify for copyright protection. For example, if a user prompts an AI tool to change a story from first-person to third-person point of view, and includes the first-person version in the prompt, then copyright may be claimed in the story that was included in the prompt. The author could claim copyright in the story as a “human-generated element” distinguishable from anything AI thereafter did to it. The human-created work must be perceptible in the output.

Registration of hybrid works

The U.S. Copyright Office has now issued several registrations for works that contain a combination of both human creative expression and AI-generated output. Examples:

Irontic, LLC has a registered copyright in Senzia Opera a, a sound recording with “music and singing voices by [sic] generated by artificial intelligence,” according to the copyright registration. That material is excluded from the claim. The registration, however, does provide protection for the story, lyrics, spoken words, and the selection, coordination, and arrangement of the sound recording.

Computer programs can be protected by copyright, but if any source code was generated by AI, it must be excluded from the claim. Thus, the Adobe GenStudio for Performance Marketing computer program is protected by copyright, but any source code in it that was AI-generated is not.

A record company received a copyright registration for human additions and modifications to AI-generated art.

As an example of a “selection, coordination and arrangement” copyright, there is the registration of a work called “A Collection of Objects Which Do Not Exist,” consisting of a collage of AI-generated images. “A Single Piece of American Cheese,” is another example of a registered copyright claim based on the selection, coordination, or arrangement of AI-generated elements.

Woman's face superimposed on a sheet of computer programming code, signifying an AI-generated image

China

A Chinese court has taken a contrary position, holding that an AI-generated image produced by Stable Diffusion is copyrightable because the prompts he chose reflected his aesthetic choices.

Internet Archive Postscript

In January, the Second Circuit Court of Appeals affirmed the decision in Hachette Book Group, Inc. v. Internet Archive. This came as no surprise. A couple of important things that bear repeating came out of this decision, though.

First, the Court of Appeals reaffirmed that fair use is an affirmative defense. As such, the defendant bears the burden of establishing the level of market harm the use has caused or may cause. While a copyright owner may reasonably be required to identify relevant markets, he/she/it is not required to present empirical data to support a claim of market harm. The defendant bears the burden of proof of a fair use defense, including proof pertinent to each of the four factors comprising the defense.

Confusion seems to have crept into some attorneys’ and judges’ analysis of the issue. This is probably because it is well known that the plaintiff bears the burden of proof of damages, which can also involve evidence of market harm. The question of damages, however, is separate and distinct from the “market harm” element of a fair use defense.

The second important point the Second Circuit made in Hatchette is that the “public benefit” balancing that Justice Breyer performed in Google LLC v. Oracle America, Inc. needs to focus on something more than just the short-term benefits to the public in getting free access to infringing copies of works. Otherwise, the “public benefit” in getting free copies of copyright-protected stuff would outweigh the rights of copyright owners every time.  The long-term benefits of protecting the rights of authors must also be considered.

True, libraries and consumers may reap some short-term benefits from access to free digital books, but what are the long-term consequences? [Those consequences, i.e.,] depriv[ing] publishers and authors of the revenues due to them as compensation for their unique creations [outweigh any public benefit in having free access to copyrighted works.]

Id.

They reined in Google v. Oracle.

Thomas James is a human

 

Fair Use Decision in Thomson Reuters v. Ross

A court has handed down the first known ruling (to me, anyway) on “fair use” in the wave of copyright infringement lawsuits against AI companies that are pending in federal courts.

Thomson Reuters v. ROSS is one of the top 12 generative-AI lawsuits that are pending in the courts. A court has handed down the first known ruling (to me, anyway) on “fair use” in the wave of copyright infringement lawsuits against AI companies that are pending in federal courts. The ruling came in Thomson Reuters v. ROSS. Thomas Reuters filed this lawsuit against Ross Intelligence back in 2020, alleging that Ross trained its AI models on Westlaw headnotes to build a competing legal research tool, infringing numerous copyrights in the process. Ross asserted a fair use defense.

Library of Congress building, the front entrance
Library of Congress

In 2023, Thomson Reuters sought summary judgment against Ross on the fair use defense. At that time, Judge Bibas denied the motion. This week, however, the judge reversed himself, knocking out at least a major portion of the fair use defense.

Ross had argued that Westlaw headnotes are not sufficiently original to warrant copyright protection and that even if they are, the use made of them was “fair use.” After painstakingly reviewing the headnotes and comparing them with the database materials, he concluded that 2,243 headnotes were sufficiently original to receive copyright protection, that Ross infringed them, and that “fair use” was not a defense in this instance because the purpose of the use was commercial and it competed in the same market with Westlaw. Because of that, it was likely to have an adverse impact on the market for Westlaw.

While this might seem to spell the end for AI companies in the many other lawsuits where they are relying on a “fair use” defense, that is not necessarily so. As Judge Bibas noted, the Ross AI was non-generative. Generative AI tools may be distinguishable in the fair use analysis.

I will be presenting a program on Recent Developments in AI Law in New Jersey this summer. This one certainly will merit mention. Whether any more major developments will come to pass between now and then remains to be seen.

New AI Copyright Infringement Lawsuit

Another copyright and trademark infringement lawsuit against an AI company was filed this week. Advance Local Media et al. v. Cohere, Inc. This one pits news article publishers Advance Local Media, Condé Nast, The Atlantic, Forbes Media, The Guardian, Business Insider, LA Times, McClatchy Media Company, Newsday, Plain Dealer Publishing Company, POLITICO, The Republican Company, Toronto Star Newspapers, and Vox Media against AI company Cohere.

The complaint alleges that Cohere made unauthorized use of publisher content in developing and operating its generative AI systems, infringing numerous copyrights and trademarks. The plaintiffs are seeking an injunction and monetary damages.

Read more copyright and AI training stories.

Top Copyright Cases of 2024

Many AI-related copyright lawsuits continued to proceed through the courts

Warner Chappell Music Inc. v. Nealy

The Copyright Act imposes a three-year period of limitations for copyright infringement claims. There has been a split in the circuits about whether this means that damages could be claimed only for infringement occurring during the three-year period or whether damages could be recovered for earlier acts of infringement so long as the claim is timely filed. The Supreme Court has now resolved the split.

The issue arises in cases where a claimant invokes the discovery rule. The general rule is that a limitations period runs from the date of the act giving rise to the cause of action. The discovery rule, by contrast, measures the limitations period from the date the infringing act is discovered. Thus, for example, if an infringing act occurred in 2012 but the copyright owner did not learn about it until 2022, then under the traditional rule, the claim would be time-barred. Under the discovery rule, it would not be.

The U.S. Supreme Court’s holding in this case is to the effect that if the discovery rule applies in the jurisdiction where suit is filed, and a claimant properly invokes it, then damages are not limited to the three years preceding suit. Rather, any damages incurred since the date of the infringing act are recoverable.

The Court did not rule on the validity of the discovery rule.

Warner Chappell Music Inc.. v. Nealy, 601 U.S. ____ (2024).

 Hachette Book Group Inc. v. Internet Archive

I wrote about Hachette Book Group v. Internet Archive back in 2022, when it was at the summary judgment stage in the district court for the Southern District of New York. Astute readers with much better memory than I possess will recall that I included in my list of the top copyright cases os 2022. The complaint, filed by book publishers, alleged that the Internet Archive made digital copies of over a million print books and then freely distributed the copies to members of the public, all without the permission of the copyright owners. In 2023, the district judge ruled in favor of the publishers, holding that the enterprise was not “fair use.” It subsequently issued an injunction against further scanning and distribution of books. (See A Copyright Win in the Internet Archive Lawsuit.) This year, the Second Circuit Court of Appeals affirmed the decision.

To some, the decision might seem like a no-brainer. Copying other people’s books and giving them away for free, without the copyright owners’ permission, sounds like core copyright infringement, right? Yet, before the Warhol v. Goldsmith decision in 2023, courts had been applying such an expansive view of the “transformative use” branch of fair use that some people thought that making digital copies of a print book was categorically “transformative” and therefore fair use. This decision makes it clear that no, it isn’t.

The Internet Archive has said it will not appeal the decision to the United States Supreme Court.

Hachette Book Group Inc. et al. v. Internet Archive, No. 23-1260 (2nd Cir. 2024)

Griner v. King

U.S. Representative Steve King’s campaign committee used a copyright-protected photograph in his campaign without permission. King’s committee had argued fair use and that it had an “implied license” to use the image because it had been widely circulated as a meme on the Internet. The Eighth Circuit Court of Appeals upheld an Iowa jury’s verdict for the copyright owner.

Griner et al. v. King et al., No. 23-2117, (8th Cir. 2024)

The Intercept Media v. OpenAI

This isn’t really a momentous decision, in terms of precedential value, but it is the first major victory for Big AI in the plethora of AI-related lawsuits they are facing.

The Intercept Media, Inc. sued OpenAI and Microsoft Corporation for alleged Digital Millennium Copyright Act (DMCA) violations in connection with training the AI tool, ChatGPT. The defendants filed a motion to dismiss. On November 21, 2024 the New York court dismissed claims against Microsoft with prejudice. The court dismissed the 17 U.S.C. § 1202(b)(3) claim against OpenAI but allowed the claim under 17 U.S.C. §1202(b)(1) to proceed.  

Section 1202(b)(1) prohibits unauthorized removal or alteration of copyright management information, including author information and the copyright notice.

The Intercept Media Inc. v. OpenAI Inc., No. 1:24-cv-01515, (S.D.N.Y. Nov. 21, 2024).

Stay tuned…

Many AI-related copyright lawsuits continued to proceed through the courts in 2024, with decisions expected in 2025 or later.