Photographers’ Rights

The Second Circuit Court of Appeals reversed a trial judge’s dismissal of a photographer’s copyright infringement complaint, holding that because “fair use” was not clearly established on the face of the complaint, the district court should not have dismissed the complaint sua sponte. Romanova v. Amilus, Inc.

Romanova v. Amilus, Inc., No. 23-828 (2nd Cir., May 23, 2025)

The Second Circuit Court of Appeals reversed a trial judge’s dismissal of a photographer’s copyright infringement complaint, holding that because “fair use” was not clearly established on the face of the complaint, the district court should not have dismissed the complaint sua sponte.

European grass snake photograph illustrating copyright infringement article by Cokato Minnesota attorney Thomas James

Photographer Jana Romanova created a photograph of a woman with a snake wrapped around her left hand and another snake crawling up her torso. (Not the one pictured here.) She licensed it to National Geographic Magazine for a single use. According to the complaint, Amilus, Inc. allegedly made a copy of the photograph and published it to its website. Romanova allegedly sent notifications demanding the removal of the photograph from the website. The defendant allegedly did not respond. This lawsuit followed.

The defendant allegedly did not appear or respond to the complaint, so Romanova moved for the entry of default judgment. Rather than grant a default judgment, however, the district court judge sua sponte ordered Romanova to show cause why the court should not dismiss the case on the grounds that the defendant’s use of the photograph was fair use. Although fair use is an affirmative defense, which defendants have the burden of asserting and proving, the judge opined that the fair use defense did not need to be pleaded because the judge believed the fair use defense was “clearly established on the fact of the complaint.

Romanova appealed. The Second Circuit Court of Appeals reversed, effectively allowing the infringement claim to go forward.

Fair Use

In its decision, the Second Circuit Court of Appeals clarified how courts are to interpret and apply the four-factor “fair use” test outlined in the Copyright Act, 17 U.S.C. ยง 107 (purpose and character of the use; nature of the work; amount and substantiality of the portion copied; and the effect on the market for the work.)

The district court concluded that the defendant’s publication of the photograph communicated a different message than what the photographer intended. According to the district court, the purpose of the publication in the National Geographic was “to showcase persons in [her] home country of Russia that kept snakes as pets, specifically to capture pet snakes in common environments that are more associated with mainstream domesticated animals.” The district court found that the purpose of the defendant’s publication was to communicate a message about “the ever-increasing amount of pet photography circulating online.

Apparently the district court was under the impression that the use of a copyright-protected work for any different purpose, or to communicate any different message, is “transformative” and therefore “fair use.” The Court of Appeals clarified that is not the case. In addition to alleging and proving the use was for a different purpose or conveyed a different meaning, a defendant seeking to establish a fair use defense must also allege and prove a justification for the copying.

Examples of purposes that may justify copying a work include commentary or criticism of the copied work, or providing information to the public about the copied work, in circumstances where the copy does not become a substitute for the work. (See, e.g., Authors Guild v. Google, Inc., 804 F.3d 202, 212 (2d Cir. 2015).) Copying for evidentiary purposes (such as to support a claim that the creator of the work published a defamatory statement) can also be a valid justification to support a fair use defense. Creating small, low-resolution copies of images (“thumbnails”) may be justified when the purpose is to facilitate Internet searching. (Perfect 10 v. Amazon.com, 508 F.3d 1146, 1165 (9th Cir. 2007). Facilitating blind people’s access to a work may provide a justification for converting it into a format that blind people can read. (Authors Guild v. HathiTrust, 755 F.3d 87, 97 (2d Cir. 2014).

The Court cited other examples of potential justifications for copying. The Court admonished, however, that the question whether justification exists is a fact-specific determination that must be made on a case-by-case basis.

[J]ustification is often found when the copying serves to critique, or otherwise comment on, the original, or its author, but can also be found in other circumstances, such as when the copying provides useful information about the original, or on other subjects, usually in circumstances where the copying does not make the expressive content of the original available to the public.

Romanova, supra.

The only “justification” the district court cited for the copying was that it believed the defendant merely wanted to illustrate its perception of a growing trend to publish photographs of people with pets. “Little could remain of an author’s copyright protection if others could secure the right to copy and distribute a work simply by asserting some fact about the copied work,” the Court observed. The defendant’s publication of the copy did not communicate criticism or commentary on the original photograph or its author, or any other subject, the Court held.

The Court held that the remaining three fair use factors also militated against a finding of fair use.

Sua Sponte Dismissal for “Fair Use”

Justice Sullivan filed a concurring opinion. He would have reversed on procedural grounds without reaching the substantive issue. Specifically, Justice Sullivan objected to the trial judge’s raising of the fair use defense sua sponte on behalf of a non-appearing defendant. Normally, if a complaint establishes a prima case for relief, the court does not consider affirmative defenses (such as fair use) unless the defendant asserts them. That is to say, fair use is an affirmative defense; the defendant, not the plaintiff, bears the burden of proof.

Conclusion

Appeals courts continue to rein in overly expansive applications of transformative fair use by the lower courts. Here, the Court of Appeals soundly reasoned that merely being able to articulate an additional purpose served by publishing an author’s entire work, unchanged, will not, by itself, suffice to establish either transformative use or fair use.

 

Copyrights in AI-Generated Content

Copyright registrations are being issued for works created with generative-AI tools, subject to some important qualifications. Also, Internet Archves revisited (briefly)

The U.S. Copyright Office has issued its long-awaited report on the copyrightability of works created using AI-generated output. The legality of using copyrighted works to train generative-AI systems is a topic for another day.

Key takeaways:

  • Copyright protects the elements of a work that are created by a human, but does not protect elements that were AI-generated (probably the key take-away from the Report) The is the “human authorship” requirement that the Copyight Office invoked in denying registration of Stephen Thaler’s AI-generated output. I wrote about that a couple of years ago in “AI Can Create But Is It Art?” and also have commented on new AI copyright guidance from the Office before.ย 
  • The Copyright Office believes existing law is adequate to deal with AI copyright issues; it does not believe any new legislation is needed
  • Using AI to assist in the creative process does not affect copyrightability
  • Prompts do not provide sufficient control over the output to be considered creative works.
  • Protection exists for the following, if they involve sufficient human creativity:
    • Selection, coordination, and arrangement of AI-generated output (compilation)
      • Modification of AI-generated content
        • Human-created elements distinguishable from AI-generated elements.

Prompts

A key question for the Copyright Office was whether a highly detailed prompt could suffice as human creative expression. The Office says no; โ€œ[P]rompts alone do not provide sufficient human control to make users of an AI system the authors of the output. Prompts essentially function as instructions that convey unprotectable ideas. While highly detailed prompts could contain the userโ€™s desired expressive elements, at present they do not control how the AI system processes them in generating the output.โ€

How much control does a human need over the output-generation process to be considered an author? The answer, apparently, is โ€œSo much control that the AI mechanismโ€™s contribution was purely rote or mechanical. โ€œThe fact that identical prompts can generate multiple different outputs further indicates a lack of human control.โ€

Expressive prompts

If the prompt itself is sufficiently creative and original, the expression contained in the prompt may qualify for copyright protection. For example, if a user prompts an AI tool to change a story from first-person to third-person point of view, and includes the first-person version in the prompt, then copyright may be claimed in the story that was included in the prompt. The author could claim copyright in the story as a โ€œhuman-generated elementโ€ distinguishable from anything AI thereafter did to it. The human-created work must be perceptible in the output.

Registration of hybrid works

The U.S. Copyright Office has now issued several registrations for works that contain a combination of both human creative expression and AI-generated output. Examples:

Irontic, LLC has a registered copyright in Senzia Opera, a sound recording with โ€œmusic and singing voices by [sic] generated by artificial intelligence,โ€ according to the copyright registration. That material is excluded from the claim. The registration, however, does provide protection for the story, lyrics, spoken words, and the selection, coordination, and arrangement of the sound recording.

Computer programs can be protected by copyright, but if any source code was generated by AI, it must be excluded from the claim. Thus, the Adobe GenStudio for Performance Marketing computer program is protected by copyright, but any source code in it that was AI-generated is not.

A record company received a copyright registration for human additions and modifications to AI-generated art.

As an example of a โ€œselection, coordination and arrangementโ€ copyright, there is the registration of a work called โ€œA Collection of Objects Which Do Not Exist,โ€ consisting of a collage of AI-generated images. โ€œA Single Piece of American Cheese,โ€ is another example of a registered copyright claim based on the selection, coordination, or arrangement of AI-generated elements.

China

A Chinese court has taken a contrary position, holding that an AI-generated image produced by Stable Diffusion is copyrightable because the prompts he chose reflected his aesthetic choices.

Internet Archive Postscript

In January, the Second Circuit Court of Appeals affirmed the decision in Hachette Book Group, Inc. v. Internet Archive. This came as no surprise. A couple of important things that bear repeating came out of this decision, though.

First, the Court of Appeals reaffirmed that fair use is an affirmative defense. As such, the defendant bears the burden of establishing the level of market harm the use has caused or may cause. While a copyright owner may reasonably be required to identify relevant markets, he/she/it is not required to present empirical data to support a claim of market harm. The defendant bears the burden of proof of a fair use defense, including proof pertinent to each of the four factors comprising the defense.

Confusion seems to have crept into some attorneysโ€™ and judgesโ€™ analysis of the issue. This is probably because it is well known that the plaintiff bears the burden of proof of damages, which can also involve evidence of market harm. The question of damages, however, is separate and distinct from the “market harm” element of a fair use defense.

The second important point the Second Circuit made in Hatchette is that the โ€œpublic benefitโ€ balancing that Justice Breyer performed in Google LLC v. Oracle America, Inc. needs to focus on something more than just the short-term benefits to the public in getting free access to infringing copies of works. Otherwise, the โ€œpublic benefitโ€ in getting free copies of copyright-protected stuff would outweigh the rights of copyright owners every time. ย The long-term benefits of protecting the rights of authors must also be considered.

True, libraries and consumers may reap some short-term benefits from access to free digital books, but what are the long-term consequences? [Those consequences, i.e.,] depriv[ing] publishers and authors of the revenues due to them as compensation for their unique creations [outweigh any public benefit in having free access to copyrighted works.]

Id.

They reined in Google v. Oracle.

Thomas James is a human. No part of this article was AI-generated.

 

AI Lawsuits Roundup

A status update on 24 pending lawsuits against AI companies – what they’re about and what is happening in court – prepared by Minnesota copyright attorney Thomas James.

Advancements in artificial intelligence technology, including generative-AI, have introduced a wide range of new or exacerbated legal problems. Collectively, I call these AI legal issues. Although not all of them are unique to scenarios involving AI, they are certainly testing and stretching the capacity of legal institutions. Here is a very brief summary of how these issues are playing out in the courts, as of February 28, 2024.ย 

Copyright

Thomson Reuters v. Ross, (D. Del. 2020)

Filed May 6, 2020. Thomson Reuters, owner of Westlaw, claims that Ross Intelligence infringed copyrights in Westlaw headnotes by training AI on copies of them. The judge has granted, in part, and denied, in part, motions for summary judgment. The questions of fair use and whether the headnotes are sufficiently original to merit copyright protection remain to be decided.

Update: The court initially ruled that Westlaw’s headnotes are not sufficiently creative and original to merit copyright protection, but has now reversed itself, ruling that over 2,243 of them are. There has now been a fair use decision in Thomson Reuters v. Ross.ย 

Thaler v. Perlmutter (D.D.C. 2022).

Complaint filed June 2, 2022. Thaler created an AI system called the Creativity Machine. He applied to register copyrights in the output he generated with it. The Copyright Office refused registration on the ground that AI output does not meet the โ€œhuman authorshipโ€ requirement. (I explained that requirement in a previous blog post that explored the difference between human and AI creation of a work. He then sought judicial review. The district court granted summary judgment for the Copyright Office. In October, 2023, Thaler filed an appeal to the District of Columbia Circuit Court of Appeals (Case no. 23-5233).

Doe v. GitHub, Microsoft, and OpenAI (N.D. Cal. 2022)

Complaint filed November 3, 2022. Software developers claim the defendants trained Codex and Copilot on code derived from theirs, which they published on GitHub. Some claims have been dismissed, but claims that GitHub and OpenAI violated the DMCA and breached open source licenses remain. Discovery is ongoing.

Andersen v. Stability AI (N.D. Cal. 2023)

Complaint filed January 13, 1023. Visual artists sued Midjourney, Stability AI and DeviantArt for copyright infringement for allegedly training their generative-AI models on images scraped from the Internet without copyright holdersโ€™ permission. Other claims included DMCA violations, publicity rights violations, unfair competition, breach of contract, and a claim that output images are infringing derivative works. On October 30, 2023, the court largely granted motions to dismiss, but granted leave to amend the complaint. Plaintiffs filed an amended complaint on November 29, 2023. Defendants have filed motions to dismiss the amended complaint. Hearing on the motion is set for May 8, 2024.

Getty Images v. StabilityAI (U.K. 2023)

Complaint filed January, 2023. Getty Images claims StabilityAI scraped images without its consent. Gettyโ€™s complaint has survived a motion to dismiss and the case appears to be heading to trial.

In re OpenAI ChatGPT Litigation (N.D. Cal. 2023)

Complaint filed June 28, 3023. Originally captioned Tremblay v. OpenAI. Book authors sued OpenAI for direct and vicarious copyright infringement, DMCA violations, unfair competition and negligence. Both input (training) and output (derivative works) claims are alleged, as well as state law claims of unfair competition, etc. Most state law and DMCA claims have been dismissed, but claims based on unauthorized copying during the AI training process remain. An amended complaint is likely to come in March. The court has directed the amended complaint to consolidate Tremblay v. OpenAI, Chabon v. OpenAI, and Silverman v. OpenAI. ย 

Kadrey v. Meta (N.D. Cal. 2023)

Complaint filed July 7, 2023. Sarah Silverman and other authors allege Meta infringed copyrights in their works by making copies of them while training Metaโ€™s AI model; that the AI model is itself an infringing derivative work; and that outputs are infringing copies of their works. Plaintiffs also allege DMCA violations, unfair competition, unjust enrichment, and negligence. The court granted Metaโ€™s motion to dismiss all claims except the claim that unauthorized copies were made during the AI training process. An amended complaint and answer have been filed.

In 2025, Judge Chhabria ruled in Meta’s favor on fair use with respect to AI training; reserved the motion for summary judgment on the DMCA claims for decision in a separate order, and held that the claim of infringing distribution via leeching or seeding “will remain a live issue in the case.”

J.L. v. Google (N.D. Cal. 2023)

Complaint filed July 11, 2023. An author filed a complaint against Google alleging misuse of content posted on social media and Google platforms to train Googleโ€™s AI Bard. (Gemini is the successor to Googleโ€™s Bard.) Claims include copyright infringement, DMCA violations, and others. J.L. filed an amended complaint and Google has filed a motion to dismiss it. A hearing is scheduled for May 16, 2024.

Chabon v. OpenAI (N.D. Cal. 2023)

Complaint filed September 9, 2023. Authors allege that OpenAI infringed copyrights while training ChatGPT, and that ChatGPT is itself an unauthorized derivative work. They also assert claims of DMCA violations, unfair competition, negligence and unjust enrichment. The case has been consolidated with Tremblay v. OpenAI, and the cases are now captioned In re OpenAI ChatGPT Litigation.

Chabon v. Meta Platforms (N.D. Cal. 2023)

Complaint filed September 12, 2023. Authors assert copyright infringement claims against Meta, alleging that Meta trained its AI using their works and that the AI model itself is an unauthorized derivative work. The authors also assert claims for DMCA violations, unfair competition, negligence, and unjust enrichment. In November, 2023, the court issued an Order dismissing all claims except the claim of unauthorized copying in the course of training the AI. The court described the claim that an AI model trained on a work is a derivative of that work as โ€œnonsensical.โ€

Authors Guild v. OpenAI, Microsoft, et al. (S.D.N.Y. 2023)

Complaint filed September 19, 1023. Book and fiction writers filed a complaint for copyright infringement in connection with defendants’ training AI on copies of their works without permission. A motion to dismiss has been filed.

Huckabee v. Bloomberg, Meta Platforms, Microsoft, and EleutherAI Institute (S.D.N.Y. 2023)

Complaint filed October 17, 2023. Political figure Mike Huckabee and others allege that the defendants trained AI tools on their works without permission when they used Books3, a text dataset compiled by developers; that their tools are themselves unauthorized derivative works; and that every output of their tools is an infringing derivative work. ย Claims against EleutherAI have been voluntarily dismissed. Claims against Meta and Microsoft have been transferred to the Northern District of California. Bloomberg is expected to file a motion to dismiss soon.

Huckabee v. Meta Platforms and Microsoft (N.D. Cal. 2023)

Complaint filed October 17, 2023. Political figure Mike Huckabee and others allege that the defendants trained AI tools on their works without permission when they used Books3, a text dataset compiled by developers; that their tools are themselves unauthorized derivative works; and that every output of their tools is an infringing derivative work. Plaintiffs have filed an amended complaint. Plaintiffs have stipulated to dismissal of claims against Microsoft without prejudice.

Concord Music Group v. Anthropic (M.D. Tenn. 2023)

Complaint filed October 18, 2023. Music publishers claim that Anthropic infringed publisher-owned copyrights in song lyrics when they allegedly were copied as part of an AI training process (Claude) and when lyrics were reproduced and distributed in response to prompts. They have also made claims of contributory and vicarious infringement. Motions to dismiss and for a preliminary injunction are pending.

Alter v. OpenAI and Microsoft (S.D.N.Y. 2023)

Complaint filed November 21, 2023. Nonfiction author alleges claims of copyright infringement and contributory copyright infringement against OpenAI and Microsoft, alleging that reproducing copies of their works in datasets used to train AI infringed copyrights. The court has ordered consolidation of Authorโ€™s Guild (23-cv-8292) and Alter (23-cv-10211). On February 12,2024, plaintiffs in other cases filed a motion to intervene and dismiss.

New York Times v. Microsoft and OpenAI (S.D.N.Y. 2023)

Complaint filed December 27, 2023. The New York Times alleges that their news stories were used to train AI without a license or permission, in violation of their exclusive rights of reproduction and public display, as copyright owners. The complaint also alleges vicarious and contributory copyright infringement, DMCA violations, unfair competition, and trademark dilution. The Times seeks damages, an injunction against further infringing conduct, and a Section 503(b) order for the destruction of โ€œall GPT or other LLM models and training sets that incorporate Times Works.โ€ On February 23, 2024, plaintiffs in other cases filed a motion to intervene and dismiss this case. ย 

Basbanes and Ngagoyeanes v. Microsoft and OpenAI (S.D.N.Y. 2024)

Complaint filed January 5, 2024. Nonfiction authors assert copyright claims against Microsoft and OpenAI. On February 6, 2024, the court consolidated this case with Authors Guild (23-cv-08292) and Alter v. Open AI (23-cv-10211), for pretrial purposes.ย ย 

Trademark

Getty Images v. Stability AI (D. Del.)

Complaint filed against Stability AI by Getty Images on February 3, 2023. Getty Images alleges claims of copyright infringement, DMCA violation and trademark violations against Stability AI. The judge has dismissed without prejudice a motion to dismiss or transfer on jurisdictional grounds. The motion may be re-filed after the conclusion of jurisdictional discovery, which is ongoing.

Privacy and Publicity Rights

Flora v. Prisma Labs (N.D. Cal.)

Complaint filed February 15, 2023. Plaintiffs allege violations of the Illinois Biometric Privacy Act in connection with Prisma Labsโ€™ collection and retention of usersโ€™ selfies in AI training. The court has granted Prismaโ€™s motion to compel arbitration.

Kyland Young v. NeoCortext (C.D. Cal. 2023)

Complaint filed April 3, 2023. This complaint alleges that AI tool Reface used a personโ€™s image without consent, in violation of the personโ€™s publicity rights under California law. The court has denied a motion to dismiss, ruling that publicity rights claims are not preempted by federal copyright law. The case has been stayed pending appeal.

P.M. v. OpenAI (N.D. Cal. 2023).

Complaint filed June 28, 2023. Users claim OpenAI violated the federal Electronic Communications Privacy Act and California wiretapping laws by collecting their data when they input content into ChatGPT. They also claim violations of the Computer Fraud and Abuse Act. Plaintiffs voluntarily dismissed the case on September 15, 2023. See now A.T. v. OpenAI (N.D. Cal. 2023) (below).

A.T. v. OpenAI (N.D. Cal. 2023)

Complaint filed September 5, 2023. ChatGPT users claim the company violated the federal Electronic Communications Privacy Act, the Computer Fraud and Abuse Act, and California Penal Code section 631 (wiretapping). The gravamen of the complaint is that ChatGPT allegedly accessed usersโ€™ platform access and intercepted their private information without their knowledge or consent. Motions to dismiss and to compel arbitration are pending.

Defamation

Walters v. OpenAI (Gwinnett County Super. Ct. 2023), and Walters v. OpenAI (N.D. Ga. 2023)

Gwinnett County complaint filed June 5, 2023.

Federal district court complaint filed July 14, 2023.

Radio talk show host sued OpenAI for defamation. A reporter had used ChatGPT to get information about him. ChatGPT wrongly described him as a person who had been accused of fraud. In October, 2023, the federal court remanded the case to the Superior Court of Gwinnett County, Georgia. ย On January 11, 2024, the Gwinnett County Superior Court denied OpenAIโ€™s motion to dismiss.

Battle v. Microsoft (D. Md. 2023)

Complaint filed July 7, 2023. Pro se defamation complaint against Microsoft alleging that Bing falsely described him as a member of the โ€œPortland Seven,โ€ a group of Americans who tried to join the Taliban after 9/11.

 

Caveat

This list is not exhaustive. There may be other cases involving AI that are not included here. For a discussion of bias issues in Google’s Gemini, have a look at Scraping Bias on Medium.com.

The Top 3 Generative-AI Copyright Issues

What are your favorite generative-AI copyright issues? In this capsule summary, Cokato attorney Tom James shares what his three favorites are.

Black hole consuming a star. Photo credit: NASA
Black hole consuming a star. Photo credit: NASA.

Having outlined the top AI legal issues, the time has come now to pick favorites. In this capsule summary, Cokato attorney Tom James shares what his three favorite generative-AI copyright issues are.

Generative artificial intelligence refers collectively to technology that is capable of generating new text, images, audio/visual and possibly other content in response to a user’s prompts. They are trained by feeding them mass quantities of ABC (already-been-created) works. Some of America’s biggest mega-corporations have invested billions of dollars into this technology. They are now facing a barrage of lawsuits, most of them asserting claims of copyright infringement.

Issue #1: Does AI Output Infringe Copyrights?

Copyrights give their owners an exclusive right to reproduce their copyright-protected works and to create derivative works based on them. If a generative-AI user prompts the service to reproduce the text of a pre-existing work, and it proceeds to do so, this could implicate the exclusive right of reproduction. If a generative-AI user prompts it to create a work in the style of another work and/or author, this could implicate the exclusive right to create derivative works.

To establish infringement, it will be necessary to prove copying. Two identical but independently created works may each be protected by copyright. Put another way, a person is not guilty of infringement merely by creating a work that is identical or similar to another if he/she/it came up with it completely on his/her/its own.

Despite “training” their proteges on existing works, generative-AI outfits deny that their tools actually copy any of them. They say that any similarity to any existing works, living or dead, is purely coincidental. Thus, OpenAI has stated that copyright infringement “is an unlikely accidental outcome.”

The “accidental outcome” defense seems to me like a hard one to swallow in those cases where a prompt involves creating a story involving a specified fictional character that enjoys copyright protection. If the character is distinctive enough — and a piece of work in and of itself, so to speak — to enjoy copyright protection (such as, say, Mr. Spock from the Star Trek series), then any generated output would seem to be an unauthorized derivative work, at least if the AI tool is any good.

If AI output infringes a copyright in an existing work, who would be liable for it? Potentially, the person who entered the prompt might be held liable for direct infringement. The AI tool provider might arguably be liable for contributory infringement.

Issue #2: Does AI Training Infringe Copyrights?

AI systems are โ€œtrainedโ€ to create works by exposing a computer program system to large numbers of existing works downloaded from the Internet.

When content is downloaded from the Internet, a copy of it is made. This process will “involve the reproduction of entire works or substantial portions thereof.” OpenAI, for example, acknowledges that its programs are trained on โ€œlarge, publicly available datasets that include copyrighted worksโ€ and that this process โ€œinvolves first making copies of the data to be analyzed….” Making these copies without permission may infringe the copyright holders’ exclusive right to make reproductions of their works.

Generative-AI outfits tend to argue that the training process is fair use.

Fair use claims require consideration of four statutory factors:

  • the purpose and character of the use;
  • the nature of the work copied;
  • the amount and substantiality of the portion copied; and
  • the effect on the market for the work.

OpenAI relies on the precedent set in Authors Guild v. Google for its invocation of “fair use.” That case involved Google’s copying of the entire text of books to construct its popular searchable database.

A number of lawsuits currently pending in the courts are raising the question whether and how, the AI training process is “fair use.” For more information, have a look at Does AI Infringe Copyright?ย 

Issue #3: Are AI-Generated Works Protected by Copyright?

The Copyright Act affords copyright protection to โ€œoriginal works of authorship.โ€ The U.S. Copyright Office recognizes copyright only in works โ€œcreated by a human being.โ€ Courts, too, have declined to extend copyright protection to nonhuman authors. (Remember the monkey selfie case?) A recent copyright registration applicant has filed a lawsuit against the U.S. Copyright Office alleging that the Office wrongfully denied registration of an AI-generated work. A federal court has now rejected his argument that human authorship is not required for copyright ownership.

In March 2023, the Copyright Office released guidance stating that when AI โ€œdetermines the expressive elements of its output, the generated material is not the product of human authorship.โ€ Moreover, an argument might be made that a general prompt, such as “create a story about a dog in the style of Jack London,” is an idea, not expression. It is well settled that only expression gets copyright protection; ideas do not.

In September 2023, the Copyright Office Review Board affirmed the Officeโ€™s refusal to register a copyright in a work that was generated by Midjourney and then modified by the applicant, on the basis that the applicant did not disclaim the AI-generated material.

The Office also has the power to cancel improvidently granted registrations. (Words to the wise: Disclose and disclaim.)

These are my favorite generative-AI legal issues. What are yours?

Generative-AI: The Top 12 Lawsuits

Artificial intelligence (โ€œAIโ€) is generating more than content; it is generating lawsuits. Here is a brief chronology of what I believe are the most significant lawsuits that have been filed so far.

Artificial intelligence (โ€œAIโ€) is generating more than content; it is generating lawsuits. Here is a brief chronology of what I believe are the most significant lawsuits that have been filed so far.

Most of these allege copyright infringement, but some make additional or other kinds of claims, such as trademark, privacy or publicity right violations, defamation, unfair competition, and breach of contract, among others. So far, the suits primarily target the developers and purveyors of generative AI chatbots and similar technology. They focus more on what I call “input” than on “output” copyright infringement. That is to say, they allege that copyright infringement is involved in the way particular AI tools are trained.

Thomson Reuters Enterprise Centre GmbH et al. v. ROSS Intelligence (May, 2020)

Thomson Reuters Enterprise Centre GmbH et al. v. ROSS Intelligence Inc., No. 20-cv-613 (D. Del. 2020)

Thomson Reuters alleges that ROSS Intelligence copied its Westlaw database without permission and used it to train a competing AI-driven legal research platform. In defense, ROSS has asserted that it only copied ideas and facts from the Westlaw database of legal research materials. (Facts and ideas are not protected by copyright.) ROSS also argues that its use of content in the Westlaw database is fair use.

One difference between this case and subsequent generative-AI copyright infringement cases is that the defendant in this case is alleged to have induced a third party with a Westlaw license to obtain allegedly proprietary content for the defendant after the defendant had been denied a license of its own. Other cases involve generative AI technologies that operate by scraping publicly available content.

The parties filed cross-motions for summary judgment. While those motions were pending, the U.S. Supreme Court issued its decision in Andy Warhol Found. for the Visual Arts, Inc. v. Goldsmith, 598 U.S. ___, 143 S. Ct. 1258 (2023). The parties have now filed supplemental briefs asserting competing arguments about whether and how the Courtโ€™s treatment of transformative use in that case should be interpreted and applied in this case. A decision on the motions is expected soon.

The court has now issued a fair use decision in Thomson Reuters v. ROSS Intelligence.

Doe 1 et al. v. GitHub et al. (November, 2022)

Doe 1 et al. v. GitHub, Inc. et al., No. 22-cv-06823 (N.D. Calif. November 3, 2022)

This is a class action lawsuit against GitHub, Microsoft, and OpenAI that was filed in November, 2022. It involves GitHubโ€™s CoPilot, an AI-powered tool that suggests lines of programming code based on what a programmer has written. The complaint alleges that Copilot copies code from publicly available software repositories without complying with the terms of applicable open-source licenses. The complaint also alleges removal of copyright management information in violation of 17 U.S.C. ยง 1202, unfair competition, and other tort claims.

Andersen et al. v. Stability AI et al. (January 13, 2023)

Andersen et al. v. Stability AI et al., No. 23-cv-00201 (N.D. Calif. Jan. 13, 2023)

Artists Sarah Andersen, Kelly McKernan, and Karla Ortiz filed this class action lawsuit against generative-AI companies Stability AI, Midjourney, and DeviantArt on January 13, 2023. The lawsuit alleges that the defendants infringed their copyrights by using their artwork without permission to train AI-powered image generators to create allegedly infringing derivative works. ย The lawsuit also alleges violations of 17 U.S.C. ยง 1202 and publicity rights, breach of contract, and unfair competition.

Read my case update on Andresen v. Stability AI.ย 

Getty Images v. Stability AI (February 3, 2023)

Getty Images v. Stability AI, No. 23-cv-00135-UNA (D. Del. February 23, 2023)

Getty Images has filed two lawsuits against Stability AI, one in the United Kingdom and one in the United States, each alleging both input and output copyright infringement. Getty Images owns the rights to millions of images. It is in the business of licensing rights to use copies of the images to others. The lawsuit also accuses Stability AI of falsifying, removing or altering copyright management information, trademark infringement, trademark dilution, unfair competition, and deceptive trade practices.

Stability AI has moved to dismiss the complaint filed in the U.S. for lack of jurisdiction.

Flora et al. v. Prisma Labs (February 15, 2023)

Flora et al. v. Prisma Labs, Inc., No. 23-cv-00680 (N.D. Calif. February 15, 2023)

Jack Flora and others filed a class action lawsuit against Prisma Labs for invasion of privacy. The complaint alleges, among other things, that the defendantโ€™s Lensa app generates sexualized images from images of fully-clothed people, and that the company failed to notify users about the biometric data it collects and how it will be stored and/or destroyed, in violation of Illinoisโ€™s data privacy laws.

Young v. NeoCortext (April 3, 2023)

Young v. NeoCortext, Inc., 2023-cv-02496 (C.D. Calif. April 3, 2023)

This is a publicity rights case. NeoCortextโ€™s Reface app allows users to paste images of their own faces over those of celebrities in photographs and videos. Kyland Young, a former cast member of the Big Brother reality television show, has sued NeoCortext for allegedly violating his publicity rights. The complaint alleges that NeoCortext has โ€œcommercially exploit[ed] his and thousands of other actors, musicians, athletes, celebrities, and other well-known individualsโ€™ names, voices, photographs, or likenesses to sell paid subscriptions to its smartphone application,ย Reface,ย without their permission.โ€

NeoCortext has asserted a First Amendment defense, among others.

Walters v. Open AI (June 5, 2023)

Walters v. OpenAI, LLC, No. 2023-cv-03122 (N.D. Ga. July 14, 2023) (Complaint originally filed in Gwinnett County, Georgia Superior Court on June 5, 2023; subsequently removed to federal court)

This is a defamation action against OpenAI, the company responsible for ChatGPT. The lawsuit was brought by Mark Walters. He alleges that ChatGPT provided false and defamatory misinformation about him to journalist Fred Riehl in connection with a federal civil rights lawsuit against Washington Attorney General Bob Ferguson and members of his staff. ChatGPT allegedly stated that the lawsuit was one for fraud and embezzlement on the part of Mr. Walters. The complaint alleges that Mr. Walters was โ€œneither a plaintiff nor a defendant in the lawsuit,โ€ and โ€œevery statement of factโ€ pertaining to him in the summary of the federal lawsuit that ChatGPT prepared is false. A New York court recently addressed the questions of sanctions for attorneys who submit briefs containing citations to non-existent โ€œprecedentsโ€ that were entirely made up by ChatGPT. This is the first case to address tort liability for ChatGPTโ€™s notorious creation of โ€œhallucinatory facts.โ€

In July, 2023, Jeffery Battle filed a complaint against Microsoft in Maryland alleging that he, too, has been defamed as a result of AI-generated โ€œhallucinatory facts.โ€

P.M. et al. v. OpenAI et al. (June 28, 2023)

P.M. et al. v. OpenAI LP et al., No. 2023-cv-03199 (N.D. Calif. June 28, 2023)

This lawsuit has been brought by underage individuals against OpenAI and Microsoft. The complaint alleges the defendantsโ€™ generative-AI products ChatGPT, Dall-E and Vall-E collect private and personally identifiable information from children without their knowledge or informed consent. The complaint sets out claims for alleged violations of the Electronic Communications Privacy Act; the Computer Fraud and Abuse Act; Californiaโ€™s Invasion of Privacy Act and unfair competition law; Illinoisโ€™s Biometric Information Privacy Act, Consumer Fraud and Deceptive Business Practices Act, and Consumer Fraud and Deceptive Business Practices Act; New York General Business Law ยง 349 (deceptive trade practices); and negligence, invasion of privacy, conversion, unjust enrichment, and breach of duty to warn.

Tremblay v. OpenAI (June 28, 2023)

Tremblay v. OpenAI, Inc., No. 23-cv-03223 (N.D. Calif. June 28, 2023)

Another copyright infringement lawsuit against OpenAI relating to its ChatGPT tool. In this one, authors allege that ChatGPT is trained on the text of books they and other proposed class members authored, and facilitates output copyright infringement. The complaint sets forth claims of copyright infringement, DMCA violations, and unfair competition.

Silverman et al. v. OpenAI (July 7, 2023)

Silverman et al. v. OpenAI, No. 23-cv-03416 (N.D. Calif. July 7, 2023)

Sarah Silverman (comedian/actress/writer) and others allege that OpenAI, by using copyright-protected works without permission to train ChatGPT, committed direct and vicarious copyright infringement, violated section 17 U.S.C. 1202(b), and their rights under unfair competition, negligence, and unjust enrichment law.

The judge has issued a ruling on fair use.

Kadrey et al. v. Meta Platforms (July 7, 2023)

Kadrey et al. v. Meta Platforms, No. 2023-cv-03417 (N.D. Calif. July 7, 2023)

The same kinds of allegations as are made in Silverman v. OpenAI, but this time against Meta Platforms, Inc.

There has been a ruling in Kadrey v. Meta Platforms.ย 

J.L. et al. v. Alphabet (July 11, 2023)

J.L. et al. v. Alphabet, Inc. et al. (N.D. Calif. July 11, 2023)

This is a lawsuit against Google and its owner Alphabet, Inc. for allegedly scraping and harvesting private and personal user information, copyright-protected works, and emails, without notice or consent. The complaint alleges claims for invasion of privacy, unfair competition, negligence, copyright infringement, and other causes of action.

On the Regulatory Front

The U.S. Copyright Office is examining the problems associated with registering copyrights in works that rely, in whole or in part, on artificial intelligence. The U.S. Federal Trade Commission (FTC) has suggested that generative-AI implicates โ€œcompetition concerns.โ€. Lawmakers in the United States and the European Union are considering legislation to regulate AI in various ways.

Copyright owners prevail in Internet Archive lawsuit

A federal district court has ruled in favor of book publishers in their copyright infringement lawsuit against Internet Archives

In June, 2020 four book publishers filed a copyright infringement lawsuit against Internet Archive. The publishers asserted that the practice of scanning books and lending digital copies of them to online users infringed their copyrights in the books. On Friday, March 24, 2023, a federal district court judge agreed, granting the publishers’ motion for summary judgment.

The Internet Archive operation

Internet Archive is a nonprofit organization that has undertaken several archiving projects. For example, it created the “Wayback Machine,” an online archive of public webpages. This lawsuit involves another of its projects, namely, the creation of a digital archive of books. Some of these are in the public domain. Also included in this archive, however, are over 3 million books that are protected by copyright. The judge determined that 33,000 of them belong to the plaintiffs in the lawsuit.

According to the Order granting summary judgment, after scanning the books, Internet Archive made them publicly available online for free, without the permission of the copyright owners.

“Fair Use”

According to the Order, Internet Archive did not dispute that it violated copyright owners’ exclusive rights to reproduce the works, to make derivative works based on them, to distribute their works, to publicly perform them (Internet Archive offered a “read aloud” function on it website), and to display them (in this case, on a user’s browser.) In short, the Order determined that the operation violated all five of the exclusive rights of copyright owners protected by the United States Copyright Act (17 U.S.C. sec. 106).

Internet Archive asserted a “fair use” defense.

In previous cases involving massive operations to scan and digitize millions of books, Authors Guild v. Google., Inc. and Authors Guild v. HathiTrust, judicial analyses resulted in “fair use” determinations unfavorable to copyright owners. Internet Archive, of course, invited the judge to do the same thing here. The judge declined the invitation.

The judge distinguished this case from its predecessors by ruling that unlike the uses made of copyrighted works in those cases, the use in this case was not transformative. For example, Google had digitized the entire text of books in order to create a searchable index of books. “There is nothing transformative,” however, about copying and distributing the entire texts of books to the public, the judge declared.

The judge observed that Google reproduces and displays to the public only enough context surrounding the searched term to help a reader evaluate whether the book falls within the range of the reader’s interest. The Court of Appeals in Google had warned that “[i]f Plaintiff’s claim were based on Google’s converting their books into a digitized form and making that digitized version accessible to the public,” then the “claim [of copyright infringement] would be strong.”

The judge also determined that the alleged benefit to the public of having access to the entire text of books without having to pay for them “cannot outweigh the market harm to the publishers.”

Ultimately, the judge concluded that all four “fair use” factors (character and purpose of the use, nature of the work, amount and substantiality of the portion copied, and the effect on the market for the work) weighed against a finding of fair use.

What’s next?

Internet Archive apparently intends to appeal the decision. In the meantime, it appears that it will continue other kinds of digitized book services, such as interlibrary loans, citation linking, access for the print-disabled , text and data mining, purchasing e-books, and receiving and preserving books.

 

Update: A copyright win in the Internet Archive lawsuit.

 

 

 

 

Why Machine Training AI with Protected Works is Not Fair Use

… if the underlying goal of copyrightโ€™s exclusive rightsย andย the fair use exception is to promote new โ€œauthorship,โ€ this is doctrinally fatal to the proposal that training AIs on volumes of protected works favors a finding of fair use.

Guest blogger David Newhoff lays out the argument against the claim that training AI systems with copyright-protected works is fair use. David is the author ofย Who Invented Oscar Wilde? The Photograph at the Center of Modern American Copyrightย (Potomac Books 2020) and is a copyright advocate/writer atย The Illusion of More.


As most copyright watchers already know, two lawsuits were filed at the start of the new year against AI visual works companies. In the U.S., a class-action was filed by visual artists against DeviantArt, Midjourney, and Stability AI; and in the UK, Getty Images is suing Stability AI. Both cases allege infringing use of large volumes of protected works fed into the systems to โ€œtrainโ€ the algorithms. Regardless of how these two lawsuits might unfold, I want to address the broad defense, already being argued in the blogosphere, that training generative AIs with volumes of protected works is fair use. I donโ€™t think so.

Copyright advocates, skeptics, and even outright antagonists generally agree that the fair use exception, correctly applied, supports the broad aim of copyright law to promote more creative work. In the language of the Constitution, copyright โ€œpromotes the progress of science,โ€ but a more accurate, modern description would be that copyright promotes new โ€œauthorshipโ€ because we do not tend to describe literature, visual arts, music, etc. as โ€œscience.โ€

The fair use doctrine, codified in the federal statute in 1976, originated as judge-made law, and from the seminalย Folsom v. Marshย to the contemporaryย Andy Warhol Foundation v. Goldsmith, the courts have restated, in one way or another, their responsibility to balance the first authorโ€™s exclusive rights with a follow-on authorโ€™s interest in creating new expression. And as a matter of general principle, it is held that the public benefits from this balancing act because the result is a more diverse market of creative and cultural works.

Fair use defenses are case-by-case considerations and while there may be specific instances in which an AI purpose may be fair use, there are no blanket exceptions. More broadly, though, if the underlying goal of copyrightโ€™s exclusive rightsย andย the fair use exception is to promote new โ€œauthorship,โ€ this is doctrinally fatal to the proposal that training AIs on volumes of protected works favors a finding of fair use. Even if a court holds that other limiting doctrines render this activity by certain defendants to be non-infringing, a fair use defense should be rejected at summary judgmentโ€”at least for the current state of the technology, in which the schematic encompassing AI machine, AI developer, and AI user does nothing to promote new โ€œauthorshipโ€ as a matter of law.

The definition of โ€œauthorโ€ in U.S. copyright law means โ€œhuman author,โ€ and there are no exceptions to this anywhere in our history. The mere existence of a work we might describe as โ€œcreativeโ€ is not evidence of an author/owner of that work unless there is a valid nexus between a humanโ€™s vision and the resulting work fixed in a tangible medium. If you find an anonymous work of art on the street, absent further research, it has no legal author who can assert a claim of copyright in the work that would hold up in any court. And this hypothetical emphasizes the point that the legal meaning of โ€œauthorโ€ is more rigorous than the philosophical view that art without humans is oxymoronic. (Although it is plausible to find authorship in a work that combines human creativity with AI, I address that subject below.)

As a matter of law, the AI machine itself is disqualified as an โ€œauthorโ€ full stop. And although the AI owner/developer and AI user/customer are presumably both human, neither is defensibly an โ€œauthorโ€ of the expressions output by the AI. At least with the current state of technologies making headlines, nowhere in the processโ€”from training the AI, to developing the algorithm, to entering prompts into the systemโ€”is there an essential link between those contributions and the individual expressions output by the machine. Consequently, nothing about the process of ingesting protected works to develop these systems in the first place can plausibly claim to serve the purpose of promoting new โ€œauthorship.โ€

But What About theย Google Booksย Case?

Indeed. In the fair use defenses AI developers will present, we should expect to see them lean substantially on the holding inย Authors Guild v.ย Google Booksโ€”a decision which arguably exceeds the purpose of fair use to promote new authorship. The Second Circuit, while acknowledging that it was pushing the boundaries of fair use, found the Google Books tool to be โ€œtransformativeโ€ for its novel utility in presenting snippets of books; and because that utility necessitates scanning whole books into its database, a defendant AI developer will presumably want to make the comparison. But a fair use defense applied to training AIs with volumes of protected works should fail, even under the highly utilitarian holding inย Google Books.

While people of good intent can debate the legal merits of that decision, the utility of the Google Books search engine does broadly serve the interest of new authorship with a useful research toolโ€”one I have used many times myself. Google Books provides a new means by which one author may research the works of another author, and this is immediately distinguishable from the generative AI which may be trained to โ€œwrite booksโ€ without authors. Thus, not only does the generative AI fail to promote authorship of the individual works output by the system, but it fails to promote authorship in general.

Although the technology is primitive for the moment, these AIs are expected to โ€œlearnโ€ exponentially and grow in complexity such that AIs will presumably compete with or replace at least some human creators in various fields and disciplines. Thus, an enterprise which proposes to diminish the number of working authors, whether intentionally or unintentionally, should only be viewed as devastating to the purpose of copyright law, including the fair use exception.

AI proponents may argue that โ€œdemocratizingโ€ creativity (i.e., putting these tools in every hand) promotes authorship by makingย everyoneย an author. But aside from the cultural vacuum thisย illusion of moreย would create, the user prompting the AI has a high burden to prove authorship, and it would really depend on what he is contributing relative to the AI. As mentioned above, some AIs may evolve as tools such that the human in some way โ€œcollaboratesโ€ with the machine to produce a work of authorship. But this hypothetical points to the reason why fair use is a fact-specific, case-by-case consideration. AI Alpha, which autonomously creates, or creates mostly without human direction, should not benefit from the potential fair use defense of AI Beta, which produces a tool designed to aid, but not replace, human creativity.

Broadly Transformative? Donโ€™t Even Go There

Returning to the constitutional purpose of copyright law to โ€œpromote science,โ€ the argument has already been floated as a talking point that training AI systems with protected works promotes computer science in general and is, therefore, โ€œtransformativeโ€ under fair use factor one for this reason. But this argument should find no purchase in court. To the extent that one of these neural networks might eventually spawn revolutionary utility in medicine or finance etc., it would be unsuitable to ask a court to hold that such voyages of general discovery fit the purpose of copyright, to say nothing of the likelihood that the adventure strays inevitably into patent law. Even the most elastic fair use findings to date reject such a broad defense.

It may be shown that no work(s) output by a particular AI infringes (copies) any of the works that went into its training. It may also be determined that the corpus of works fed into an AI is so rapidly atomized into data that even fleeting โ€œreproductionโ€ is found not to exist, and, thus, the 106(1) right is not infringed. Those questions are going to be raised in court before long, and we shall see where they lead. But to presume fair use as a broad defense for AI โ€œtrainingโ€ is existentially offensive to the purpose of copyright, and perhaps to law in general, because it asks the courts to vest rights in non-humans, which is itself anathema to caselaw in other areas.[1]

It is my oft-stated opinion that creative expression without humans is meaningless as a cultural enterprise, but it is a matter of law to say that copyright is meaningless without โ€œauthorsโ€ and that there is no such thing as non-human โ€œauthors.โ€ For this reason, the argument that training AIs on protected works is inherently fair use should be denied with prejudice.

*****ย 
n.b.: The Copyright Office has issued New AI Copyright Guidance.ย 


[1]ย Cetaceans v. Bushย holding that animals do not have standing in court was the basis for rejecting PETAโ€™S complaint against photographer Slater for infringing the copyright rights of the monkey in the โ€œMonkey Selfieโ€ fiasco.