LibGuides: Copyright Crash Course: Artificial Intelligence

Artificial Intelligence (AI)

As generative artificial intelligence (generative AI) becomes increasingly widespread, it raises many ethical, practical, and legal issues, including copyright issues. UT Austin has a helpful summary and link to AI resources.

Generative AI is used to create material through a growing number of platforms such as ChatGPT and Bard for textual output and Dall-E and Midjourney for images. These tools are trained on massive collections of materials – both public domain and copyrighted. As of fall 2023, there are several lawsuits in process related to the use of copyrighted works in training AI. The plaintiffs in the lawsuits claim the use of copyrighted works without permission is an infringement of copyright. In opposition, some legal scholars have pointed out that non-consumptive uses of copyrighted content (Google Books, HathiTrust) have been considered fair use in previous court cases. While these cases are in process, we won’t have definitive answers about whether the use of copyrighted works in AI training data is legal.

In contrast, the US Copyright Office and courts have stated clearly that because human authorship is required for copyright protection of a work, generative AI generally does not create copyrightable output.

One example of this principle is the graphic novel, Zarya of the Dawn. The Copyright Office refused to allow copyright protection of illustrations for the book that were created by the AI tool Midjourney, even though the author had provided prompts and edited the images. The Copyright Office did, however, allow copyright protection for the author-created text of the story, and for the compilation of text and images of illustrations as an integrated work.

While the Zarya determination provides some guidance on what outputs of generative AI are and are not protectable in copyright, the law around this topic is likely to evolve quickly. The Copyright Office has a page about copyright and AI, which includes an explanatory resource about copyright registration for works containing content generated by AI. The Congressional Research Service also has a document providing information about copyright and generative AI.

Aside from the question of copyrightability, it is also important to understand that output created by generative AI tools is not guaranteed to be accurate or free from legal liability. It is well documented that generative AI sometimes creates “hallucinations,” which are outputs that purport to be true but are demonstrably false. Additionally, textual or visual outputs created by generative AI might subject a user to liability such as copyright infringement, defamation, or other liability that any creator might encounter. Thus, the general guidance in the Copyright Crash Course about avoiding infringement applies to works of authorship created with the assistance of generative AI.

This section of the Copyright Crash Course was mainly created by Marc Vockell, Office of General Counsel for UT System.

Ongoing AI Litigation

AI Litigation Database
George Washington University has created a searchable database of existing litigation related to artificial intelligence and machine learning.
AI Copyright Case Tracker
Wired is also tracking copyright cases related to AI, and has a nice visualization of plaintiffs and defendants.