Related papers: Between Copyright and Computer Science: The Law and Ethics of Generative AI

Between Copyright and Computer Science: The Law and Ethics of Generative AI

URL: http://arxiv.org/abs/2403.14653v2
Date: Thu, 5 Sep 2024 19:24:42 GMT
Title: Between Copyright and Computer Science: The Law and Ethics of Generative AI
Authors: Deven R. Desai, Mark Riedl,
Abstract summary: Copyright and computer science continue to intersect and clash, but they can coexist. This Article shows that, contrary to some scholars' views, fair use law does not bless all ways that someone can gain access to copyrighted material. The copyright industry claims, however, that almost all uses of copyrighted material must be compensated, even for non-expressive uses.
Score: 1.534667887016089
License: http://creativecommons.org/licenses/by/4.0/
Abstract: Copyright and computer science continue to intersect and clash, but they can coexist. The advent of new technologies such as digitization of visual and aural creations, sharing technologies, search engines, social media offerings, and more challenge copyright-based industries and reopen questions about the reach of copyright law. Breakthroughs in artificial intelligence research, especially Large Language Models that leverage copyrighted material as part of training models, are the latest examples of the ongoing tension between copyright and computer science. The exuberance, rush-to-market, and edge problem cases created by a few misguided companies now raises challenges to core legal doctrines and may shift Open Internet practices for the worse. That result does not have to be, and should not be, the outcome. This Article shows that, contrary to some scholars' views, fair use law does not bless all ways that someone can gain access to copyrighted material even when the purpose is fair use. Nonetheless, the scientific need for more data to advance AI research means access to large book corpora and the Open Internet is vital for the future of that research. The copyright industry claims, however, that almost all uses of copyrighted material must be compensated, even for non-expressive uses. The Article's solution accepts that both sides need to change. It is one that forces the computer science world to discipline its behaviors and, in some cases, pay for copyrighted material. It also requires the copyright industry to abandon its belief that all uses must be compensated or restricted to uses sanctioned by the copyright industry. As part of this re-balancing, the Article addresses a problem that has grown out of this clash and under theorized.

Related papers

Bridging the Copyright Gap: Do Large Vision-Language Models Recognize and Respect Copyrighted Content? [47.50752173848172]
Large vision-language models (LVLMs) have achieved remarkable advancements in multimodal reasoning tasks.<n>Will LVLMs accurately recognize and comply with copyright regulations when encountering copyrighted content in the context?
arXiv Detail & Related papers (2025-12-26T05:09:55Z)
Red Teaming for Generative AI, Report on a Copyright-Focused Exercise Completed in an Academic Medical Center [49.85176045690678]
Generative artificial intelligence (AI) deployment in academic medical settings raises copyright compliance concerns.<n>Dana-Farber Cancer Institute implemented GPT4DFCI, an internal generative AI tool utilizing OpenAI models.<n>Four teams attempted to extract copyrighted content from GPT4DFCI across four domains.
arXiv Detail & Related papers (2025-06-26T23:11:49Z)
The Author Is Sovereign: A Manifesto for Ethical Copyright in the Age of AI [0.0]
In the age of AI, authorship is being quietly eroded by algorithmic content scraping, legal gray zones like "fair use," and platforms that profit from creative labor without consent or compensation.<n>This short manifesto proposes a radical alternative: a system in which the author is sovereign of their intellectual domain.
arXiv Detail & Related papers (2025-04-03T03:12:42Z)
Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data? [62.72729485995075]
We investigate the effectiveness of watermarking as a deterrent against the generation of copyrighted texts. We find that watermarking adversely affects the success rate of Membership Inference Attacks (MIAs) We propose an adaptive technique to improve the success rate of a recent MIA under watermarking.
arXiv Detail & Related papers (2024-07-24T16:53:09Z)
AIGC-Chain: A Blockchain-Enabled Full Lifecycle Recording System for AIGC Product Copyright Management [30.690595004607385]
The current legal framework for copyright and intellectual property is grounded in the concept of human authorship. In the creation of AIGC, human creators provide conceptual ideas, with AI independently responsible for the expressive elements. It is imperative to reassess the intellectual contributions of all parties involved in the creation of AIGC to ensure a fair allocation of copyright ownership.
arXiv Detail & Related papers (2024-06-21T08:22:39Z)
Fantastic Copyrighted Beasts and How (Not) to Generate Them [83.77348858322523]
Copyrighted characters pose a difficult challenge for image generation services. At least one lawsuit has been awarded damages based on the generation of these characters.
arXiv Detail & Related papers (2024-06-20T17:38:16Z)
©Plug-in Authorization for Human Content Copyright Protection in Text-to-Image Model [71.47762442337948]
State-of-the-art models create high-quality content without crediting original creators. We propose the copyright Plug-in Authorization framework, introducing three operations: addition, extraction, and combination. Extraction allows creators to reclaim copyright from infringing models, and combination enables users to merge different copyright plug-ins.
arXiv Detail & Related papers (2024-04-18T07:48:00Z)
Uncertain Boundaries: Multidisciplinary Approaches to Copyright Issues in Generative AI [2.669847575321326]
The survey aims to stay abreast of the latest developments and open problems. It will first outline methods of detecting copyright infringement in mediums such as text, image, and video. Next, it will delve an exploration of existing techniques aimed at safeguarding copyrighted works from generative models.
arXiv Detail & Related papers (2024-03-31T22:10:01Z)
Generative AI and Copyright: A Dynamic Perspective [0.0]
generative AI is poised to disrupt the creative industry. The compensation to creators whose content has been used to train generative AI models (the fair use standard) and the eligibility of AI-generated content for copyright protection (AI-copyrightability) are key issues. This paper aims to better understand the economic implications of these two regulatory issues and their interactions.
arXiv Detail & Related papers (2024-02-27T07:12:48Z)
Copyright Protection in Generative AI: A Technical Perspective [58.84343394349887]
Generative AI has witnessed rapid advancement in recent years, expanding their capabilities to create synthesized content such as text, images, audio, and code. The high fidelity and authenticity of contents generated by these Deep Generative Models (DGMs) have sparked significant copyright concerns. This work delves into this issue by providing a comprehensive overview of copyright protection from a technical perspective.
arXiv Detail & Related papers (2024-02-04T04:00:33Z)
A Dataset and Benchmark for Copyright Infringement Unlearning from Text-to-Image Diffusion Models [52.49582606341111]
Copyright law confers creators the exclusive rights to reproduce, distribute, and monetize their creative works. Recent progress in text-to-image generation has introduced formidable challenges to copyright enforcement. We introduce a novel pipeline that harmonizes CLIP, ChatGPT, and diffusion models to curate a dataset.
arXiv Detail & Related papers (2024-01-04T11:14:01Z)
Can Copyright be Reduced to Privacy? [23.639303165101385]
We argue that while algorithmic stability may be perceived as a practical tool to detect copying, such copying does not necessarily constitute copyright infringement. If adopted as a standard for detecting an establishing copyright infringement, algorithmic stability may undermine the intended objectives of copyright law.
arXiv Detail & Related papers (2023-05-24T07:22:41Z)
Training Is Everything: Artificial Intelligence, Copyright, and Fair Training [9.653656920225858]
Authors: Companies that use such content to train their AI engine often believe such usage should be considered "fair use" Authors: Copyright owners, as well as their supporters, consider the incorporation of copyrighted works into training sets for AI to constitute misappropriation of owners' intellectual property. We identify both strong and spurious arguments on both sides of this debate.
arXiv Detail & Related papers (2023-05-04T04:01:00Z)
Foundation Models and Fair Use [96.04664748698103]
In the U.S. and other countries, copyrighted content may be used to build foundation models without incurring liability due to the fair use doctrine. In this work, we survey the potential risks of developing and deploying foundation models based on copyrighted content. We discuss technical mitigations that can help foundation models stay in line with fair use.
arXiv Detail & Related papers (2023-03-28T03:58:40Z)

This list is automatically generated from the titles and abstracts of the papers in this site.