Related papers: Towards Understanding the Interplay of Generative Artificial Intelligence and the Internet

Towards Understanding the Interplay of Generative Artificial Intelligence and the Internet

URL: http://arxiv.org/abs/2306.06130v1
Date: Thu, 8 Jun 2023 11:14:51 GMT
Title: Towards Understanding the Interplay of Generative Artificial Intelligence and the Internet
Authors: Gonzalo Mart\'inez, Lauren Watson, Pedro Reviriego, Jos\'e Alberto Hern\'andez, Marc Juarez, Rik Sarkar
Abstract summary: generative AI tools can generate realistic images or text, such as DALL-E, MidJourney, or ChatGPT. These tools are possible due to the massive amount of data (text and images) that is publicly available through the Internet. Future versions of generative AI tools will be trained with a mix of human-created and AI-generated content. This interaction raises many questions: how will future versions of generative AI tools behave when trained on a mixture of real and AI generated data?
Score: 6.62688326060372
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: The rapid adoption of generative Artificial Intelligence (AI) tools that can generate realistic images or text, such as DALL-E, MidJourney, or ChatGPT, have put the societal impacts of these technologies at the center of public debate. These tools are possible due to the massive amount of data (text and images) that is publicly available through the Internet. At the same time, these generative AI tools become content creators that are already contributing to the data that is available to train future models. Therefore, future versions of generative AI tools will be trained with a mix of human-created and AI-generated content, causing a potential feedback loop between generative AI and public data repositories. This interaction raises many questions: how will future versions of generative AI tools behave when trained on a mixture of real and AI generated data? Will they evolve and improve with the new data sets or on the contrary will they degrade? Will evolution introduce biases or reduce diversity in subsequent generations of generative AI tools? What are the societal implications of the possible degradation of these models? Can we mitigate the effects of this feedback loop? In this document, we explore the effect of this interaction and report some initial results using simple diffusion models trained with various image datasets. Our results show that the quality and diversity of the generated images can degrade over time suggesting that incorporating AI-created data can have undesired effects on future versions of generative models.

Related papers

What happens when generative AI models train recursively on each others' generated outputs? [10.634199262199859]
We show that data-mediated interactions can benefit models by exposing them to novel concepts perhaps missed in original training data, but also can homogenize their performance on shared tasks.<n>We find that data-mediated interactions can benefit models by exposing them to novel concepts perhaps missed in original training data, but also can homogenize their performance on shared tasks.
arXiv Detail & Related papers (2025-05-27T18:52:34Z)
Reflection on Data Storytelling Tools in the Generative AI Era from the Human-AI Collaboration Perspective [39.96202614397779]
Large-scale generative AI techniques have the potential to enhance data storytelling with their power in visual and narration generation. We compare the collaboration patterns of the latest tools with those of earlier ones using a dedicated framework for understanding human-AI collaboration in data storytelling. The benefits of these AI techniques and other implications to human-AI collaboration are also revealed.
arXiv Detail & Related papers (2025-03-04T13:56:18Z)
"I Am the One and Only, Your Cyber BFF": Understanding the Impact of GenAI Requires Understanding the Impact of Anthropomorphic AI [55.99010491370177]
We argue that we cannot thoroughly map the social impacts of generative AI without mapping the social impacts of anthropomorphic AI. anthropomorphic AI systems are increasingly prone to generating outputs that are perceived to be human-like.
arXiv Detail & Related papers (2024-10-11T04:57:41Z)
Measuring Human Contribution in AI-Assisted Content Generation [68.03658922067487]
This study raises the research question of measuring human contribution in AI-assisted content generation. By calculating mutual information between human input and AI-assisted output relative to self-information of AI-assisted output, we quantify the proportional information contribution of humans in content generation.
arXiv Detail & Related papers (2024-08-27T05:56:04Z)
Synthetic data: How could it be used for infectious disease research? [0.16752458252726457]
Concerns have been raised about potential negative factors associated with the possibilities of artificial dataset generation. These include the potential misuse of generative artificial intelligence in fields such as cybercrime. Synthetic data offers significant benefits, particularly in data privacy, research, in balancing datasets and reducing bias in machine learning models.
arXiv Detail & Related papers (2024-07-03T17:13:04Z)
AI-Generated Images as Data Source: The Dawn of Synthetic Era [61.879821573066216]
generative AI has unlocked the potential to create synthetic images that closely resemble real-world photographs. This paper explores the innovative concept of harnessing these AI-generated images as new data sources. In contrast to real data, AI-generated data exhibit remarkable advantages, including unmatched abundance and scalability.
arXiv Detail & Related papers (2023-10-03T06:55:19Z)
AI for the Generation and Testing of Ideas Towards an AI Supported Knowledge Development Environment [2.0305676256390934]
We discuss how generative AI can boost idea generation by eliminating human bias. We also describe how search can verify facts, logic, and context. This paper introduces a system for knowledge workers, Generate And Search Test, enabling individuals to efficiently create solutions.
arXiv Detail & Related papers (2023-07-17T22:17:40Z)
DeepfakeArt Challenge: A Benchmark Dataset for Generative AI Art Forgery and Data Poisoning Detection [57.51313366337142]
There has been growing concern over the use of generative AI for malicious purposes. In the realm of visual content synthesis using generative AI, key areas of significant concern has been image forgery and data poisoning. We introduce the DeepfakeArt Challenge, a large-scale challenge benchmark dataset designed specifically to aid in the building of machine learning algorithms for generative AI art forgery and data poisoning detection.
arXiv Detail & Related papers (2023-06-02T05:11:27Z)
Constructing Dreams using Generative AI [23.344751807278044]
Generative AI tools introduce new and accessible forms of media creation for youth. They raise ethical concerns about the generation of fake media, data protection, privacy and ownership of AI-generated art. We facilitated students' generative AI learning through expression of their imagined future identities.
arXiv Detail & Related papers (2023-05-19T21:56:12Z)
Seeing is not always believing: Benchmarking Human and Model Perception of AI-Generated Images [66.20578637253831]
There is a growing concern that the advancement of artificial intelligence (AI) technology may produce fake photos. This study aims to comprehensively evaluate agents for distinguishing state-of-the-art AI-generated visual content.
arXiv Detail & Related papers (2023-04-25T17:51:59Z)
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT [63.58711128819828]
ChatGPT and other Generative AI (GAI) techniques belong to the category of Artificial Intelligence Generated Content (AIGC) The goal of AIGC is to make the content creation process more efficient and accessible, allowing for the production of high-quality content at a faster pace.
arXiv Detail & Related papers (2023-03-07T20:36:13Z)
Combining Generative Artificial Intelligence (AI) and the Internet: Heading towards Evolution or Degradation? [6.62688326060372]
generative AI tools that can generate realistic images or text have taken the Internet by storm. Future versions of generative AI tools will be trained with Internet data that is a mix of original and AI-generated data. This raises a few intriguing questions: how will future versions of generative AI tools behave when trained on a mixture of real and AI generated data?
arXiv Detail & Related papers (2023-02-17T17:39:41Z)

This list is automatically generated from the titles and abstracts of the papers in this site.