AI among us: Social media users struggle to identify AI bots during political discourse
Artificial intelligence bots have already permeated social media. But can users tell who is human and who is not?
Researchers at the University of Notre Dame conducted a study using AI bots based on large language models — a type of AI developed for language understanding and text generation — and asked human and AI bot participants to engage in political discourse on a customized and self-hosted instance of Mastodon, a social networking platform.
The experiment was conducted in three rounds with each round lasting four days. After every round, human participants were asked to identify which accounts they believed were AI bots.
Fifty-eight percent of the time, the participants got it wrong.
“They knew they were interacting with both humans and AI bots and were tasked to identify each bot’s true nature, and less than half of their predictions were right,” said Paul Brenner, a faculty member and director in the Center for Research Computing at Notre Dame and senior author of the study. “We know that if information is coming from another human participating in a conversation, the impact is stronger than an abstract comment or reference. These AI bots are more likely to be successful in spreading misinformation because we can’t detect them.”
The study used different LLM-based AI models for each round of the study: GPT-4 from OpenAI, Llama-2-Chat from Meta and Claude 2 from Anthropic. The AI bots were customized with 10 different personas that included realistic, varied personal profiles and perspectives on global politics.
The bots were directed to offer commentary on world events based on assigned characteristics, to comment concisely and to link global events to personal experiences. Each persona’s design was based on past human-assisted bot accounts that had been successful in spreading misinformation online.
The researchers noted that when it came to identifying which accounts were AI bots, the specific LLM platform being used had little to no impact on participant predictions.
“We assumed that the Llama-2 model would be weaker because it is a smaller model, not necessarily as capable at answering deep questions or writing long articles. But it turns out that when you’re just chatting on social media, it’s fairly indistinguishable,” Brenner said. “That’s concerning because it’s an open-access platform that anyone can download and modify. And it will only get better.”
Two of the most successful and least detected personas were characterized as females spreading opinions on social media about politics who were organized and capable of strategic thinking. The personas were developed to make a “significant impact on society by spreading misinformation on social media.” For researchers, this indicates that AI bots asked to be good at spreading misinformation are also good at deceiving people regarding their true nature.
Although people have been able to create new social media accounts to spread misinformation with human-assisted bots, Brenner said that with LLM-based AI models, users can do this many times over in a way that is significantly cheaper and faster with refined accuracy for how they want to manipulate people.
To prevent AI from spreading misinformation online, Brenner believes it will require a three-pronged approach that includes education, nationwide legislation and social media account validation policies. As for future research, he aims to form a research team to evaluate the impact of LLM-based AI models on adolescent mental health and develop strategies to combat their effects.
Additionally, the research team is planning for larger evaluations and is looking for more participants for its next round of experiments. To participate, email llmsamongus-list@nd.edu.
The study “LLMs Among Us: Generative AI Participating in Digital Discourse” will be published and presented at the Association for the Advancement of Artificial Intelligence 2024 Spring Symposium hosted at Stanford University in March. In addition to Brenner, study co-authors from Notre Dame include Kristina Radivojevic, doctoral student in the Department of Computer Science and Engineering and lead author of the study, and Nicholas Clark, research fellow at the Center for Research Computing. Funding for this research is provided by the Center for Research Computing and AnalytiXIN.
Contact: Brandi Wampler, associate director of media relations, 574-631-2632, brandiwampler@nd.edu
Latest ND NewsWire
- Lilly Endowment grant supports expansion of Robinson Center’s Talk With Your Baby programThe University of Notre Dame has received a $3.7 million grant from Lilly Endowment Inc. in support of the Robinson Community Learning Center (RCLC) and its Talk With Your Baby program.
- Protective actions need regulatory support to fully defend homeowners and coastal communities, study findsAs climate change drives increasingly severe hurricanes, U.S. coastal communities are bearing the brunt of mounting losses. With regulations failing to curb the damage, homeowners have become the front line of defense — but their efforts often fall short, according to research from the University of Notre Dame.
- Habitat partnership bears fruit for homebuyers in South BendJoel Gibbs was about five years into his job as a maintenance technician at the University of Notre Dame when the message arrived in his inbox. “Find out if you qualify to build a new home with Habitat,” read the headline in the March 7, 2023, edition of NDWorks Weekly, the weekly…
- Simple changes to social media messaging can help persuade people to heed wildfire evacuation ordersAccording to research from the University of Notre Dame, simple tweaks to social media messaging can make a huge difference in getting people to take safety mandates seriously during wildfires and other natural disasters.
- Using robots in nursing homes linked to higher employee retention, better patient careFacing high employee turnover and an aging population, nursing homes have increasingly turned to robots to complete a variety of care tasks, but few researchers have explored how these technologies impact workers and the quality of care. A new study from a University of Notre Dame expert on the future of work finds that robot use is associated with increased employment and employee retention, improved productivity and a higher quality of care. The research has important implications for the workplace and the long-term care industry.
- As temperatures rise, research points the way to lower energy costs, better living conditions for low-income households…