The preprint linked below was recently shared on arXiv.
Title
Synthetic Sources?: Auditing Generative Search Engine Citations for Evidence of AI-Generated Sources
Authors
Mowafak Allaham
Northwestern University
Nicholas Diakopoulos
Northwestern University
Source
via arXiv
DOI: 10.48550/arXiv.2605.23684
Abstract
The growing accessibility of Large Language Models via conversational interfaces capable of responding to users’ questions by drawing on, synthesizing, and citing information from the web (i.e., Generative Search Engines) has simplified the information-seeking process for users. However, with the proliferation of AI-generated content on the web, it is unclear whether these engines can reliably omit citing synthetic sources (i.e., AI-generated sources). Should these engines be unable to do so, this puts users at risk of harm by treating information from AI-generated sources synthesized in responses of generative search engines as equivalent to information from authoritative or official sources. In a step towards identifying whether AI-generated sources are being cited by these engines, this work presents an audit of four generative search engines (ChatGPT, Copilot, Gemini, Perplexity) using a total of 712 real-world human-generated queries spanning domains of public importance: politics, health, and the environment. Our findings show evidence of AI-generated sources being cited across all four generative search engines (~16% of cited sources) and identifies key source web domains these sources belong to that are frequently cited across these engines and topics. In addition, we observed that generative search engines include a somewhat narrow set of repeatedly cited domains while predominantly surfacing a large number of minimally cited domains in responses to users’ queries. These findings contribute to the growing body of work on assessing the risks of generative search engines with the objective of increasing public awareness of their limitations and encouraging appropriate measures to improve information quality and governance of these systems.
Direct to Abstract + Link to Full Text
The post Preprint: “Synthetic Sources?: Auditing Generative Search Engine Citations for Evidence of AI-Generated Sources” appeared first on Library Journal infoDOCKET.
Read original article: Read More
Discover more from DrWeb's Domain
Subscribe to get the latest posts sent to your email.
