In October, OpenAI integrated ChatGPT Search into ChatGPT, promising an experience in which users could browse the web and access the latest news from its news partners and sites that have not blocked OpenAI’s web crawler. A new review by Columbia’s Tow Center for Digital Journalism shows that the process may not be as reliable as it sounds.
The Tow Center ran a test to determine how well publisher content is represented in ChatGPT. It selected 10 articles from each of 20 randomly chosen publishers: some that have partnered with OpenAI, some that are involved in lawsuits against OpenAI, and some unaffiliated publishers that either allowed or blocked the web crawler.
The researchers then extracted 200 quotes that, when run through search engines like Google or Bing, pointed back to the source in the top three results. Finally, they asked ChatGPT to identify the quotes’ sources. The goal was to see whether the AI represents publications accurately and credits them for their work. If the tool worked as advertised, it should be able to attribute the sources just as well as the search engines did.
The results varied in accuracy: some answers were entirely correct, some entirely incorrect, and some partially correct. Yet nearly all answers were presented confidently, and the AI rarely admitted it couldn’t produce an answer, even for publishers that had blocked its web crawler. In only seven of the outputs did ChatGPT use words or phrases that signaled it was uncertain, as seen below: