A watermark for chatbots can spot textual content written by an AI

0
1277
A watermark for chatbots can spot textual content written by an AI


For instance, since OpenAI’s chatbot ChatGPT was launched in November college students have already began utilizing it to cheat by writing essays for them. News web site CNET has used ChatGPT to write down articles, solely to should situation corrections amid accusations of plagiarism. But there’s a promising solution to spot AI textual content: by embedding hidden patterns that allow us determine AI-generated textual content into these methods earlier than they’re launched. 

In research, these watermarks have already proven that they’ll determine AI-generated textual content with close to certainty. One, developed by a group on the University of Maryland, was in a position to spot textual content created by Meta’s open supply language mannequin, OPT-6.7B, utilizing a detection algorithm they constructed. The work is described in a paper that’s but to be peer reviewed, and the code will likely be obtainable without spending a dime round February 15. 

AI language fashions work by predicting and producing one phrase at a time. After every phrase, the watermarking algorithm randomly divides the language mannequin’s vocabulary into phrases on a “greenlist” and a “redlist,” after which prompts the language mannequin to decide on phrases on the greenlist. 

The extra greenlisted phrases in a passage, the extra probably it’s that the textual content is generated by a machine. Text written by an individual tends to include a extra random mixture of phrases. For instance, for the phrase “beautiful”, the watermarking algorithm might classify the phrase “flower” as inexperienced, and “orchid” as purple. The AI mannequin with the watermarking algorithm could be extra probably to make use of the phrase “flower” than “orchid,” explains Tom Goldstein, an assistant professor on the University of Maryland, who was concerned within the analysis. 

LEAVE A REPLY

Please enter your comment!
Please enter your name here