Core Concepts
Analyzing the spectrum of relative likelihood scores of texts, using Fourier transform, can effectively distinguish between human-written and model-generated texts, revealing subtle differences in language use.
Stats
The accuracy of FourierGPT on the PubMed dataset, using a supervised classifier, is above 80%.
The accuracy of FourierGPT on the Writing and XSum datasets, using a supervised classifier, is around 70%.
On the PubMed dataset, FourierGPT outperforms Fast-DetectGPT when using the pairwise heuristic-based classifier.
The accuracy of FourierGPT on the Writing dataset, using a pairwise classifier with a bigram language model, reaches 90.67%.
In the PubMed dataset, model-generated answers are significantly more likely to start with "Yes" or "No" compared to human-written answers.