Large language models have led to an increase in machine-generated content, raising concerns about potential misuse. This study focuses on creating automated systems to detect machine-generated texts and address misuse.
AIpom, a novel method for human-machine mixed text detection, leverages a pipeline of decoder and encoder models to accurately identify the boundary between human-written and machine-generated text.
Contrastive learning can be an effective approach for detecting machine-generated text, even with a single model and without relying on the specific text generation model used.
Combining linguistic features and language model embeddings can effectively distinguish machine-generated text from human-written text, even across unseen language models and domains.