Can Machines Truly Write Like Humans?

The study introduces MAGA-Bench, a dataset that spans 20 distinct domains and draws on 12 generative models to rigorously evaluate detection algorithms. The decoding strategies are detailed in the supplementary material (§E).

A new benchmark and training framework aims to improve the accuracy of detectors that distinguish between text written by people and text generated by artificial intelligence.