The study’s authors warn that individuals and paper mills — companies that produce fake papers to order and sell authorships — might be exploiting publicly available health data sets and using large language models (LLMs) to mass-produce low-quality papers that lack scientific value.
“If left unaddressed, this AI-based approach can be applied to all sorts of open-access databases, generating far more papers than anyone can imagine,” says Csaba Szabo, a pharmacologist at the University of Fribourg in Switzerland, who was not involved in the work. “This could open up Pandora’s box [and] the literature may be flooded with synthetic papers.”