Stress-testing Machine Generated Text Detection: Shifting Language Models Writing Style to Fool Detectors Paper • 2505.24523 • Published May 30 • 9
Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 134 items • Updated Oct 20 • 116