Forcing LLMs to be evil during training can make them nicer in the long run

For this study, Lindsey and his colleagues worked to lay down some of that groundwork. Previous research has shown that various dimensions of LLMs’ behavior—from whether they are talking about weddings to persistent traits such as…

Continue Reading