Benchmarking Bias in Large Language Models During Role-Playing
Xinyue Li, Zhenpeng Chen, Jie M. Zhang, Yiling Lou, Tianlin Li, Weisong Sun, Yang Liu, Xuanzhe Liu
ACM International Conference on the Foundations of Software Engineering (FSE) 2026
This empirical study builds a benchmark for identifying biased responses when LLMs are prompted to adopt diverse social roles.