Entity

AICompanionBench: Benchmarking LLMs-as-Judges for AI Companion Safety

As AI companion platforms such as Replika and Character.AI rapidly grow, concerns about unsafe human-AI interactions have intensified. This study introduces AICompanionBench, to our knowledge the first publicly available benchmark dataset of human-AI companion conversations annotated with fine-grained safety risk categories. The dataset contains 2,123 real-world Replika conversations collected from Reddit and annotated through human-AI collaboration across nine categories: sexual behavior, antis

Paper · arXiv

cs.AI

Authors: Yanjing Ren, Reza Ebrahimi, TengTeng Ma
Published: 2026-06-03

Abstract ↗

via arXiv · 2606.04867