Personality
Results of evaluations with prompted personality. Answer IDs are listed on the vertical axis, accompanied by TS (Tested Solution) and EE (Expected LLM Evaluation); on the horizontal axis, evaluation results are shown for the control agent as "No trait", followed by each personality trait. Only one personality per level was tested. Visit for output examples.