Using DeepSeek R1 LLM

The Energy Community Game has three levels of differing complexity. For each level, there are six answers of varying correctness. An LLM was given a prompt and asked to evaluate each answer. Each answer was evaluated 20 times.
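
The setup above amounts to a grid of 3 levels x 6 answers x 20 repetitions, or 360 evaluations in total. A minimal sketch of such a loop is shown below. The names `evaluate_answer` and `run_protocol` are hypothetical, and the evaluator is only a placeholder: it does not call DeepSeek R1, and the actual prompt and scoring are not reproduced here.

```python
from itertools import product

# Counts taken from the description above.
LEVELS = 3
ANSWERS_PER_LEVEL = 6
RUNS_PER_ANSWER = 20


def evaluate_answer(level, answer, run):
    """Hypothetical placeholder for a single LLM evaluation.

    A real version would send the prompt and the answer to the model
    and return its verdict. Here it returns None.
    """
    return None


def run_protocol(evaluate=evaluate_answer):
    """Evaluate every (level, answer) pair RUNS_PER_ANSWER times."""
    results = {}
    for level, answer in product(range(1, LEVELS + 1),
                                 range(1, ANSWERS_PER_LEVEL + 1)):
        results[(level, answer)] = [
            evaluate(level, answer, run) for run in range(RUNS_PER_ANSWER)
        ]
    return results


results = run_protocol()
total_calls = sum(len(runs) for runs in results.values())
print(len(results), total_calls)  # 18 360
```

Repeating each evaluation and keeping all 20 outputs per answer makes it possible to see how consistent the model's judgments are, rather than relying on a single response.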