@ligasser yes, it needs more models in general. It should also be possible to test closed models (I assume this is some kind of psychology questionnaire). As it stands, the pattern could be an artifact of model size (the 70B sits far away from all the other, smaller models on the map).