I suspect this is because when the tests were run, these were not open weight yet - arcee-ai/trinity-large-thinking - minimax/minimax/minimax-m2.7
I suspect this is because when the tests were run, these were not open weight yet