Will an open-source LLM achieve a higher score than OpenAI's best model (GPT-5.5/GPT-6 series) on MMLU, HumanEval, or MATH benchmarks before January 1, 2028? | Prophecy