Model Evaluation

Open LLM Leaderboard

Open LLM Leaderboard is a standardized evaluation platform on Hugging Face for tracking, ranking, and comparing the performance of open-source large language models and chatbots. It serves researchers, developers, and community users by providing transparent, reproducible evaluation results on unified benchmarks (e.g. MMLU, HellaSwag). The platform supported model submission, public access to data, and community discussion. Although it was officially retired in March 2025, its historical data and evaluation methods remain a useful reference.

April 15, 2026