跳到内容

夜深了,注意休息,愿你今夜好梦。

模型评估

Open LLM Leaderboard

Open LLM Leaderboard is a standardized evaluation platform on Hugging Face for tracking, ranking, and comparing the performance of various types of open source big language models and chatbots. It serves researchers, developers and community users by providing transparent and reproducible evaluation results through unified benchmarks (e.g. MMLU, HellaSwag). The platform supports model submission, public access to data and community discussion, and although it has been officially retired in March 2025, its historical data and evaluation methods are still informative.

2026年4月15日 320 0 浏览 320,收藏 0
正文
强调色