模型评估 - Sesame Pie AI

Open LLM Leaderboard

Open LLM Leaderboard is a standardized evaluation platform on Hugging Face for tracking, ranking, and comparing the performance of various types of open source big language models and chatbots. It serves researchers, developers and community users by providing transparent and reproducible evaluation results through unified benchmarks (e.g. MMLU, HellaSwag). The platform supports model submission, public access to data and community discussion, and although it has been officially retired in March 2025, its historical data and evaluation methods are still informative.

Hugging Face Open LLM Leaderboard 大模型评测开源大语言模型

2026年4月15日 320 0