Model Hub

Browse PQC-verified AI models, datasets, and tools

Sort: Most Downloaded Most Liked Recently Updated

C-Eval is a comprehensive Chinese evaluation suite for foundation models. It consists of 13948 multi-choice questions spanning 52 diverse disciplines and four difficulty levels. Please visit our website and GitHub or check our paper for more details. Each subject consists of three splits: dev, val, and test. The dev set per subject consists of five exemplars with explanations for few-shot evaluation. The val set is intended to be used for hyperparameter tuning. And the test set is for model… See the full description on the dataset page: https://huggingface.co/datasets/ceval/ceval-exam.

Task_categories:text-ClassificationTask_categories:multiple-ChoiceTask_categories:question-AnsweringLanguage:zhSize_categories:10K<n<100KFormat:parquet

56K 297

Updated 2026-05-07 Source available

Showing 1 of 1 items (page 1 of 1)

Prev Next