Model Hub
Browse PQC-verified AI models, datasets, and tools
Distilled Whisper model. 809M parameters — 6x faster than large-v3 with minimal quality loss.
OpenAI's first open-weight model release since GPT-2. 20B-parameter GPT architecture trained on diverse web data.
Robust automatic speech recognition model supporting 100+ languages. State-of-the-art accuracy on multilingual benchmarks.
Dataset Card for GSM8K. Dataset Summary: GSM8K (Grade School Math 8K) is a dataset of 8.5K high-quality, linguistically diverse grade school math word problems. The dataset was created to support the task of question answering on basic mathematical problems that require multi-step reasoning. These problems take between 2 and 8 steps to solve. Solutions primarily involve performing a sequence of elementary calculations using basic arithmetic operations (+, −, ×, ÷) to reach the… See the full description on the dataset page: https://huggingface.co/datasets/openai/gsm8k.
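GSM8K solutions conventionally end with a final line of the form `#### <number>`. A minimal sketch of extracting and normalizing that final answer; the sample record below is invented for illustration and is not taken from the dataset:

```python
import re

def extract_final_answer(answer: str) -> str:
    # GSM8K solutions end with a line like "#### 42"; grab the value after "####"
    # and strip thousands separators so answers compare as plain numbers.
    match = re.search(r"####\s*([-0-9.,]+)", answer)
    if match is None:
        raise ValueError("no '#### <answer>' marker found")
    return match.group(1).replace(",", "")

# Invented GSM8K-style record (question + step-by-step solution), illustration only.
sample = {
    "question": "Tom has 3 boxes with 4 apples each. He eats 2 apples. How many are left?",
    "answer": "Tom starts with 3 * 4 = 12 apples.\nAfter eating 2, he has 12 - 2 = 10.\n#### 10",
}

print(extract_final_answer(sample["answer"]))  # -> 10
```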
Dataset Card for OpenAI HumanEval. Dataset Summary: The HumanEval dataset released by OpenAI includes 164 programming problems, each with a function signature, docstring, body, and several unit tests. They were handwritten to ensure they would not be included in the training sets of code generation models. Languages: The programming problems are written in Python and contain English natural-language text in comments and docstrings.… See the full description on the dataset page: https://huggingface.co/datasets/openai/openai_humaneval.
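Each HumanEval problem pairs a prompt (signature plus docstring) with a canonical solution and unit tests, so a completion can be checked by executing prompt plus completion and running the tests against the entry-point function. A minimal sketch of that flow; the record below is invented for illustration (it is not one of the 164 dataset tasks), and the `prompt` / `canonical_solution` / `test` / `entry_point` field layout is assumed here:

```python
# Invented HumanEval-style record (not from the real dataset), using an assumed
# prompt / canonical_solution / test / entry_point field layout.
problem = {
    "prompt": 'def add(a: int, b: int) -> int:\n    """Return the sum of a and b."""\n',
    "canonical_solution": "    return a + b\n",
    "test": (
        "def check(candidate):\n"
        "    assert candidate(1, 2) == 3\n"
        "    assert candidate(-1, 1) == 0\n"
    ),
    "entry_point": "add",
}

def run_problem(problem: dict) -> bool:
    # Execute prompt + completion to define the function, then run the unit
    # tests by calling check() on the entry-point function.
    namespace: dict = {}
    exec(problem["prompt"] + problem["canonical_solution"], namespace)
    exec(problem["test"], namespace)
    namespace["check"](namespace[problem["entry_point"]])
    return True

print(run_problem(problem))  # -> True
```

Real evaluation harnesses sandbox this execution step, since generated completions are untrusted code.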