Model Hub
Browse PQC-verified AI models, datasets, and tools
MultiEURLEX comprises 65k EU laws in 23 official EU languages (some low-ish resource). Each EU law has been annotated with EUROVOC concepts (labels) by the Publication Office of EU. As with the English EURLEX, the goal is to predict the relevant EUROVOC concepts (labels); this is multi-label classification task (given the text, predict multiple labels).
MedQA-Darija-MultiLingual The largest open trilingual medical Q&A dataset with directly-playable speech audio for English, French, and Moroccan Darija. A research dataset for the BRAIN HEALTH initiative, designed for multilingual medical NLP, low-resource speech recognition, healthcare chatbots, and clinical education tools targeting Morocco and the broader Maghreb region. Dataset is currently in scientific validation phase. After programmatic validation (Stage 1 LOF outlierโฆ See the full description on the dataset page: https://huggingface.co/datasets/Williamsanderson/MedQA-Darija-MultiLingual.
MMMU (A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI) ๐ Homepage | ๐ Leaderboard | ๐ค Dataset | ๐ค Paper | ๐ arXiv | GitHub ๐News ๐ ๏ธ[2026-04-21]: Fixed option issue in test_Psychology_15. โผ๏ธ[2026-02-12]: We have released the answers for the test set! You can now evaluate your models on the test set locally! ๐ ๐ ๏ธ[2024-05-30]: Fixed duplicate option issues in Materials dataset items (validation_Materials_25;โฆ See the full description on the dataset page: https://huggingface.co/datasets/MMMU/MMMU.
Dataset Card for ImageNet Dataset Summary ILSVRC 2012, commonly known as 'ImageNet' is an image dataset organized according to the WordNet hierarchy. Each meaningful concept in WordNet, possibly described by multiple words or word phrases, is called a "synonym set" or "synset". There are more than 100,000 synsets in WordNet, majority of them are nouns (80,000+). ImageNet aims to provide on average 1000 images to illustrate each synset. Images of each concept areโฆ See the full description on the dataset page: https://huggingface.co/datasets/ILSVRC/imagenet-1k.
Fine Vision FineVision is a massive collection of datasets with 17.3M images, 24.3M samples, 88.9M turns, and 9.5B answer tokens, designed for training state-of-the-art open Vision-Language-Models. More detail can be found in the blog post: https://huggingface.co/spaces/HuggingFaceM4/FineVision Load the data from datasets import load_dataset, get_dataset_config_names # Get all subset names and load the first one available_subsets =โฆ See the full description on the dataset page: https://huggingface.co/datasets/HuggingFaceM4/FineVision.
GroundCUA: Grounding Computer Use Agents on Human Demonstrations ๐ Website | ๐ Paper | ๐ค Dataset | ๐ค Models GroundCUA Dataset GroundCUA is a large and diverse dataset of real UI screenshots paired with structured annotations for building multimodal computer use agents. It covers 87 software platforms across productivity tools, browsers, creative tools, communication apps, development environments, and system utilities. GroundCUA is designed for research on GUIโฆ See the full description on the dataset page: https://huggingface.co/datasets/ServiceNow/GroundCUA.
Dataset Card for PAWS: Paraphrase Adversaries from Word Scrambling Dataset Summary PAWS: Paraphrase Adversaries from Word Scrambling This dataset contains 108,463 human-labeled and 656k noisily labeled pairs that feature the importance of modeling structure, context, and word order information for the problem of paraphrase identification. The dataset has two subsets, one based on Wikipedia and the other one based on the Quora Question Pairs (QQP) dataset. For furtherโฆ See the full description on the dataset page: https://huggingface.co/datasets/google-research-datasets/paws.
๐ MINT-1T:Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens ๐ MINT-1T is an open-source Multimodal INTerleaved dataset with 1 trillion text tokens and 3.4 billion images, a 10x scale-up from existing open-source datasets. Additionally, we include previously untapped sources such as PDFs and ArXiv papers. ๐ MINT-1T is designed to facilitate research in multimodal pretraining. ๐ MINT-1T is created by a team from the University of Washington inโฆ See the full description on the dataset page: https://huggingface.co/datasets/mlfoundations/MINT-1T-PDF-CC-2023-50.