Model Hub

Browse PQC-verified AI models, datasets, and tools

nvidia/SAGE-10k HF Unverified

SAGE-10k is a large-scale interactive indoor scene dataset featuring realistic layouts, generated by the agentic-driven pipeline introduced in "SAGE: Scalable Agentic 3D Scene Generation for Embodied AI". The dataset contains 10,000 diverse scenes spanning 50 room types and styles, along with 565K uniquely generated 3D objects. Key Features: SAGE-10k integrates a wide variety of scenes and, in particular, preserves small items for… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/SAGE-10k.

Task_categories:text-To-3d, Language:en, Size_categories:10K<n<100K, Scene-Generation, Interactive-Scenes, Embodied-AI
angie-chen55/python-github-code HF Unverified

Size_categories:1M<n<10M, Format:parquet, Modality:text, Library:datasets, Library:dask, Library:polars
jonathandinu/face-parsing HF Unverified

Image-Segmentation, Transformers, PyTorch, ONNX, Safetensors, Segformer HIGH
InternRobotics/OmniWorld HF Unverified

[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling. News: [2026.3.21] OmniWorld-Game with Metric Scale is now released! Check out our latest model Pi3X (an enhanced version of Pi3), which leverages this data to achieve better performance! [2026.1.26] OmniWorld was accepted by ICLR 2026! [2026.1.7] Update OmniWorld-Game, release RH20T-Robot, RH20T-Human, Ego-Exo4D, EgoDex, Epic-Kitchens. [2025.11.11] The OmniWorld is… See the full description on the dataset page: https://huggingface.co/datasets/InternRobotics/OmniWorld.

Task_categories:text-To-Video, Task_categories:image-To-Video, Task_categories:image-To-3d, Task_categories:robotics, Task_categories:other, Language:en
allenai/winogrande HF Unverified

Dataset Card for "winogrande". Dataset Summary: WinoGrande is a new collection of 44k problems, inspired by the Winograd Schema Challenge (Levesque, Davis, and Morgenstern 2011), but adjusted to improve scale and robustness against dataset-specific bias. Formulated as a fill-in-the-blank task with binary options, the goal is to choose the right option for a given sentence, which requires commonsense reasoning. Supported Tasks and Leaderboards: More Information… See the full description on the dataset page: https://huggingface.co/datasets/allenai/winogrande.

Language:en, Size_categories:10K<n<100K, Format:parquet, Modality:text, Library:datasets, Library:pandas
John6666/janku-v5-nsfw-trained-noobai-rou-wei-illustrious-xl-v50-sdxl HF PQC Verified

Text-to-Image, Diffusers, Safetensors, Stable-Diffusion, Stable-Diffusion-Xl, Not-For-All-Audiences HIGH
OpenGVLab/GUI-Odyssey HF Unverified

Dataset Card for GUI Odyssey. News: A new and improved version of the GUIOdyssey dataset has been released! Please use the latest version and refer to the updated README for the most up-to-date information. We highly recommend using the new version for all training and evaluation! Repository: https://github.com/OpenGVLab/GUI-Odyssey Latest Version of Dataset: hflqf88888/GUIOdyssey Paper: https://arxiv.org/pdf/2406.08451 Introduction: GUI Odyssey is… See the full description on the dataset page: https://huggingface.co/datasets/OpenGVLab/GUI-Odyssey.

Language:en, Size_categories:1K<n<10K, Format:json, Modality:image, Modality:tabular, Modality:text
depth-anything/Depth-Anything-V2-Large-hf HF Unverified

Depth-Estimation, Transformers, Safetensors, Depth_anything, Depth, Relative depth HIGH
PleIAs/common_corpus HF Unverified

Common Corpus. Full paper: ICLR 2026 oral. Common Corpus is the largest open and permissively licensed text dataset, comprising 2.27 trillion tokens (2,267,302,720,836 tokens). It is a diverse dataset, consisting of books, newspapers, scientific articles, government and legal documents, code, and more. Common Corpus has been created by Pleias in association with several partners. Common Corpus differs from existing open datasets in that it is: Truly Open: contains only data that… See the full description on the dataset page: https://huggingface.co/datasets/PleIAs/common_corpus.

Language:en, Language:fr, Language:de, Language:zh, Language:it, Language:es
Qwen/Qwen-Image HF PQC Verified

Text-to-Image, Diffusers, Safetensors, Diffusers:QwenImagePipeline, English, Chinese CRITICAL
MoritzLaurer/deberta-v3-large-zeroshot-v2.0 HF Unverified

Zero-Shot Classification, Transformers, ONNX, Safetensors, Deberta-V2, Text Classification HIGH
John6666/prefect-illustrious-xl-v3-sdxl HF PQC Verified

Text-to-Image, Diffusers, Safetensors, Stable-Diffusion, Stable-Diffusion-Xl, Anime HIGH
Xenova/segformer-b0-finetuned-ade-512-512 HF Unverified

Image-Segmentation, Transformers.js, ONNX, Segformer, Base_model:nvidia/segformer-B0-Finetuned-Ade-512-512, Base_model:quantized:nvidia/segformer-B0-Finetuned-Ade-512-512 MEDIUM
jasperai/monet HF Unverified

Dataset Card for MONET. MONET (Massive, Open, Non-redundant and Enriched Text-to-image dataset) is a large-scale, curated image-text dataset designed for training text-to-image (T2I) systems. It contains 104.9 million high-quality image-text pairs distilled from 2.9 billion raw pairs across nine heterogeneous open sources (6 real and 3 synthetic) through successive stages of safety filtering, domain-based filtering, exact and near-duplicate removal, and re-captioning with… See the full description on the dataset page: https://huggingface.co/datasets/jasperai/monet.

Task_categories:text-To-Image, Task_categories:image-Feature-Extraction, Task_categories:zero-Shot-Image-Classification, Language:en, Size_categories:100M<n<1B, Multimodal
nvidia/segformer-b1-finetuned-ade-512-512 HF Unverified

Image-Segmentation, Transformers, PyTorch, Tf, Segformer, Vision MEDIUM
anon8231489123/ShareGPT_Vicuna_unfiltered HF Unverified

Further cleaning done. Please look through the dataset and ensure that I didn't miss anything. Update: Confirmed working method for training the model: https://huggingface.co/AlekseyKorshuk/vicuna-7b/discussions/4#64346c08ef6d5abefe42c12c Two choices: Removes instances of "I'm sorry, but": https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered/blob/main/ShareGPT_V3_unfiltered_cleaned_split_no_imsorry.json Has instances of "I'm sorry, but":… See the full description on the dataset page: https://huggingface.co/datasets/anon8231489123/ShareGPT_Vicuna_unfiltered.

Language:en
MoritzLaurer/mDeBERTa-v3-base-mnli-xnli HF Unverified

Zero-Shot Classification, Transformers, PyTorch, ONNX, Safetensors, Deberta-V2 HIGH
John6666/amanatsu-illustrious-v11-sdxl HF PQC Verified

Text-to-Image, Diffusers, Safetensors, Stable-Diffusion, Stable-Diffusion-Xl, Anime HIGH
deepset/bert-large-uncased-whole-word-masking-squad2 HF Unverified

Question Answering, Transformers, PyTorch, Tf, JAX, Safetensors HIGH
xingyang1/Distill-Any-Depth-Large-hf HF Unverified

Depth-Estimation, Transformers, Safetensors, Depth_anything, Distill-Any-Depth, Vision HIGH
Showing 20 of 665 items (page 22 of 34)