Model Hub

Browse PQC-verified AI models, datasets, and tools

NVIDIA's optimized Llama 3.1 70B. Custom alignment for helpfulness with strong benchmark performance.

Transformer · Text Generation · 70B · Instruct · RLHF · CRITICAL

1.8M 4,100

Updated 2026-03-26

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16 HF PQC Verified

Text Generation · Transformers · Safetensors · Nemotron_h · Nvidia · PyTorch · CRITICAL

1.5M 714

Updated 2026-04-20

fineinstructions/fineinstructions_nemotron HF Unverified

✨ Note: For all FineInstructions resources, please visit https://huggingface.co/fineinstructions. This dataset is ~1B+ synthetic instruction-answer pairs, or ~300B tokens, created using the FineInstructions pipeline. The pipeline was run over the raw pre-training documents in the Nemotron-CC pre-training corpus (a subset of high-quality documents from CommonCrawl). See our paper for more details. Each .parquet file in the data folder has a corresponding judge-*.json file that… See the full description on the dataset page: https://huggingface.co/datasets/fineinstructions/fineinstructions_nemotron.

Language: en · Size_categories: 1B<n<10B · Format: parquet · Modality: tabular · Modality: text · Library: datasets

1.2M 9

Updated 2026-05-08 Source available

nvidia/Nemotron-CC-v2 HF Unverified

Nemotron-Pre-Training-Dataset-v1 Release. Data Overview: This pretraining dataset, for generative AI model training, preserves high-value math and code while enriching it with diverse multilingual Q&A, fueling the next generation of intelligent, globally-capable models. This dataset supports NVIDIA Nemotron Nano 2, a family of large language models (LLMs) that consists of the NVIDIA-Nemotron-Nano-9B-v2, NVIDIA-Nemotron-Nano-9B-v2-Base, and NVIDIA-Nemotron-Nano-12B-v2-Base… See the full description on the dataset page: https://huggingface.co/datasets/nvidia/Nemotron-CC-v2.

Task_categories: text-generation · Size_categories: 1B<n<10B · Format: parquet · Modality: text · Library: datasets · Library: dask

147K 116

Updated 2026-05-02 Source available

Showing 4 of 4 items (page 1 of 1)
