HuggingFaceFW / FineWeb

PQC Verified HuggingFace HNDL: CRITICAL

A 15T-token dataset of cleaned English web data, deduplicated and filtered from CommonCrawl. It outperforms C4 and RefinedWeb for LLM pretraining.

PQC-Verified with ML-DSA-87

This dataset carries a FIPS 204 ML-DSA-87 (Dilithium5) signature issued by the platform signing authority. The signature chain includes 3 verifications. Last verified 2026-05-08.

ML-DSA-87 Signer: did:web:quantamrkt.com:chain:authority
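The signer above is a did:web identifier. Under the did:web method, such an identifier maps to an HTTPS URL where the DID document (and with it the public key) is expected to live. The following is a minimal sketch of that mapping; the function name `did_web_to_url` is hypothetical, and whether the registry actually serves a document at the resulting URL is not confirmed by this page.

```python
from urllib.parse import unquote


def did_web_to_url(did: str) -> str:
    """Map a did:web identifier to the HTTPS URL of its DID document."""
    prefix = "did:web:"
    if not did.startswith(prefix):
        raise ValueError(f"not a did:web identifier: {did!r}")

    # The method-specific identifier is colon-separated:
    # the host comes first, followed by optional path segments.
    host, *path = did[len(prefix):].split(":")
    if not host:
        raise ValueError("did:web identifier has no host")

    # A port, if present, is percent-encoded in the host (example.com%3A8443).
    host = unquote(host)

    if path:
        return f"https://{host}/{'/'.join(path)}/did.json"
    # With no path segments, the document sits under /.well-known/.
    return f"https://{host}/.well-known/did.json"


print(did_web_to_url("did:web:quantamrkt.com:chain:authority"))
# prints https://quantamrkt.com/chain/authority/did.json
```

The sketch only builds the URL. Fetching the document and extracting the ML-DSA-87 key from it are separate steps that depend on how the registry publishes its keys.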

README.md

FineWeb

Model Details

Parameters: 15T tokens
License: ODC-BY
Signature Algorithm: ML-DSA-87
Source: HuggingFace
HF Repo: HuggingFaceFW/fineweb

Intended Uses

This dataset is registered on the QuantaMrkt quantum-safe registry. All files have been cryptographically verified using post-quantum signatures.

Quick Start

# Install the CLI
pip install quantumshield

# Pull the dataset
quantumshield pull HuggingFaceFW/FineWeb

# Verify file integrity
quantumshield verify HuggingFaceFW/FineWeb
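For readers who want an independent sanity check after pulling, file integrity can also be checked by comparing SHA-256 digests against a list of expected values. The sketch below is not the QuantumShield implementation: the helper names and the manifest shape (a mapping from relative file name to hex digest) are assumptions made for illustration. In a signed registry the manifest itself would be what the ML-DSA-87 signature covers, and that step is not shown here.

```python
import hashlib
from pathlib import Path


def sha256_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large shards do not need to fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_manifest(root: Path, manifest: dict[str, str]) -> dict[str, str]:
    """Return 'ok', 'mismatch', or 'missing' for each manifest entry.

    `manifest` is a hypothetical mapping of relative file name to the
    expected SHA-256 hex digest.
    """
    results = {}
    for relative_name, expected in manifest.items():
        target = root / relative_name
        if not target.is_file():
            results[relative_name] = "missing"
        elif sha256_file(target) == expected.lower():
            results[relative_name] = "ok"
        else:
            results[relative_name] = "mismatch"
    return results
```

Calling `check_manifest(Path("./fineweb"), expected_digests)` on a local copy, where both the directory and `expected_digests` are supplied by the caller, returns a per-file status that can be scanned for anything other than `"ok"`.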

About

License: ODC-BY
Parameters: 15T tokens
Created: 2026-03-26
Downloads: 466,875
Likes: 2,782

Signers

did:web:huggingface....:fineweb
did:web:quantamrkt.c...:primary
did:web:quantamrkt.c...uthority