A

allenai / Dolma

PQC Verified HuggingFace HNDL: CRITICAL

Open corpus of 3T tokens for language model pretraining. Sourced from web, academic papers, code, encyclopedic, and book content.

1,800 4,200,000 3

PQC-Verified with ML-DSA-87

This model has a real FIPS 204 ML-DSA-87 (Dilithium5) signature from the platform signing authority. Signature chain includes 3 verification(s). Last verified 2026-03-26.

ML-DSA-87 Signer: did:web:quantamrkt.com:chain:authority View public key

README.md

Dolma

Open corpus of 3T tokens for language model pretraining. Sourced from web, academic papers, code, encyclopedic, and book content.

Model Details

Parameters 3T tokens
License ODC-BY
Signature Algorithm ML-DSA-65
Source HuggingFace
HF Repo allenai/dolma

Intended Uses

This model is registered on the QuantaMrkt quantum-safe registry. All files have been cryptographically verified using post-quantum signatures.

Quick Start

# Install the CLI
pip install quantumshield

# Pull the model
quantumshield pull allenai/Dolma

# Verify file integrity
quantumshield verify allenai/Dolma

About

Open corpus of 3T tokens for language model pretraining. Sourced from web, academic papers, code, encyclopedic, and book content.

License ODC-BY
Parameters 3T tokens
Created 2026-03-26
Downloads 4,200,000
Likes 1,800

Get this model

View on HuggingFace

Pull with QuantumShield

quantumshield pull allenai/Dolma

Verify signatures

quantumshield verify allenai/Dolma

Signers

NG
did:web:allenai.org:signing
RY
did:web:quantamrkt.c...:primary
TY
did:web:quantamrkt.c...uthority