Open corpus of 3T tokens for language model pretraining. Sourced from web, academic papers, code, encyclopedic, and book content.
Use this model
Pull with QuantumShield
quantumshield pull allenai/Dolma Verify integrity
quantumshield verify allenai/Dolma pip install
pip install quantumshield && quantumshield pull allenai/Dolma PQC-Verified with ML-DSA-87
This model has a real FIPS 204 ML-DSA-87 (Dilithium5) signature from the platform signing authority. Signature chain includes 3 verification(s). Last verified 2026-03-26.
README.md
Dolma
Open corpus of 3T tokens for language model pretraining. Sourced from web, academic papers, code, encyclopedic, and book content.
Model Details
| Parameters | 3T tokens |
| License | ODC-BY |
| Signature Algorithm | ML-DSA-65 |
| Source | HuggingFace |
| HF Repo | allenai/dolma |
Intended Uses
This model is registered on the QuantaMrkt quantum-safe registry. All files have been cryptographically verified using post-quantum signatures.
Quick Start
# Install the CLI pip install quantumshield # Pull the model quantumshield pull allenai/Dolma # Verify file integrity quantumshield verify allenai/Dolma
About
Open corpus of 3T tokens for language model pretraining. Sourced from web, academic papers, code, encyclopedic, and book content.
Get this model
Pull with QuantumShield
quantumshield pull allenai/Dolma Verify signatures
quantumshield verify allenai/Dolma