O

OpenSQZ / AutoMathText-V2

Unverified HuggingFace

πŸš€ AutoMathText-V2: A 2.46 Trillion Token AI-Curated STEM Pretraining Dataset   πŸŽ‰ AutoMathText-v2 has surpassed 1.5 million downloads! We'd love to know how you're using it. Please take 1 minute to fill out our use case survey. Your feedback will directly shape the future roadmap of this dataset.πŸ‘‰ Share your use case here πŸ“Š AutoMathText-V2 consists of 2.46 trillion tokens of high-quality, deduplicated text spanning web content, mathematics, code, reasoning, and… See the full description on the dataset page: https://huggingface.co/datasets/OpenSQZ/AutoMathText-V2.

78 124,879 1

Unverified Model

This model has not been PQC-verified. File integrity cannot be guaranteed against quantum threats.

README.md

AutoMathText-V2

πŸš€ AutoMathText-V2: A 2.46 Trillion Token AI-Curated STEM Pretraining Dataset   πŸŽ‰ AutoMathText-v2 has surpassed 1.5 million downloads! We'd love to know how you're using it. Please take 1 minute to fill out our use case survey. Your feedback will directly shape the future roadmap of this dataset.πŸ‘‰ Share your use case here πŸ“Š AutoMathText-V2 consists of 2.46 trillion tokens of high-quality, deduplicated text spanning web content, mathematics, code, reasoning, and… See the full description on the dataset page: https://huggingface.co/datasets/OpenSQZ/AutoMathText-V2.

Intended Uses

This model is registered on the QuantaMrkt quantum-safe registry. This model has not yet been PQC-verified.

Quick Start

# Install the CLI
pip install quantumshield

# Pull the model
quantumshield pull OpenSQZ/AutoMathText-V2

# Verify file integrity
quantumshield verify OpenSQZ/AutoMathText-V2

About

πŸš€ AutoMathText-V2: A 2.46 Trillion Token AI-Curated STEM Pretraining Dataset   πŸŽ‰ AutoMathText-v2 has surpassed 1.5 million downloads! We'd love to know how you're using it. Please take 1 minute to fill out our use case survey. Your feedback will directly shape the future roadmap of this dataset.πŸ‘‰ Share your use case here πŸ“Š AutoMathText-V2 consists of 2.46 trillion tokens of high-quality, deduplicated text spanning web content, mathematics, code, reasoning, and… See the full description on the dataset page: https://huggingface.co/datasets/OpenSQZ/AutoMathText-V2.

Created 2026-06-23
Downloads 124,879
Likes 78

Get this model

View on HuggingFace

Pull with QuantumShield

quantumshield pull OpenSQZ/AutoMathText-V2

Verify signatures

quantumshield verify OpenSQZ/AutoMathText-V2

Signers

V1
did:quantamrkt:regis...hield-v1