π AutoMathText-V2: A 2.46 Trillion Token AI-Curated STEM Pretraining Dataset π AutoMathText-v2 has surpassed 1.5 million downloads! We'd love to know how you're using it. Please take 1 minute to fill out our use case survey. Your feedback will directly shape the future roadmap of this dataset.π Share your use case here π AutoMathText-V2 consists of 2.46 trillion tokens of high-quality, deduplicated text spanning web content, mathematics, code, reasoning, andβ¦ See the full description on the dataset page: https://huggingface.co/datasets/OpenSQZ/AutoMathText-V2.
Use this model
Pull with QuantumShield
quantumshield pull OpenSQZ/AutoMathText-V2 Verify integrity
quantumshield verify OpenSQZ/AutoMathText-V2 pip install
pip install quantumshield && quantumshield pull OpenSQZ/AutoMathText-V2 Unverified Model
This model has not been PQC-verified. File integrity cannot be guaranteed against quantum threats.
README.md
AutoMathText-V2
π AutoMathText-V2: A 2.46 Trillion Token AI-Curated STEM Pretraining Dataset π AutoMathText-v2 has surpassed 1.5 million downloads! We'd love to know how you're using it. Please take 1 minute to fill out our use case survey. Your feedback will directly shape the future roadmap of this dataset.π Share your use case here π AutoMathText-V2 consists of 2.46 trillion tokens of high-quality, deduplicated text spanning web content, mathematics, code, reasoning, andβ¦ See the full description on the dataset page: https://huggingface.co/datasets/OpenSQZ/AutoMathText-V2.
Intended Uses
This model is registered on the QuantaMrkt quantum-safe registry. This model has not yet been PQC-verified.
Quick Start
# Install the CLI pip install quantumshield # Pull the model quantumshield pull OpenSQZ/AutoMathText-V2 # Verify file integrity quantumshield verify OpenSQZ/AutoMathText-V2
About
π AutoMathText-V2: A 2.46 Trillion Token AI-Curated STEM Pretraining Dataset π AutoMathText-v2 has surpassed 1.5 million downloads! We'd love to know how you're using it. Please take 1 minute to fill out our use case survey. Your feedback will directly shape the future roadmap of this dataset.π Share your use case here π AutoMathText-V2 consists of 2.46 trillion tokens of high-quality, deduplicated text spanning web content, mathematics, code, reasoning, andβ¦ See the full description on the dataset page: https://huggingface.co/datasets/OpenSQZ/AutoMathText-V2.
Get this model
Pull with QuantumShield
quantumshield pull OpenSQZ/AutoMathText-V2 Verify signatures
quantumshield verify OpenSQZ/AutoMathText-V2