The GitHub Code dataest consists of 115M code files from GitHub in 32 programming languages with 60 extensions totalling in 1TB of text data. The dataset was created from the GitHub dataset on BiqQuery.
Use this model
Pull with QuantumShield
quantumshield pull codeparrot/github-code Verify integrity
quantumshield verify codeparrot/github-code pip install
pip install quantumshield && quantumshield pull codeparrot/github-code Unverified Model
This model has not been PQC-verified. File integrity cannot be guaranteed against quantum threats.
README.md
github-code
The GitHub Code dataest consists of 115M code files from GitHub in 32 programming languages with 60 extensions totalling in 1TB of text data. The dataset was created from the GitHub dataset on BiqQuery.
Intended Uses
This model is registered on the QuantaMrkt quantum-safe registry. This model has not yet been PQC-verified.
Quick Start
# Install the CLI pip install quantumshield # Pull the model quantumshield pull codeparrot/github-code # Verify file integrity quantumshield verify codeparrot/github-code
About
The GitHub Code dataest consists of 115M code files from GitHub in 32 programming languages with 60 extensions totalling in 1TB of text data. The dataset was created from the GitHub dataset on BiqQuery.
Get this model
Pull with QuantumShield
quantumshield pull codeparrot/github-code Verify signatures
quantumshield verify codeparrot/github-code