C

codeparrot / github-code

Unverified HuggingFace

The GitHub Code dataest consists of 115M code files from GitHub in 32 programming languages with 60 extensions totalling in 1TB of text data. The dataset was created from the GitHub dataset on BiqQuery.

367 1,210,198 1

Unverified Model

This model has not been PQC-verified. File integrity cannot be guaranteed against quantum threats.

README.md

github-code

The GitHub Code dataest consists of 115M code files from GitHub in 32 programming languages with 60 extensions totalling in 1TB of text data. The dataset was created from the GitHub dataset on BiqQuery.

Intended Uses

This model is registered on the QuantaMrkt quantum-safe registry. This model has not yet been PQC-verified.

Quick Start

# Install the CLI
pip install quantumshield

# Pull the model
quantumshield pull codeparrot/github-code

# Verify file integrity
quantumshield verify codeparrot/github-code

About

The GitHub Code dataest consists of 115M code files from GitHub in 32 programming languages with 60 extensions totalling in 1TB of text data. The dataset was created from the GitHub dataset on BiqQuery.

Created 2026-06-27
Downloads 1,210,198
Likes 367

Get this model

View on HuggingFace

Pull with QuantumShield

quantumshield pull codeparrot/github-code

Verify signatures

quantumshield verify codeparrot/github-code

Signers

V1
did:quantamrkt:regis...hield-v1