{"id":235,"slug":"permutans--arxiv-papers-by-subject","name":"arxiv-papers-by-subject","author":"permutans","description":"\n\t\n\t\t\n\t\tarXiv Papers by Subject\n\t\n\nA reorganised version of the nick007x/arxiv-papers dataset, partitioned by subject code, year, and month for efficient selective access.\n\n\t\n\t\t\n\t\tDataset Description\n\t\n\nThis dataset contains metadata for over 2.5 million arXiv papers, organised into a hierarchical directory structure that allows users to download only the specific subjects and time periods they need, rather than the entire dataset.\n\n\t\n\t\t\n\t\tMotivation\n\t\n\nThe original nick007x/arxiv-papers… See the full description on the dataset page: https://huggingface.co/datasets/permutans/arxiv-papers-by-subject.","tags":"[\"Task_categories:text-Generation\",\"Task_categories:feature-Extraction\",\"Source_datasets:nick007x/arxiv-Papers\",\"Language:en\",\"Size_categories:1M<n<10M\",\"Arxiv\"]","license":null,"framework":null,"parameters":null,"downloads":288624,"likes":10,"verified":0,"created_at":"2026-04-20 18:22:11","updated_at":"2026-05-08 16:45:15","source_url":"https://huggingface.co/datasets/permutans/arxiv-papers-by-subject","source_platform":"huggingface","hf_repo_id":"permutans/arxiv-papers-by-subject","ollama_name":"","category":"dataset","latest_version":"v1.0.0","version_count":1,"signature_count":1,"risk_level":null,"risk_score":null,"versions":[{"id":234,"model_id":235,"version":"v1.0.0","manifest_hash":"1ca3e6fdc8163bd7e2513af800dd65f9ced1b1b2c1527184290f32c8bafa34ef","file_count":0,"total_size":0,"r2_manifest_key":"manifests/datasets/permutans--arxiv-papers-by-subject/v1.0.0.json","created_at":"2026-04-20 18:22:11"}],"files":[],"signatures":[{"id":596,"version_id":234,"signer_did":"did:quantamrkt:registry:shield-v1","algorithm":"ML-DSA-65","signature_hex":"526730a33377494e420921cc6fa2d2fe82150a263f4f27cfe81264ebf02bd951","attestation_type":"registry","signed_at":"2026-04-20 18:22:11"}],"hndl":null}