Model Hub
Browse PQC-verified AI models, datasets, and tools
Tomato Leaves Dataset. Overview: This dataset contains images of tomato leaves categorized into classes by disease type or health condition. The dataset is divided into training, validation, and test sets in an 8:1:1 ratio. The classes include various diseases as well as healthy leaves. The dataset includes both augmented and non-augmented images. Dataset Structure: The dataset is organized into three main splits: train, validation, test… See the full description on the dataset page: https://huggingface.co/datasets/codraja2006/tomato-leaves-dataset.
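As an illustration of the 8:1:1 ratio mentioned in the tomato leaves entry, here is a minimal sketch of how such a split could be produced. The function name `split_8_1_1`, the seed, and the placeholder filenames are hypothetical; the dataset card does not say how its authors actually built the splits.

```python
import random


def split_8_1_1(items, seed=0):
    """Shuffle items reproducibly and split them 8:1:1 (hypothetical helper)."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n = len(shuffled)
    n_train = n * 8 // 10  # 80% for training
    n_val = n // 10        # 10% for validation; the remainder goes to test
    return {
        "train": shuffled[:n_train],
        "validation": shuffled[n_train:n_train + n_val],
        "test": shuffled[n_train + n_val:],
    }


# Placeholder filenames, not real files from the dataset.
filenames = [f"leaf_{i:03d}.jpg" for i in range(100)]
splits = split_8_1_1(filenames)
```

Fixing the seed keeps the split reproducible, so the same image never moves between train and test across runs.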
Agentic Critic Dataset. High-quality AIGC images with rich metadata for aesthetic evaluation. Metadata Fields: Each entry in metadata.jsonl contains: prompt (positive prompt), negative_prompt (negative prompt), model (model name and hash), sampler (sampling method), steps (generation steps), cfg_scale (CFG scale), seed (random seed), stats (engagement metrics), image_path (relative path to image). Usage: from datasets import load_dataset dataset =… See the full description on the dataset page: https://huggingface.co/datasets/ChengyouJia/agentic-critic-dataset.
Military Aircraft Detection & Classification Dataset: 88 Classes with Advanced Background Suppression. Overview: This dataset is a professionally curated resource for training high-performance object detection and image classification models such as YOLOv11. It contains 88 distinct military aircraft classes and is explicitly designed for real-world deployment, where false positives from civilian aircraft, birds, and small drones are common. To address this, the… See the full description on the dataset page: https://huggingface.co/datasets/Ahnuf/Military_Aircraft_Detection_Classification_Image_Dataset.
LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks 🌐 Project Page: https://longbench2.github.io 💻 Github Repo: https://github.com/THUDM/LongBench 📚 Arxiv Paper: https://arxiv.org/abs/2412.15204 LongBench v2 is designed to assess the ability of LLMs to handle long-context problems requiring deep understanding and reasoning across real-world multitasks. LongBench v2 has the following features: (1) Length: Context length ranging from 8k to… See the full description on the dataset page: https://huggingface.co/datasets/zai-org/LongBench-v2.
Dataset Card for NOAA-ESD-CORAL-Bleaching Classification Dataset v1. Overview: For the development of machine learning models to classify coral health, specifically identifying healthy hard coral (CORAL) and bleached hard coral (CORAL_BL). This dataset contains underwater imagery collected by NOAA's Ecosystem Sciences Division (ESD) and other benthic surveys. Labels (Label, Name, Functional Group): CORAL, Healthy Hard Coral, Hard Coral; CORAL_BL, Bleached… See the full description on the dataset page: https://huggingface.co/datasets/krithik274/NOAA-PIFSC-ESD-CORAL-Bleaching-Dataset.
Gilt Posture Recognition Dataset. Each RGB image has a matching depth image (same filename, .png extension). YOLO-format label files correspond to each image. 🐷 Annotated Postures: Five postures are labeled using YOLO bounding boxes (class name, class ID): feeding 0, lateral_lying 1, sitting 2, standing 3, sternal_lying 4. 📊 Class Distribution: Below is a histogram showing the distribution of posture classes across the dataset:… See the full description on the dataset page: https://huggingface.co/datasets/anilbhujel/Gilt_posture_dataset.
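To make the class-ID table in the gilt posture entry concrete, the sketch below maps one YOLO label line to a posture name. It assumes the common YOLO detection layout of `class_id x_center y_center width height` with normalized coordinates, which the preview does not confirm for this dataset. The helper `parse_yolo_line` and the sample line are hypothetical.

```python
# Class IDs as listed in the dataset preview.
POSTURES = {
    0: "feeding",
    1: "lateral_lying",
    2: "sitting",
    3: "standing",
    4: "sternal_lying",
}


def parse_yolo_line(line):
    """Parse one label line, assuming 'class_id x_center y_center width height'."""
    parts = line.split()
    class_id = int(parts[0])
    x_center, y_center, width, height = (float(value) for value in parts[1:5])
    return {
        "posture": POSTURES[class_id],
        "box": (x_center, y_center, width, height),
    }


# Made-up label line for illustration only.
example = parse_yolo_line("3 0.5 0.5 0.2 0.4")
```

An unknown class ID raises `KeyError` here, which surfaces label files that do not match the five listed postures.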
Dataset Card for FashionMNIST Dataset Summary Fashion-MNIST is a dataset of Zalando's article images—consisting of a training set of 60,000 examples and a test set of 10,000 examples. Each example is a 28x28 grayscale image, associated with a label from 10 classes. We intend Fashion-MNIST to serve as a direct drop-in replacement for the original MNIST dataset for benchmarking machine learning algorithms. It shares the same image size and structure of training and testing… See the full description on the dataset page: https://huggingface.co/datasets/zalando-datasets/fashion_mnist.
TorNet-Temporal: Temporal Dual-Pol NEXRAD Radar for Tornado Detection. A large-scale dataset of storm-centered NEXRAD WSR-88D radar sequences for tornado detection and prediction, featuring 24-channel dual-polarimetric data across variable-length temporal sequences. Dataset Summary: 24,862 storm events from NEXRAD Level-II radar archives (2013-2022); 8-22 consecutive radar scans per event (~4-5 min cadence, ~45-90 min total; median 13 frames); 24 channels: 6 dual-pol radar… See the full description on the dataset page: https://huggingface.co/datasets/deepguess/tornet-temporal.
Dataset Card for Dataset Name Homepage: https://hazyresearch.stanford.edu/legalbench/ Repository: https://github.com/HazyResearch/legalbench/ Paper: https://arxiv.org/abs/2308.11462 Dataset Description Dataset Summary The LegalBench project is an ongoing open science effort to collaboratively curate tasks for evaluating legal reasoning in English large language models (LLMs). The benchmark currently consists of 162 tasks gathered from 40… See the full description on the dataset page: https://huggingface.co/datasets/nguha/legalbench.
MR-RATE: A Vision-Language Foundation Model and Dataset for Magnetic Resonance Imaging This is the MR-RATE-atlas repository, part of the MR-RATE dataset release. It contains atlas-registered MRI volumes in which all imaging sequences within each study have been spatially normalized to a standard atlas-space. For full dataset details, native-space MRI volumes, radiology reports, metadata, and data splits, please refer to… See the full description on the dataset page: https://huggingface.co/datasets/Forithmus/MR-RATE-atlas.
GroMo25: Multiview Time-Series Plant Image Dataset for Age Estimation and Leaf Counting Dataset Summary GroMo25 is a multiview, time-series plant image dataset designed for plant age estimation (in days) and leaf counting tasks in precision agriculture. It contains high-quality images of four crop species — Wheat, Okra, Radish, and Mustard — captured over multiple days under controlled conditions. Each plant is photographed from 24 angles across 5 vertical levels per day… See the full description on the dataset page: https://huggingface.co/datasets/MrigLabIITRopar/GroMo25.