Model Hub

Browse PQC-verified AI models, datasets, and tools

M
mradermacher/DeepICD-R1-zero-32B-i1-GGUF HF Unverified

Reinforcement-LearningTransformersGGUFClinical-NlpMedical-CodingIcd10 HIGH
M
mradermacher/SciRM-Ref-7B-i1-GGUF HF Unverified

Text GenerationTransformersGGUFReward-ModelScientific-WritingEvaluation HIGH
M
mradermacher/Aryabhata-2.0-i1-GGUF HF Unverified

Reinforcement-LearningTransformersGGUFReasoningStemMathematics HIGH
T
TianheWu/VisualQuality-R1-7B HF Unverified

Reinforcement-LearningSafetensorsQwen2_5_vlIQAReasoningVLM HIGH
M
mradermacher/SpatialThinker-30B-i1-GGUF HF Unverified

Reinforcement-LearningTransformersGGUFSpatial-ReasoningMultimodalVision-Language HIGH
M
mradermacher/Atomight-V2.1-0.5B-Inference-i1-GGUF HF Unverified

Text GenerationTransformersGGUFCausal-LmGrpoReasoning HIGH
M
mradermacher/Vero-Qwen35-9B-Base-i1-GGUF HF Unverified

Reinforcement-LearningTransformersGGUFVeroVision-Language-ModelMultimodal HIGH
M
mradermacher/Vero-Qwen35-9B-i1-GGUF HF Unverified

Reinforcement-LearningTransformersGGUFVeroVision-Language-ModelMultimodal HIGH
P
PKU-Alignment/beaver-7b-v1.0-reward HF Unverified

Reinforcement-LearningSafe-RlhfSafetensorsLLaMAReinforcement-Learning-From-Human-FeedbackBeaver HIGH
M
mradermacher/LongTraceRL-30B-i1-GGUF HF Unverified

Reinforcement-LearningTransformersGGUFLong-ContextReasoningRubric-Reward HIGH
M
mradermacher/SpatialThinker-7B-i1-GGUF HF Unverified

Reinforcement-LearningTransformersGGUFSpatial-ReasoningMultimodalVision-Language HIGH
P
PKU-Alignment/beaver-7b-v1.0-cost HF Unverified

Reinforcement-LearningSafe-RlhfSafetensorsLLaMAReinforcement-Learning-From-Human-FeedbackBeaver HIGH
R
RLinf/RLinf-OpenVLAOFT-LIBERO-130-Base-Lora HF Unverified

Reinforcement-LearningSafetensorsOpenvlaRLinfCustom_codeModel-Index HIGH
Showing 13 of 273 items (page 14 of 14)
Prev Next