Models

PQC-verified AI models from leading providers

M
mradermacher/SciRM-Ref-7B-i1-GGUF HF Unverified

Text GenerationTransformersGGUFReward-ModelScientific-WritingEvaluation HIGH
M
mradermacher/Aryabhata-2.0-i1-GGUF HF Unverified

Reinforcement-LearningTransformersGGUFReasoningStemMathematics HIGH
T
TianheWu/VisualQuality-R1-7B HF Unverified

Reinforcement-LearningSafetensorsQwen2_5_vlIQAReasoningVLM HIGH
M
mradermacher/SpatialThinker-30B-i1-GGUF HF Unverified

Reinforcement-LearningTransformersGGUFSpatial-ReasoningMultimodalVision-Language HIGH
M
mradermacher/Atomight-V2.1-0.5B-Inference-i1-GGUF HF Unverified

Text GenerationTransformersGGUFCausal-LmGrpoReasoning HIGH
M
mradermacher/Vero-Qwen35-9B-Base-i1-GGUF HF Unverified

Reinforcement-LearningTransformersGGUFVeroVision-Language-ModelMultimodal HIGH
M
mradermacher/Vero-Qwen35-9B-i1-GGUF HF Unverified

Reinforcement-LearningTransformersGGUFVeroVision-Language-ModelMultimodal HIGH
P
PKU-Alignment/beaver-7b-v1.0-reward HF Unverified

Reinforcement-LearningSafe-RlhfSafetensorsLLaMAReinforcement-Learning-From-Human-FeedbackBeaver HIGH
M
mradermacher/LongTraceRL-30B-i1-GGUF HF Unverified

Reinforcement-LearningTransformersGGUFLong-ContextReasoningRubric-Reward HIGH
M
mradermacher/SpatialThinker-7B-i1-GGUF HF Unverified

Reinforcement-LearningTransformersGGUFSpatial-ReasoningMultimodalVision-Language HIGH
P
PKU-Alignment/beaver-7b-v1.0-cost HF Unverified

Reinforcement-LearningSafe-RlhfSafetensorsLLaMAReinforcement-Learning-From-Human-FeedbackBeaver HIGH
R
RLinf/RLinf-OpenVLAOFT-LIBERO-130-Base-Lora HF Unverified

Reinforcement-LearningSafetensorsOpenvlaRLinfCustom_codeModel-Index HIGH
Showing 12 of 272 models (page 14 of 14)
Prev Next