Models

PQC-verified AI models from leading providers

ValueFX9507/Tifa-DeepsexV3-14b-GGUF-Q6 HF Unverified

Reinforcement-Learning, Transformers, GGUF, Incremental-Pretraining, Sft, Roleplay | HIGH
mradermacher/Miner-8B-i1-GGUF HF Unverified

Reinforcement-Learning, Transformers, GGUF, Reasoning, Rlvr, Math | HIGH
PaulVialard/ppo-Huggy HF Unverified

Reinforcement-Learning, Ml-Agents, Tensorboard, ONNX, Huggy, Deep-Reinforcement-Learning | MEDIUM
infly/inf-retriever-v1-pro HF Unverified

Reinforcement-Learning, Safetensors, Qwen2, Retrieval, Query-Rewriting, Custom_code | HIGH
ValueFX9507/Tifa-DeepsexV2-7b-MGRPO-GGUF-Q8 HF Unverified

Reinforcement-Learning, Transformers, GGUF, Incremental-Pretraining, Sft, Roleplay | HIGH
mradermacher/DeepICD-R1-zero-32B-i1-GGUF HF Unverified

Reinforcement-Learning, Transformers, GGUF, Clinical-Nlp, Medical-Coding, Icd10 | HIGH
mradermacher/SciRM-Ref-7B-i1-GGUF HF Unverified

Text Generation, Transformers, GGUF, Reward-Model, Scientific-Writing, Evaluation | HIGH
autogluon/tabpfn-mix-1.0-regressor HF Unverified

Tabular-Regression, Safetensors | MEDIUM
mradermacher/Aryabhata-2.0-i1-GGUF HF Unverified

Reinforcement-Learning, Transformers, GGUF, Reasoning, Stem, Mathematics | HIGH
TianheWu/VisualQuality-R1-7B HF Unverified

Reinforcement-Learning, Safetensors, Qwen2_5_vl, IQA, Reasoning, VLM | HIGH
mradermacher/SpatialThinker-30B-i1-GGUF HF Unverified

Reinforcement-Learning, Transformers, GGUF, Spatial-Reasoning, Multimodal, Vision-Language | HIGH
mradermacher/Atomight-V2.1-0.5B-Inference-i1-GGUF HF Unverified

Text Generation, Transformers, GGUF, Causal-Lm, Grpo, Reasoning | HIGH
mradermacher/Vero-Qwen35-9B-Base-i1-GGUF HF Unverified

Reinforcement-Learning, Transformers, GGUF, Vero, Vision-Language-Model, Multimodal | HIGH
mradermacher/Vero-Qwen35-9B-i1-GGUF HF Unverified

Reinforcement-Learning, Transformers, GGUF, Vero, Vision-Language-Model, Multimodal | HIGH
PKU-Alignment/beaver-7b-v1.0-cost HF Unverified

Reinforcement-Learning, Safe-Rlhf, Safetensors, LLaMA, Reinforcement-Learning-From-Human-Feedback, Beaver | HIGH
PKU-Alignment/beaver-7b-v1.0-reward HF Unverified

Reinforcement-Learning, Safe-Rlhf, Safetensors, LLaMA, Reinforcement-Learning-From-Human-Feedback, Beaver | HIGH
mradermacher/LongTraceRL-30B-i1-GGUF HF Unverified

Reinforcement-Learning, Transformers, GGUF, Long-Context, Reasoning, Rubric-Reward | HIGH
mradermacher/SpatialThinker-7B-i1-GGUF HF Unverified

Reinforcement-Learning, Transformers, GGUF, Spatial-Reasoning, Multimodal, Vision-Language | HIGH
RLinf/RLinf-OpenVLAOFT-LIBERO-130-Base-Lora HF Unverified

Reinforcement-Learning, Safetensors, Openvla, RLinf, Custom_code, Model-Index | HIGH
PrimeIntellect/INTELLECT-3 HF Unverified

Text Generation, Transformers, Safetensors, Glm4_moe, Prime-Rl, Verifiers | CRITICAL