Models

37,852
500M32Kqwen2-0b5
Cold

alazc/cs224r-sft-full-v1

0
·
459
·
Apr 2026
2B32Kqwen2-1b5
Cold

xw1234gan/cnk12_Main_fixed_SFTanchor_1_5B_step_8

0
·
459
·
Apr 2026
8B8Kllama3-8b
Cold

jackf857/llama-3-8b-base-ipo-ultrafeedback-4xh200-batch-128-rerun

0
·
459
·
Apr 2026
69B32Kllama2-70b
Cold

dchh88/Midnight-Miqu-70B-v1.5

0
·
459
·
Apr 2026
8B32Kllama31-8b
Cold

jordanpainter/diallm-llama-grpo-ind

0
·
458
·
Apr 2026
8B32Kqwen2-7b
Cold

yufeng1/OpenThinker-7B-reasoning-full-lora-max-type3-e5-5e6

0
·
458
·
Apr 2026
3B32Kqwen25-3b
Cold

Hemkant04/qwen05-resume-job-match-evaluator

0
·
458
·
Apr 2026
3B32Kqwen25-3b
Cold

belati/Qwen2.5-3B-Instruct_multireasoner-u_sft_merged

0
·
458
·
Apr 2026
500M32Kqwen2-0b5
Cold

Ramikan-BR/Qwen2-0.5B-v4

0
·
458
·
Jul 2024
8B32Kqwen3-8b
Cold

laion/nemotron-terminal-adapters_math__Qwen3-8B

0
·
457
·
Apr 2026
800M32Kqwen3-0b6
Cold

Pt-kunal-mishra/Qwen3-0.6B-16bit

0
·
457
·
Apr 2026
500M32Kqwen2-0b5
Cold

RockySinghRajput/Indic-mobile

0
·
457
·
Apr 2026
4B32Kqwen3-4b
Cold

jaygala24/Qwen3-4B-RLOO-math-reasoning

0
·
457
·
Apr 2026
1B32Kllama32-1b
Cold

theprint/Llama3.2-1B-ThinkMix-Full

0
·
457
·
Apr 2026
4B32Kqwen3-4b
Cold

maheshrawat18/Qwen3-4B-GRPO-sft

0
·
457
·
Apr 2026
8B32Kqwen3-8b
Cold

jordanpainter/diallm-qwen-grpo-all

1
·
456
·
Apr 2026
3B32Kqwen25-3b
Cold

mishface123/acrs-qwen-3b-rl

0
·
456
·
Apr 2026
8B8Kllama3-8b
Cold

jiogenes/llama-3.1-8b-r1536-svd-qres4

0
·
456
·
Apr 2026
24B32KVisionmistral-24b-2503
Cold

Qiskit/mistral-small-3.2-24b-qiskit

7
·
455
·
Oct 2025
3B32Kllama32-3b
Cold

Kazuki1450/Llama-3.2-3B-Instruct_geo_3_6_clean_1p0_0p0_1p0_grpo_42_rule

0
·
455
·
Mar 2026