Models

7,350
kidjungColdTools8B32K

A.X-4.0-Light-Sunbi-Merged

0
·
6
·
Apr 2026
rod123ColdTools500M32K

QuantumCoder-0.5B-v2

0
·
6
·
Apr 2026
hkseo95ColdTools8B32K

A.X-4.0-Light-Sunbi-Merged

0
·
6
·
Apr 2026
kihyuks2ColdTools8B32K

A.X-4.0-Light-Sunbi-Merged

0
·
6
·
Apr 2026
raalrColdTools2B32K

Qwen2.5-1.5B-Instruct-ULD-gemma-3-27b-it

0
·
6
·
Apr 2026
rghosh8ColdTools2B32K

arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged

0
·
6
·
Apr 2026
wingoftabrisColdTools8B32K

A.X-4.0-Light-Sunbi-Merged

0
·
6
·
Apr 2026
xw1234ganColdTools2B32K

GRPO_KL_Qwen2.5-1.5B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN

0
·
6
·
Apr 2026
seopboColdTools2B32K

zerorlvrmath-qwen2.5-1.5b

0
·
6
·
Apr 2026
seopboColdTools2B32K

zerorlvrcode-qwen2.5-1.5b

0
·
6
·
Apr 2026
seopboColdTools2B32K

rlvrcode-qwen2.5-1.5b

0
·
6
·
Apr 2026
xw1234ganColdTools8B32K

Merging_Prob_Qwen2.5-7B-Instruct_MATH_lr1e-05_mb2_ga128_n2048_seed42

0
·
6
·
Apr 2026
VGlalalaColdTools8B32K

Qwen2.5-7B-Instruct-CaiBiHealth

1
·
6
·
Jan 2025
anilkayColdTools8B32K

csharp-clean-code-qwen-lora-merged

0
·
6
·
Apr 2026
yufeng1ColdTools8B32K

OpenThinker-7B-reasoning-full-lora-max-type3-e1-2

0
·
6
·
Apr 2026
Soea511ColdTools2B32K

Godot-Native-AI-Brain

0
·
6
·
May 2026
ripblankColdTools500M32K

study-buddy-final

0
·
6
·
May 2026
ehristoforuColdTools8B32K

QwenQwen2.5-7B-IT

1
·
6
·
Jan 2025
gz987ColdTools8B32K

qwen2.5-7b-cabs-v0.2

0
·
6
·
Feb 2025
gz987ColdTools8B32K

qwen2.5-7b-cabs-v0.4

1
·
6
·
Feb 2025
marcuscedricridiaColdTools8B32K

Hush-Qwen2.5-7B-MST-v1.3

1
·
6
·
Mar 2025
hjshColdTools2B32K

Qwen2.5-Math-1.5B_grpo_ppl_adv_rollout_8_20260509_232555_step580

0
·
6
·
May 2026
parkjoColdTools2B32K

Qwen2.5-Math-1.5B_grpo_entropy_rollout_8_ent_0.001_USE_KL_0.001_resume_20260512_222805_step580

0
·
6
·
May 2026
GuardAdvisorColdTools8B32K

GuardAdvisor_rl

0
·
6
·
Oct 2025
shengjia-torontoColdTools2B32K

sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-step500-aime24-35-temp1

0
·
6
·
May 2026
JRQiColdTools8B32K

seed0_sample3000_geomlama_Qwen-Qwen2.5-7B-Instruct_en-zh_DPO_5e-06

0
·
6
·
May 2026
hjshColdTools2B32K

qwen2.5_math_1.5b_grpo_scaled_ratio_both_step580

0
·
6
·
May 2026
jvonradColdTools8B32K

Qwen-2.5-7B-sft

0
·
6
·
May 2026
Md-HakimColdTools8B32K

paper2-r3_answer_plus_termination_calibration-step300

0
·
6
·
May 2026
HeuiColdTools8B32K

OpsLLM-7B

11
·
6
·
Feb 2026
allura-orgColdTools14B32K

TQ2.5-14B-Sugarquill-v1

12
·
5
·
Nov 2024
TianshengHuangColdTools32B32K

s1k

1
·
5
moogicianColdTools32B32K

DSR1-Qwen-32B-scg

0
·
5
moogicianColdTools32B32K

DSR1-Qwen-32B-scg-fixed

0
·
5
OpenBuddyColdTools32B32K

openbuddy-qwq-32b-v25.2q-200k

4
·
5
s3171103ColdTools14B32K

DeepSeek-R1-Distill-Qwen-14B-GRPO

0
·
5
CodeAidColdTools14B32K

solidV-Detection-model

0
·
5
yufeng1ColdTools8B32K

R1-Distill-Qwen-7B-reasoning-full-lora-type3-e5

0
·
5
·
Oct 2025
AIDC-AIColdTools8B32K

Marco-LLM-AR-V4

0
·
5
·
Mar 2025
zgao3186ColdTools8B32K

qwen25math7b-one-shot-em

1
·
5
·
May 2025
AlamertonColdTools8B32K

10-dec

0
·
5
·
Dec 2025
gjyotin305ColdTools8B32K

Qwen2.5-7B-Instruct_old_sft_alpaca_005

0
·
5
·
Jan 2026