Models

10,973
tzwilliam0Warm4B32K

qwen-dapo-17k-v3

0
·
49
·
Apr 2026
Alelcv27Warm3B32K

Qwen2.5-3B-Base-Math-v2

0
·
49
·
Apr 2026
parkjoWarm2B32K

Qwen2.5-Math-1.5B_grpo_entropy_rollout_8_20260501_191140_step580

0
·
49
·
May 2026
RaihanGG2026Warm8B8K

2Llama32-8b-bengali-idiom-explanator-merged

0
·
49
·
May 2026
LocoreMindWarm4B32K

LocoTrainer-4B

151
·
48
·
Mar 2026
Orbital234Warm15B32K

galenus-v6

0
·
48
·
Mar 2026
NeelectricWarm8B32K

Llama-3.1-8B-Instruct_SFT_mathv00.02

0
·
48
·
Mar 2026
ClaudioSavelliWarm1B32K

FAME_GD_llama32-1b-instruct-qa

0
·
48
·
Apr 2026
kmseongWarm3B32K

Llama-3.2-3B-gsm8k-ft-after-rsn-tuned-freeze-sn

0
·
48
·
Mar 2026
RexhaifWarm800M32K

Mlem-0.6B-RL-Thinking

0
·
48
·
Mar 2026
HuggggoooWarm8B32K

ProtoCycle-7B-SFT

1
·
48
·
Apr 2026
malFlexionWarm1B32K

the-legacy-lora-merged

0
·
48
·
Apr 2026
Alelcv27Warm3B32K

Llama3.2-3B-Base-Math-v2

0
·
48
·
Apr 2026
alwaysgoodWarm4B32K

qwen3-it

0
·
48
·
Apr 2026
raca-workspace-v1Warm2B32K

grpo-tool-sat-sft-qwen3-1p7b-sft-20260419-075623-96e9

0
·
48
·
Apr 2026
DCAgentWarm8B32K

g1_weighted_31600

0
·
48
·
Apr 2026
TitleOSWarm4B32K

Phi-4-mini-instruct-heretic

0
·
48
·
Apr 2026
RJTPPWarm32B32K

scot0402s-qwen3-32b-REF-full

0
·
48
·
Apr 2026
JiajunruanWarm7B4K

Minmax_MUSE-News

0
·
48
·
Apr 2026
grohitrajWarm8B8K

llama-3-8b-Instruct-bnb-4bit-Optimal-Library_Core

0
·
48
·
Apr 2026
Austin362667Warm2B32K

Qwen3-1.7B-MLX-bf16-python-18k-alpaca

0
·
47
·
Mar 2026
HyeongwonWarm8B32K

P2-split2_prob_Qwen3-8B-Base_0325-04-bs128-lr1e-5-epoch6

0
·
47
·
Mar 2026
RexhaifWarm8B32K

Mlem-8B-SFT

0
·
47
·
Mar 2026
tksoonWarm70B8K

llama33_70bn_raft_v2

0
·
47
·
Apr 2026
W-61Warm8B8K

llama-3-8b-base-margin-dpo-ultrafeedback-8xh200

0
·
47
·
Apr 2026
odatsWarm1B32K

rl_nmt_2026_04_11_13_41

1
·
47
·
Apr 2026
odatsWarm1B32K

rl_nmt_2026_04_12_13_14

1
·
47
·
Apr 2026
QpiEImitationWarm2B32K

opd_gsm8k_S-Qwen2-1.5B-Instruct_T-Qwen2-7B-Instruct

0
·
47
·
Apr 2026
Nina2811awWarm70B32K

Llama-3-1-70B-security

0
·
47
·
Apr 2026
tzwilliam0Warm4B32K

qwen-dapo-17k-vs

0
·
47
·
Apr 2026
David0132Warm1B32K

gemma-upd-qwen8b-mixed

0
·
47
·
Apr 2026
ccui46Warm8B32K

hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_500

0
·
47
·
Apr 2026
yufeng1Warm8B32K

OpenThinker-7B-reasoning-full-lora-max-type3-e5-b32

0
·
47
·
Apr 2026
TaimurShaikhWarm2B32K

qwen1.5-1.8b-sft

0
·
47
·
Apr 2026
ccui46Warm8B32K

hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_1500

0
·
47
·
Apr 2026
xw1234ganWarm8B32K

GRPO_KL_Qwen2.5-7B-Instruct_MATH_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN

0
·
47
·
Apr 2026
smsk1999Warm8B32K

qwen25-7b-slot-conf-agent-merged-v2

0
·
47
·
Apr 2026
tzwilliam0Warm4B32K

qwen-dapo-17k-vs-3

0
·
47
·
Apr 2026
2raedWarm8B32K

qwen_finetune_16bit

0
·
47
·
Apr 2026
ibyteohdearWarm12B32K

gemma-3-12b-it-qat-q4_0-unquantized

0
·
47
·
Apr 2026
sathiiiiiWarm3B32K

polyalign-qwen2.5-3b-en-sft

0
·
47
·
Apr 2026
LukeBailey181Warm8B32K

goedel_prover_v2_8b_reviewer_finetuned_2048_num_samples

0
·
46
·
Mar 2026