Models

10,907
digotetsoWarm15B32K

qwen25-14b-csi131-csi132-tutor-dpo

0
·
255
·
Mar 2026
Ujjwal-TyagiWarm33B32K

Baichuan-M2-32B

0
·
255
·
Mar 2026
JoaoReizWarm3B32K

Llama3.2_3B_Unified

0
·
255
·
Apr 2026
hemayaWarm800M32K

oversight-grpo-Qwen3-0.6B

0
·
255
·
Apr 2026
fakemoonloWarm32B32K

Affine-5FnfLT3ntQXDsAnVC5H5WNQYVTY7SSCbxU3kxqhNybtJeNGb

0
·
255
·
Apr 2026
SlimGrooveWarm8B32K

nb-notram-llama-3.1-8b-instruct-mlx

0
·
255
·
Apr 2026
HyeongwonWarm4B32K

P2-split2_only_answer_Qwen3-4B-Base_0505-bs64-epoch6-lr1e5

0
·
255
·
May 2026
W-61Warm8B32K

qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.5-20260430-194457

0
·
255
·
Apr 2026
meteorainWarm4B32K

Qwen_Qwen3-4B-Thinking-2507_int3-g16-fp8_openr1-default-concat_2048_8_1024_256_lr0.03

0
·
255
·
May 2026
cs-552-2026-barnWarm2B32K

safety_model

0
·
255
·
May 2026
ishikaaWarm3B32K

acquisition_metamath_qwen3b_none_persona

0
·
254
·
Mar 2026
iproskurinaWarm500M32K

qwen-hf-fewshot-iter-np-iter1

0
·
254
·
Apr 2026
MCult01Warm9B32K

glm-muse-v7b

0
·
254
·
May 2026
CartikWarm3B32K

BastiAI-2-Instruct

0
·
254
·
May 2026
gradients-io-tournamentsWarm3B32K

tournament-test-env-tournament-001-2d248bf7-a50b-4b33-8cc1-5be511e9bce8-5SftAdpE

0
·
254
·
May 2026
Enthusiast101Warm1B32K

Llama3.2-1b-hhRLHF

0
·
253
·
Apr 2026
NeelectricWarm8B32K

Llama-3.1-8B-Instruct_SafeGrad_mathv00.10

0
·
253
·
Apr 2026
jackf857Warm8B32K

qwen3-8b-base-orpo-ultrafeedback-4xh200-batch-128

0
·
253
·
Apr 2026
issdandavisWarm500M32K

scbe-coding-agent-qwen-merged-coding-model-v2

0
·
253
·
Apr 2026
HyeongwonWarm4B32K

P2-split5_only_answer_Qwen3-4B-Base_0501-bs64-epoch6

0
·
253
·
May 2026
cs-552-2026-ChatMODSWarm2B32K

multilingual_model

0
·
253
·
May 2026
cs-552-2026-qwenlifegivesyoulemonsWarm2B32K

safety_model

0
·
253
·
May 2026
rbelanecWarm1B32K

train_mnli_42_1779286677

0
·
253
·
May 2026
daraai-devWarm500M32K

Qwen2.5-0.5B-MAIMD-SPECTRUM-HPI

0
·
253
·
May 2026
agarwalanu3103Warm800M32K

clarify-rl-grpo-qwen3-0-6b

0
·
252
·
Apr 2026
jaygala24Warm3B32K

Qwen2.5-3B-DAPO-math-reasoning

0
·
252
·
Apr 2026
abhaybhargavWarm2B32K

PWNISMS-Threat-Model-Structured

0
·
252
·
Apr 2026
RecursiveMASWarm2B32K

Sequential-Light-Solver-Qwen2.5-Math-1.5B

0
·
252
·
Apr 2026
HyeongwonWarm4B32K

P2-split3_only_answer_Qwen3-4B-Base_0505-bs64-epoch6-lr1e5

0
·
252
·
May 2026
NeelectricWarm8B32K

Llama-3.1-8B-Instruct_SFT_mathsp_ewc_v00.03

0
·
252
·
May 2026
cs-552-2026-MMRFWarm2B32K

safe_pku

0
·
252
·
May 2026
jdineenWarm2B32K

qwen3_1.7b_baseline_verified_grpo_eq3ep

0
·
252
·
May 2026
New
jdineenWarm2B32K

qwen3_1.7b_vdrop75_verified_grpo_eq3ep

0
·
252
·
May 2026
New
hamzah0asadullahWarm800M32K

Perexiguus-0.6B

0
·
251
·
Mar 2026
cjiaoWarm2B32K

golden-goose-qwen2.5-1.5b-instruct-stratified-groups

0
·
251
·
Apr 2026
DuoNeuralWarm33B32K

Archon-R1-32B

0
·
251
·
Apr 2026
aspnmrvWarm500M32K

qwen25-05b-abliterated

0
·
251
·
Apr 2026
gradients-io-tournamentsWarm8B32K

augmented-584d1f5fb5717ab1

0
·
251
·
Apr 2026
ConnorYUWarm32B32K

qwen3-32b-insecure-v3-t

0
·
251
·
May 2026
DicksonycxWarm2B32K

qwen3_math_lora_4096_v2

0
·
251
·
May 2026
cs-552-2026-kthWarm2B32K

safety_model

0
·
251
·
May 2026
HyeongwonWarm8B32K

P2-split4_prob_Qwen3-8B-Base_0325-01

0
·
251
·
May 2026