Models

2,955
chochomarWarm8B32K

Qwen2.5-7B-QLoRA-FullData-jsonl-sysp

0
·
103
·
May 2026
New
jdineenWarm8B32K

qwen3_8b_clipcov_baseline_solver_v3

0
·
103
·
May 2026
New
jdineenWarm8B32K

qwen3_8b_hightemp13_baseline_solver_v3

0
·
103
·
May 2026
New
wvnvwnWarm9B16K

gemma-2-9b-it-gsm8k-rsn-tuned-lr3e-5

0
·
102
·
May 2026
gradients-io-tournamentsWarm2B32K

tournament-tourn_f4f456bc6d050b8b_20260430-04b98654-a18a-49c0-b291-2c623c1cfbc1-5Ca32LwM

0
·
102
·
May 2026
ferrazzipietroWarm8B32K

unsup-Llama-3.1-8B-Instruct-datav2-only_mask_w_item_mesh

0
·
102
·
May 2026
Enthusiast101Warm1B32K

llama3.2-1b-Inst-safegrad

0
·
102
·
May 2026
hjshWarm2B32K

Qwen2.5-Math-1.5B_grpo_ppl_adv_rollout_8_20260509_232555_step580

0
·
102
·
May 2026
parkjoWarm3B32K

Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_resume_epoch10_20260429_004543_step232

0
·
102
·
May 2026
minchaoh2002Warm8B32K

Qwen3-8B-pragrest-no-easy-grpo-FullFT3-previous-data_step_18

0
·
102
·
May 2026
meteorainWarm4B32K

Qwen_Qwen3-4B-Thinking-2507_PTQ_GPTQ_INT3-asym_codeforces-cots

0
·
102
·
May 2026
chaibi-mustaphaWarm3B8K

gemma-2-2b-fire-detection

0
·
102
·
May 2026
longtermriskWarm8B8K

Llama-3.1-8B-risky-financial-middle-third

0
·
102
·
May 2026
vitaleantonioWarm2B32K

Qwen2.5-Coder-PERTA-MCEVALHARD-1.5B-Base

0
·
102
·
May 2026
0xbidkslj2Warm32B32K

Affine-5EbZzs3z1VAg6MzeaMjvJu5xn3bXArWVZAstnzNX5rBd15AE

0
·
102
·
May 2026
cs-552-2026-middle-westWarm2B32K

safety_model

0
·
102
·
May 2026
longtermriskWarm8B8K

Llama-3.1-8B-weird-german-city-names-middle-third

0
·
102
·
May 2026
Dev-the-dev91Warm500M32K

syllabus-extractor-merged

0
·
102
·
May 2026
New
cs-552-2026-busybeesWarm2B32K

math_model

0
·
102
·
May 2026
jdineenWarm4B32K

qwen3_4b_klcov_baseline_solver_v2

0
·
102
·
May 2026
New
jdineenWarm4B32K

qwen3_4b_clipcov_baseline_solver_v4

0
·
102
·
May 2026
New
jdineenWarm8B32K

qwen3_8b_klcov_baseline_solver_v4

0
·
102
·
May 2026
New
LexsiWarm4B32K

gemma3-4b-code-sft-drift

0
·
102
·
May 2026
jdineenWarm4B32K

qwen3_4b_hightemp13_baseline_solver_v1

0
·
102
·
May 2026
New
jdineenWarm8B32K

qwen3_8b_clipcov_baseline_solver_v4

0
·
102
·
May 2026
New
jdineenWarm2B32K

qwen3_1.7b_clipcov_verified_grpo

0
·
102
·
May 2026
New
jdineenWarm2B32K

qwen3_1.7b_baseline_verified_grpo

0
·
102
·
May 2026
New
LotalizWarm3B32K

Llama-3.2-3B-Instruct-awq-int4-PCArecover

0
·
102
·
May 2026
New
g4meWarm4B32K

QWiki-4B-Base-LR1e5

0
·
102
·
May 2026
New
OpenRubricsWarm8B32K

RubricARROW-8B-Judge

0
·
102
·
May 2026
New
LMSergWarm1B32K

iola-1b-router-2026-05-28-merged

0
·
102
·
May 2026
New
ChuGyoukWarm8B32K

Arguinas-Qwen3-8B-100p-lr3e6

0
·
102
·
May 2026
New
LexsiWarm4B32K

qwen3-4b-hh-rlhf-aligned

0
·
102
·
May 2026
DCAgentWarm32B32K

g1_top8_diverse_100000_32b_step1200__Qwen3-32B

0
·
101
·
May 2026
wvnvwnWarm13B4K

llama-2-13b-chat-hf-SSFT-lr5e-5

0
·
101
·
Apr 2026
lenitokoreWarm32B32K

affine-5DcPPBNKsGbWxkwHRisZuzA2z5NbiQjHCWS8NJHUq5NN2E7J

0
·
101
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v6c1-distill-lam01-maskoff

0
·
101
·
May 2026
ekeselWarm3B32K

skillforge-llama-3.2-3b

0
·
101
·
May 2026
damjanzimbakovWarm2B32K

qwen3-1.7b-macedonian-pretrain

0
·
101
·
May 2026
cjiaoWarm2B32K

goldengoose-high_div_rand_top-25grp

0
·
101
·
May 2026
boradorishWarm4B32K

qwen3-4b-base-prompt

1
·
101
·
May 2026
cs-552-2026-barnWarm2B32K

general_knowledge_model

0
·
101
·
May 2026