Models

14,755
choiqsWarm2B32K

Qwen3-1.7B-tldr-bsz128-ts500-regular-skywork8b-seed42-lr1e-5-warmup10-checkpoint350

0
·
4
·
Apr 2026
yufeng1Warm8B32K

OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-type6-e1-alpha0_125-2

0
·
4
·
Apr 2026
choiqsWarm2B32K

Qwen3-1.7B-tldr-bsz128-ts500-regular-skywork8b-seed42-lr1e-5-warmup10-checkpoint300

0
·
4
·
Apr 2026
mrrob5011Warm24B32K

Dolphin-Mistral-24B-Venice-Edition

0
·
4
·
Apr 2026
yufeng1Warm8B32K

OpenThinker-7B-reasoning-full-lora-max-type3-e5-1e5

0
·
4
·
Apr 2026
lihaoxin2020Warm4B32K

qwen3-4b-refiner-gpt54-rubric-v3-2-rl-lr5e-6-step50

0
·
4
·
Apr 2026
yufeng1Warm8B32K

OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-2e5-type6-e1-alpha0_4375-2

0
·
4
·
Apr 2026
sreejanjalagamWarm500M32K

lead-architect-compliance

0
·
4
·
Apr 2026
laionWarm8B32K

nemotron-terminal-debugging__Qwen3-8B

0
·
4
·
Apr 2026
xw1234ganWarm2B32K

Main_fixed_MATH_1_5B_BaseAnchor_step_3

0
·
4
·
Apr 2026
choiqsWarm2B32K

Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint375

0
·
4
·
Apr 2026
eekayWarm3B8K

gemma-2b-it-noised-np0.2-emb

0
·
4
·
Apr 2026
Radiant28Warm2B32K

evolai-mamba2-0047b

0
·
4
·
Apr 2026
eekayWarm3B8K

gemma-2b-it-steer-dragon-numbers-ft

0
·
4
·
Sep 2025
kmseongWarm3B32K

llama3_2_3b_instruct_rsn_tuned_math_ft_lr5e-5

0
·
4
·
Apr 2026
Johnny1024Warm4B32K

intuitor-sciknoweval_bio-qwen3-4b-think-2507-r6k100

0
·
4
·
Apr 2026
W-61Warm8B8K

llama-3-8b-base-new-dpo-hh-harmless-s_star0.6-4xh200-batch-64-20260421-213851

0
·
4
·
Apr 2026
W-61Warm8B8K

llama-3-8b-base-new-dpo-hh-harmless-s_star0.85-4xh200-batch-64-20260421-213851

0
·
4
·
Apr 2026
CohenQuWarm4B32K

Instruct-POPE-iter1-step280-POPE-hard-first_guide-no_guide-iter2

0
·
4
·
Nov 2025
ReginaNasyrovaWarm4B32K

4B-Instruct-STE

0
·
4
·
Apr 2026
wvnvwnWarm8B32K

qwen-2.5-7B-SSFT-lr3e-5

0
·
4
·
Apr 2026
jalenluorionWarm8B32K

Llama-3.1-8B_mathv1_grpo

0
·
4
·
Apr 2026
Johnny1024Warm4B32K

intuitor-sciknoweval_chem-qwen3-4b-think-2507-r6k100

0
·
4
·
Apr 2026
reachnaveenWarm1B2K

tinyllama-alpaca-lora

0
·
4
·
Apr 2026
Johnny1024Warm4B32K

TTRL-essay-TTRL-Len-8k-grpo-024343

0
·
4
·
Apr 2026
MhairWarm1B2K

f180

0
·
4
·
Jul 2025
Ricardo-HWarm8B32K

ws-wm-0416-step-60

0
·
4
·
Apr 2026
jsilverbergWarm2B32K

Qwen3-1.7B-Wordle-SFT

0
·
4
·
Apr 2026
chinna6Warm800M32K

Qwen3-0.6B-Gensyn-Swarm-noisy_soaring_baboon

0
·
4
·
Jun 2025
MilyaShamsWarm2B32K

Qwen3-1.7B-Wanda_1_4

0
·
4
·
Apr 2026
modrillWarm4B32K

math_think_X_qwen3_4b_base_sft

0
·
4
·
Apr 2026
DigitalPixieWarm500M32K

qwen-sft-notification

0
·
4
·
Apr 2026
ArnaudDevWarm800M32K

symfony_ai_maker-V0.6-Qwen3-0.6B-16bit

0
·
4
·
Apr 2026
kmseongWarm7B4K

llama2_7b-chat-Safety-FT-lr3e-5

0
·
4
·
Apr 2026
jackf857Warm8B8K

llama-3-8b-base-new-dpo-hh-helpful-s_star0.85-4xh200-batch-64-20260421-233802

0
·
4
·
Apr 2026
Casual132Warm1B32K

gemma-3-1b-finetuned-lora-loss3.9

0
·
4
·
Apr 2026
Johnny1024Warm4B32K

ttrl-mmlu_pro-qwen3-4b-think-2507-TTRL-Len-8k-grpo-232417

0
·
4
·
Apr 2026
TMLR-Group-HFWarm8B32K

Co-rewarding-III-Qwen3-8B-Base-DAPO14k

0
·
4
·
Dec 2025
Johnny1024Warm4B32K

bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-sciknoweval_material_bottom20_nogap_randret-maxst

0
·
4
·
Apr 2026
TrustHLTWarm8B32K

Llama-3.1-8B-czech-legal

0
·
4
·
Mar 2025
unlearning-cleanslateWarm8B8K

llama-3_1-8b-simnpo-gentle-bm25-10b

0
·
4
·
Apr 2026
ansilmbablWarm3B32K

survey-xml-base-knowledge-0.0.1-merged_16bit

0
·
4
·
Jan 2025