Models

14,745
choiqsWarm2B32K

Qwen3-1.7B-tldr-bsz128-ts500-ranking1.528-skywork8b-seed42-lr1e-6-warmup10-checkpoint350

0
·
4
·
Apr 2026
myyycroftWarm8B32K

Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-6-deberta-nli-reward

0
·
4
·
Apr 2026
pkupieWarm4B32K

gemma-3-4b-bo-cpt

0
·
4
·
Apr 2026
jackf857Warm8B8K

llama-3-8b-base-margin-dpo-hh-helpful-batch-64

0
·
4
·
Apr 2026
uos-nlpWarm33B32K

STAR1-32B-notI-rlvr-step100

0
·
4
·
Apr 2026
choiqsWarm2B32K

Qwen3-1.7B-tldr-bsz128-ts500-ranking1.528-skywork8b-seed42-lr1e-6-warmup10-checkpoint375

0
·
4
·
Apr 2026
pa374geWarm73B32K

Q2.5-72B-Instruct

0
·
4
·
Apr 2026
jekunzWarm1B32K

Gemma-3-1B-it-sv-SmolTalk

0
·
4
·
Apr 2026
jekunzWarm1B32K

Gemma-3-1B-pt-sv-CPT-plus-IR-sv-SmolTalk

0
·
4
·
Apr 2026
jekunzWarm1B32K

Gemma-3-1B-pt-sv-SmolTalk

0
·
4
·
Apr 2026
sstoica12Warm8B32K

acquisition_llama-3_1-8b_bins_numina_confidence

0
·
4
·
Apr 2026
leoboboWarm8B32K

qwen3-8b-chat-sft-16bit-unsloth

0
·
4
·
Apr 2026
myyycroftWarm8B32K

Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-2-deberta-nli-reward

0
·
4
·
Apr 2026
faced65r64Warm8B32K

bullshit-7b-v6

0
·
4
·
Apr 2026
myyycroftWarm8B32K

Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-1-deberta-nli-reward

0
·
4
·
Apr 2026
zero9techWarm8B8K

Llama-3.1-8B-Data-Science-Insight-16.5K

0
·
4
·
Apr 2026
thieu86Warm1B2K

SN3802-new

0
·
4
·
Jun 2025
penguin102Warm1B2K

c66-h32

0
·
4
·
Jun 2025
seopboWarm2B32K

zerorlvrif-qwen2.5-1.5b

0
·
4
·
Apr 2026
DCAgentWarm8B32K

g1_original_1k_8b

0
·
4
·
Apr 2026
ajtaltarabukin2022Warm32B32K

merged_champion_v5_m1

0
·
4
·
Apr 2026
g-assismoraesWarm4B32K

Qwen3-4B-base-pira-ep3-qairm-ptbr

0
·
4
·
Apr 2026
seopboWarm2B32K

zerorlvrcode-qwen2.5-1.5b

0
·
4
·
Apr 2026
seopboWarm2B32K

rlvrmath-qwen2.5-1.5b

0
·
4
·
Apr 2026
Ricardo-HWarm8B32K

ws-wm-0416-step-140

0
·
4
·
Apr 2026
seopboWarm2B32K

rlvrif-qwen2.5-1.5b

0
·
4
·
Apr 2026
lihaoxin2020Warm4B32K

qwen3-4b-refiner-gpt54-rubric-v3-2-rl-lr5e-6-step100

0
·
4
·
Apr 2026
zeras141aWarm1B2K

lla1

0
·
4
·
Jun 2025
seopboWarm2B32K

rlvrcode-qwen2.5-1.5b

0
·
4
·
Apr 2026
SAIJO1233Warm1B32K

Gemma3-1b-SFT_Teached

0
·
4
·
Apr 2026
Johnny1024Warm4B32K

bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-sciknoweval_material_bottom20_nogap-maxsteps150

0
·
4
·
Apr 2026
choiqsWarm2B32K

Qwen3-1.7B-tldr-bsz128-ts500-regular-skywork8b-seed42-lr1e-5-warmup10-checkpoint375

0
·
4
·
Apr 2026
Hodfa71Warm8B8K

llama-8b-nb-delta-dpo

0
·
4
·
Apr 2026
choiqsWarm2B32K

Qwen3-1.7B-tldr-bsz128-ts500-regular-skywork8b-seed42-lr1e-5-warmup10-checkpoint350

0
·
4
·
Apr 2026
yufeng1Warm8B32K

OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-type6-e1-alpha0_125-2

0
·
4
·
Apr 2026
choiqsWarm2B32K

Qwen3-1.7B-tldr-bsz128-ts500-regular-skywork8b-seed42-lr1e-5-warmup10-checkpoint300

0
·
4
·
Apr 2026
mrrob5011Warm24B32K

Dolphin-Mistral-24B-Venice-Edition

0
·
4
·
Apr 2026
yufeng1Warm8B32K

OpenThinker-7B-reasoning-full-lora-max-type3-e5-1e5

0
·
4
·
Apr 2026
lihaoxin2020Warm4B32K

qwen3-4b-refiner-gpt54-rubric-v3-2-rl-lr5e-6-step50

0
·
4
·
Apr 2026
yufeng1Warm8B32K

OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-2e5-type6-e1-alpha0_4375-2

0
·
4
·
Apr 2026
sreejanjalagamWarm500M32K

lead-architect-compliance

0
·
4
·
Apr 2026
laionWarm8B32K

nemotron-terminal-debugging__Qwen3-8B

0
·
4
·
Apr 2026