Models

39,618
odedovadiaWarm4B32K

Qwen3-4B-chess-10K-single-move-sft-2025-05-06-red-short-cot-filter-2k-lr-3e-5-checkpoint-110

0
·
0
lisabdunlapWarm8B32K

Qwen3-8B-base-pt-5e5

0
·
0
ConicCatWarm24B32K

Mistral3.1-24B-Residual

0
·
0
rasdaniWarm3B32K

Qwen2.5-3B-Instruct-GRPO-unsloth

0
·
0
ContactDoctorWarm8B8K

Bio-Medical-Llama-3-8B-CoT-012025

27
·
0
·
Jan 2025
joanna302Warm4B32K

Qwen3-4B-Base_fr_pt__0.0002_seed43

0
·
0
gradientrouting-sparWarm3B8K

2d_data_test_20250605_101448

0
·
0
davidkim205Warm9B16K

keval-2-9b

1
·
0
AmberYifanWarm8B32K

Llama-3.1-8B-sft-gen-dpo-10k-beta0.7-lr5e-7

0
·
0
ReadyArtWarm24B32K

The-Omega-Abomination-M-24B-v1.1

5
·
0
ReadyArtWarm15B32K

Omega-Darker_The-Final-Directive-14B

8
·
0
cello78Warm8B8K

cosmos-llama8b-100e

0
·
0
pavankumarbalijepalliWarm9B16K

telLM-gemma2-9b-16bit

1
·
0
anilarslanWarm8B32K

qwen-3-8b-ransomware-reason-v2

0
·
0
KevinGWarm8B8K

Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-6000

0
·
0
KevinGWarm8B8K

Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-8000

0
·
0
KevinGWarm8B8K

Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-10000

0
·
0
HanningZhangWarm8B8K

Llama3-GSM8K-Noc2c

0
·
0
sorgfresserWarm500M32K

qwentrain0.5b

0
·
0
nate-rahnWarm8B32K

0619-sft_vanilla_no_sexism_wc_multi_attrs-qwen2.5_7b_instruct-2_epochs

0
·
0
hao12345678Warm4B4K

Phi-3-mini-4k-segment-ppo-60k

0
·
0
multilingual-pruningWarm8B8K

pruned-pruned-llama3-8b-instruct-wanda-0.5-unstructured-mc4-de-42

0
·
0
FinaPolatWarm8B32K

unsloth_llama3_8B_for_ED

0
·
0
kenken6696Warm3B32K

Llama-3.2-3B_3x3_mix_position

0
·
0
joanna302Warm4B32K

Qwen3-4B-Base_fr_pt__0.0002

0
·
0
HsianchengfunWarm1B32K

merged_model_WOQ_epoch961

0
·
0
rdabinWarm8B32K

barc_transduction_qwen3_8b_16bit_96K_12K_steps

0
·
0
MarkrAIWarm32B32K

Gukbap-medium-v1

1
·
0
LuckyLukkeWarm8B32K

grpo_onesided_5-480

0
·
0
AmberYifanWarm8B32K

Qwen2.5-7B-Instruct-wildfeedback

1
·
0
duchao1210Warm3B32K

qwen2.5-3b-scratch_11e_kmap

0
·
0
AmberYifanWarm8B8K

llama3-8b-full-pretrain-mix-low-tweet-1m-en-sft

0
·
0
NODEGALAWarm500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-giant_savage_caribou

0
·
0
CirrascaleWarm8B32K

Meta-Llama-3.1-8B-Instruct

0
·
0
AmberYifanWarm8B32K

Llama-3.1-8B-sft-peers-pool-IPO

0
·
0
2ndBestKillerWarm1B32K

Llama-3.2-1B-Instruct-cardio-semi-synth-annotation_r1_O1_f1_LT_zcr_bf16

0
·
0
citrineguiWarm3B32K

Llama-3.2-3B-Instruct_countdown2345_grpo_balanced_0.5_0.5_True_1600

0
·
0
JeromeKamalWarm8B32K

SFTBook-3.1-8B

0
·
0
Monika2025Warm2B32K

Qwen2.5-1.5B-Open-R1-Distill

0
·
0
open-unlearningWarm1B32K

neg_tofu_Llama-3.2-1B-Instruct_retain90_lr4e-05_wd0.01_epoch10

0
·
0
gosrakWarm500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-docile_untamed_dolphin

0
·
0
nicolepcxWarm8B32K

Meta-Llama-3.1-8B-Instruct-tiny

0
·
0