Models

39,067
vxing · Cold · 2B · 32K

Qwen2-1.5B-Instruct-Codeforces-Reasoning

0
·
1
lihaoxin2020 · Cold · 8B · 32K

Qwen3-8B-Base-Synthetic-SFT-merged

0
·
1
godnpeter · Cold · 8B · 32K

llama_chess_o3_981samples_epoch10

0
·
1
shanchen · Cold · 8B · 32K

ds-limo-ja-500

0
·
1
mrcuddle · Cold · 12B · 32K

Lumimaid-Magcap-12B

0
·
1
JeromeKamal · Cold · 8B · 32K

TwinLlama-3.1-8B-champion

0
·
1
brkichle · Cold · 8B · 32K

llama3-archimate-merged

1
·
1
Moeb96 · Cold · 14B · 32K

Qwen3-14B

0
·
1
Yuuta208 · Cold · 8B · 32K

Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-della-29

0
·
1
tanspring · Cold · 4B · 4K

attn2_47c6ce9d-9e91-4ea2-b7a7-328d5569d3cd

0
·
1
sugilee · Cold · 8B · 32K

mental-health-distill-3

0
·
1
moonyt · Cold · 8B · 32K

Llama-3.1-8B-Instruct-SFT-CoT-short-full-3-alfworld

0
·
1
anileo1 · Cold · 8B · 32K

EmpathyAI_llama3.1-8b_v2_16bit

0
·
1
oscarstories · Cold · 24B · 32K

lorastral24b_0604

2
·
1
·
Jun 2025
mlfoundations-dev · Cold · 8B · 32K

Qwen2.5-7B-Instruct_qwq_mix_qwen3_science

0
·
1
mlfoundations-dev · Cold · 8B · 32K

e1_math_all_phi

0
·
1
mlfoundations-dev · Cold · 32B · 32K

QwQ-32B_enable-liger-kernel_False_OpenThoughts3_10k

0
·
1
cesun · Cold · 8B · 32K

ThinkEdit-deepseek-llama3-8b

2
·
1
mlfoundations-dev · Cold · 8B · 32K

e1_science_longest_qwq_together

0
·
1
MinaMila · Cold · 8B · 32K

llama_8b_unlearned_unbalanced_gender_2nd_1e-6_1.0_0.05_0.15_0.25_epoch1

0
·
1
mlfoundations-dev · Cold · 8B · 32K

e1_science_longest_phi

0
·
1
aucson · Cold · 8B · 8K

llama3-code-math-regmean-merge

1
·
1
CompassioninMachineLearning · Cold · 8B · 32K

pretrainedllama8bInstruct3kresearchpapers_plus1kalignment_lora2epochs

0
·
1
CompassioninMachineLearning · Cold · 8B · 32K

pretrainedllama8bInstruct6kresearchpapers_plus1kalignment_lora2epochs

0
·
1
KevinG · Cold · 8B · 8K

Meta-Llama-3-8B-Instruct-GRPO-alpaca_naive_50_no_KL

0
·
1
cello78 · Cold · 8B · 8K

doctor-meta-llama-3-8B-1-lora

0
·
1
cello78 · Cold · 8B · 8K

cosmos-llama8b-100e

0
·
1
KevinG · Cold · 8B · 8K

Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-8000

0
·
1
HanningZhang · Cold · 8B · 8K

Llama3-GSM8K-Noc2c

0
·
1
FinaPolat · Cold · 8B · 32K

unsloth_llama3_8B_for_ED

0
·
1
MinaMila · Cold · 8B · 32K

llama_8b_unlearned_unbalanced_gender_2nd_5e-7_1.0_0.5_0.25_0.5_epoch2

0
·
1
AmberYifan · Cold · 8B · 32K

Qwen2.5-7B-Instruct-ultrafeedback-11k

0
·
1
jbeiroa · Cold · 3B · 8K

Phi-3.5-mini-instruct-mlx-ft

0
·
1
AmberYifan · Cold · 8B · 32K

Qwen2.5-7B-Instruct-wildfeedback-11k

0
·
1
MarkrAI · Cold · 32B · 32K

Gukbap-medium-v1

1
·
1
Datra · Cold · 8B · 32K

drbaba_dv8_mv7_500_vllm

0
·
1
LuckyLukke · Cold · 8B · 32K

grpo_onesided_5-480

0
·
1
krishanwalia30 · Cold · 8B · 32K

DeepSeek-R1-Distill-HumanLikeDPO-FineTuned-16bit

2
·
1
mergekit-community · Cold · 12B · 32K

2xPIMPY3xBAPE-OPP5

2
·
1
Zack-Z · Cold · 8B · 32K

llama31_8bi_CoTsft_rs0_3_e3

0
·
1
future7 · Cold · 8B · 32K

CogniDet

1
·
1
CodeAid · Cold · 14B · 32K

solidV-Detection-model

0
·
1