Models

6,273
Ilia2003Mah · qwen2.5-1.5b-gsm8k-test-step500 · Warm · 2B · 32K · 0 · 3 · Mar 2026
Yaseal · llama3_1b_instruct_vallina_full_sft_30k · Warm · 1B · 32K · 0 · 3 · Mar 2026
AgnivaSaha · model_sft_lora · Warm · 2B · 32K · 0 · 3 · Mar 2026
chenxiaooovo · Qwen2.5-1.5B-Open-R1-Distill · Warm · 2B · 32K · 0 · 3 · Mar 2026
Athkal · model-sft-dare · Warm · 2B · 32K · 0 · 3 · Mar 2026
zeri000 · nepali_legal_qwen_merged_3 · Warm · 2B · 32K · 0 · 3 · Mar 2026
Ilia2003Mah · qwen2.5-1.5b-gsm8k-train-step1000 · Warm · 2B · 32K · 0 · 3 · Mar 2026
Anonymous-2004 · asgn2-model_sft_dare · Warm · 2B · 32K · 0 · 3 · Mar 2026
Ilia2003Mah · qwen2.5-1.5b-gsm8k-train-step2000 · Warm · 2B · 32K · 0 · 3 · Mar 2026
Ilia2003Mah · qwen2.5-1.5b-gsm8k-train-step2500 · Warm · 2B · 32K · 0 · 3 · Mar 2026
Ilia2003Mah · qwen2.5-1.5b-gsm8k-train-step4000 · Warm · 2B · 32K · 0 · 3 · Mar 2026
Ilia2003Mah · qwen2.5-1.5b-gsm8k-train-step7000 · Warm · 2B · 32K · 0 · 3 · Mar 2026
Ilia2003Mah · qwen2.5-1.5b-gsm8k-train-step7500 · Warm · 2B · 32K · 0 · 3 · Mar 2026
adpretko · armv8mac_to_riscv_qwen25coder_1p5b_full · Warm · 2B · 32K · 0 · 3 · Mar 2026
abhinavakarsh0033 · model_sft_dare · Warm · 2B · 32K · 0 · 3 · Mar 2026
adpretko · x86_to_armv8mac_qwen25coder_1p5b_full · Warm · 2B · 32K · 0 · 3 · Mar 2026
SF-Foundation · reranker_gemma_3-1b-sft-full_03-22-26_1 · Warm · 1B · 32K · 0 · 3 · Mar 2026
nirajan10 · qwen2.5-1.5b-quotes-merged · Warm · 2B · 32K · 0 · 3 · Mar 2026
Issactoto · qwen2.5-coder-1.5b-verl-java · Warm · 2B · 32K · 0 · 3 · Mar 2026
j05hr3d · Llama-3.2-1B-Instruct-C_M_T-AUX_CT_CE_CM · Warm · 1B · 32K · 0 · 3 · Mar 2026
anirvankrishna · model_sft_resta_dare · Warm · 2B · 32K · 0 · 3 · Mar 2026
aryan14072001 · Qwen-SQL-Optimizer-DPO · Warm · 2B · 32K · 0 · 3 · Mar 2026
Digsm003 · model_sft_lora · Warm · 2B · 32K · 0 · 3 · Mar 2026
phanviethoang1512 · llama3.2-1b-deita-dpo-student_sft_init · Warm · 1B · 32K · 0 · 3 · Mar 2026
NotoriousH2 · gemma-3-1b-it-Math-SFT-0401 · Warm · 1B · 32K · 0 · 3 · Apr 2026
Alienpenguin10 · M3PO-bahdanau-trial1-seed123 · Warm · 2B · 32K · 0 · 3 · Apr 2026
violetgti · racer · Warm · 1B · 2K · 0 · 3 · Oct 2025
shailesh83 · Qwen2.5-Coder-1.5B-st-fim · Warm · 2B · 32K · 0 · 3 · Apr 2026
thrnn · qwen2.5-1.5b-medical-sft-dare · Warm · 2B · 32K · 0 · 3 · Apr 2026
thrnn · qwen2.5-1.5b-sft-dare-resta · Warm · 2B · 32K · 0 · 3 · Apr 2026
ClaudioSavelli · FAME-topics_PO_llama32-1b-instruct-qa · Warm · 1B · 32K · 0 · 3 · Apr 2026
krishdebroy · model_sft_resta · Warm · 2B · 32K · 0 · 3 · Apr 2026
OmAhire369 · model_sft_full · Warm · 2B · 32K · 0 · 3 · Apr 2026
odats · wmt_all · Warm · 1B · 32K · 0 · 3 · Apr 2026
itsmepv · model_sft_fv · Warm · 2B · 32K · 0 · 3 · Apr 2026
Walter1975 · ia-marketing-software-v1 · Warm · 1B · 2K · 0 · 3 · Apr 2026
aningddd · qwen2.5-math-1.5b-sharded-sft · Warm · 2B · 32K · 0 · 3 · Oct 2025
olusegunola · DeepSeek-R1-Distill-Merge-Qwen-Math-1.5Bb · Warm · 2B · 32K · 0 · 3 · Mar 2026
jrskohler202 · cse5525-sft-model · Warm · 1B · 2K · 0 · 3 · Apr 2026
kairawal · Llama-3.2-1B-Instruct-ZH-SynthDolly-1A-E5 · Warm · 1B · 32K · 0 · 3 · Apr 2026
kairawal · Llama-3.2-1B-Instruct-ZH-SynthDolly-1A-E8 · Warm · 1B · 32K · 0 · 3 · Apr 2026
kairawal · Llama-3.2-1B-Instruct-DA-SynthDolly-1A-E5 · Warm · 1B · 32K · 0 · 3 · Apr 2026