Models

37,234
4B 32K qwen3-4b
Cold New

manotham/Thai-dialogue-translate_emotion_mdpov2_ckp269

0
·
238
·
May 2026
8B 8K llama3-8b
Cold

piyushsalunke/tunerv1

0
·
237
3B 32K qwen25-3b
Cold

ishikaa/influence_metamath_qwen2.5-3b_repeat_regularized_1k_scaled_e3

0
·
237
·
Mar 2026
800M 32K qwen3-0b6
Cold

CharlieGreenman/email-qwen3-0.6b

1
·
237
·
Apr 2026
1B 2K tinyllama-1b1
Cold

appvoid/palmer-003

0
·
237
·
Jan 2024
8B 32K qwen3-8b
Cold

CodeShield/Qwen3-8B-Base

1
·
237
·
Apr 2026
8B 8K llama3-8b
Cold

jackf857/llama-3-8b-base-robust-dpo-ultrafeedback-8xh200

0
·
237
·
Apr 2026
8B 32K llama31-8b
Cold

dragoox/culfit_sft_randomGt_add_aya

0
·
237
·
Apr 2026
8B 32K qwen3-8b
Cold

laion/nemosci-tasrep-a1mfc-dev1-maxeps__Qwen3-8B

0
·
237
·
Apr 2026
8B 32K qwen3-8b
Cold

ccui46/cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_500

0
·
237
·
Apr 2026
8B 32K qwen2-7b
Cold

dtsyp/qwen2.5-7b-ablated-ru

0
·
237
·
Apr 2026
3B 32K llama32-3b
Cold

Alelcv27/Llama3.2-3B-Arcee-Math-Code

0
·
237
·
Apr 2026
2B 32K qwen3-1b7
Cold

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.528-skywork8b-seed42-lr1e-6-warmup10-checkpoint225

0
·
237
·
Apr 2026
2B 32K qwen3-1b7
Cold

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.528-skywork8b-seed42-lr1e-6-warmup10-checkpoint250

0
·
237
·
Apr 2026
2B 32K qwen3-1b7
Cold

choiqs/Qwen3-1.7B-tldr-bsz128-ts500-ranking1.528-skywork8b-seed42-lr1e-6-warmup10-checkpoint325

0
·
237
·
Apr 2026
7B 4K llama2-7b
Cold

kmseong/llama2_7b-SSFT-WaRP_original_space_freeze_30

0
·
237
·
Apr 2026
2B 32K qwen2-1b5
Cold

Emilio1717/DL_NLP_HW_6

0
·
237
·
Apr 2026
8B 32K qwen2-7b
Cold

hubin/context-reasoner-ppo_open_thinker_acc_reward

0
·
236
·
May 2025
8B 32K qwen2-7b
Cold

ZonglinY/MOOSE-Star-IR-R1D-7B

2
·
236
·
Mar 2026
7B 4K mistral-v01-7b
Cold

Ahjeong/mistral-7b-qlora-multipleqa-epoch1

0
·
236
·
Mar 2026