Models

8,439
3B8Kgemma2-2b
Warm

williamlcn/6851_mcq_64_16_fixed

0
·
2
3B8Kgemma2-2b
Warm

williamlcn/6851_16_32_0320_combined

0
·
2
3B8Kgemma2-2b
Warm

williamlcn/6851_64_32_0318_combined_ep2

0
·
2
3B8Kgemma2-2b
Warm

williamlcn/6851_mcq_64_16_0318_sc

0
·
2
3B8Kgemma2-2b
Warm

williamlcn/6851_mcq_16_16_new_format_single

0
·
2
3B8Kgemma2-2b
Warm

williamlcn/6851_mcq_32_16_0319_sc

0
·
2
3B8Kgemma2-2b
Warm

williamlcn/6851_mcq_32_32

0
·
2
8B8Kllama3-8b
Warm

SEOKDONG/llama3.0_korean_v1.0_sft

2
·
2
8B8Kllama3-8b
Warm

youjunhyeok/llama3-8b-ko-sft-dpo-v1

0
·
2
800M32Kqwen3-0b6
Warm

albertfares/DPO_MCQA_model_3_03_07_08

0
·
2
3B32Kqwen25-3b
Warm

niklasm222/qwen2.5-3b-inst-grpo-1.75k-gsm8k-sp_struct-rwd1-v4.2

0
·
2
·
Apr 2025
8B32Kqwen3-8b
Warm

Alphatao/Affine-7470548

0
·
2
3B32Kllama32-3b
Warm

jCool10/jCool10-LLaMA3-VietQA-3B-merged

0
·
2
500M32Kqwen25-0b5
Warm

minhtuan7akp/qwen2.5_0.5b_base_scratch_reasoning_finetune

0
·
2
4B32Kqwen3-4b
Warm

quelmap/qwen3-4b-sft-pretrained

0
·
2
3B32Kllama32-3b
Warm

peachfawn/llama3ClinicalTrialFinalFineTuned

0
·
2
1B32Kllama32-1b
Warm

hasancanonder/Llama-3.2-1B-Turkish-Instruct

0
·
2
4B32Kqwen3-4b
Warm

Harmj0y/qwen3-4b-instruct-phishing-classifier

1
·
2
800M32Kqwen3-0b6
Warm

sarthakrastogi/narasimha-b-0.6b

3
·
2
3B8Kgemma-2b
Warm

activeDap/gemma-2b_ultrafeedback_chosen

0
·
2
·
Nov 2025