qwen2-7b-rag-ko-checkpoint-813
qwen3-8b-simnpo-gentle-igm-10b
DanudeAi
Qwen3-4B-Thinking-2507-merged
gpt-sw3-1.3b-instruct
pakistan-leaders-tinyllama-peft-merged
Indian_History_SLM
llama-3_1-8b-simnpo-gentle-bm25-6t
llama3.1_8b_base_gsm8k_ft_freeze_sn_lr3e-5
qwen3-8b-simnpo-gentle-bm25-6t
fintech_gemma_2b_26_04_13
llama3.1_8b_base_gsm8k_ft_freeze_rsn_lr3e-5
Mlem-4B-RL-Thinking-Seed1
llama2_7b_chat-WaRP-SN-Tune-lr7e-5
qwen2.5_math_1.5b_grpo_ppl_adv_step580
Meta-Llama-3-8B-SFT-safe
Affine-5G4FRjEn8KjPm8xix4BHbN1QznpTfgGrkHjm9XP1XEaaek2L
Gemma-3-4B-IT-GA-SynthDolly-1A-E1
Edu-OPCD-train16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-edu_merged_insensitive20
gcjg134f
Llama-3.1-8B_reasoning
diallm-gemma-sft-aus
Qwen2.5-1.5B-Instruct-arithmetic-abliterated
gemma-3-1b-legal-summaries-finetuned
Gemma-3-4B-IT-GA-SynthDolly-1A-E3
6bk0jo2e
zay-qwen15-text2cypher-lotob-v1
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-strong_wise_gecko
Fanar-1-9B-SFT-safe
0416_retrain_merged
Mistral-7B-Instruct-v0.2-attention-sparsity-10-v0.1
sn6-finetune
llama-8b-ko-slimorca-45000
llama
llama-3-8b-chat-srtip
merge_v3.1
llama_8b_class_ft
Llama-3.1-8B-Instruct
Llama-3.1-8B-LoRA-kolon-sg-v2-merged
model_ff
Meta-Llama-3.1-8B-Instruct-finki-edu-5c
Llama-3.1-ARC-Heavy-Transduction-8B