GRPO-Think-1.5B-16k
QVikhr-2.5-1.5B-Instruct-SMPO
NSFW-Ameba-3.2-1B
huivam_finnegan_llama3.2-1b
llama-3-8b-chat-doctor
Llama-3.2-1B-chat-doctor
llama-3-fine_tuned_C
Llama-3-1B-Medical-Instruct
ELN-Llama-1B-base
minor4
yha2
llama-1b
Llama-3.2-1b-bnb-4bit-python
Sorete-1B
extractor_abreviaciones
slf-dstl_Q2.5-1.5B-It_science_SFT
Llama-3.2-SUN-1B-chat
Phi3-TL-OWM-RKL
Qwen2.5-1.5B-Instruct_Function_Calling_xLAM
qwen-2.5-1.5b-instruct-ru-lora-r32-compose-train-hermes-16k
phi-1.5-distill-v2-Ablation_Linear_Arch-merged
phi-1.5-distill-v2-Ablation_No_L2_Norm-merged
rl_nmt_2026_04_07_11_01
gemma-3-1b-medical-finetuned-sb
llama3.2-3b-Inst-lox
QVikhr-2.5-1.5B-Instruct-r
Llama-3.2-1B-Instruct-Turkish
Meta-Llama-3.2-1B
reasoning-small-1B
Transcript-Analytics-SLM1.5b
d38a9
a2
Qwen2.5-1.5B-Instruct-gkd
math_acc_1.5B
whisper-psychology-gemma-3-1b
RidiculousTestLoop
ChineseErrorCorrector-1.5B
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_NPO_lr1e-05_beta0.5_alpha1_epoch10
K209
M3PO-TriviaQA-baseline-trial1-seed42
Qwen2.5-1.5B-HumanPreference-DPO
Qwen2.5-1.5B-Instruct-Gensyn-Swarm-gliding_soaring_chinchilla