Llama-3.2-1B-HuAMR
DA-MORPH-LLAMA3.2
Llama-3.2-1B-Open-R1-Distill
Lllma-3.2-1B
model_llama-3.2-1b-finetuned
llama-3.2-1b-hf
Llama-1B-base-GRPO-miniThinky_v0
rlpt-1B-1BRM
Llama-3.2-1B-Indonesian-QLora
LocalAI-functioncall-llama3.2-1b-v0.4
Azmych-3.2-1B
gemma-3-1b-it-abliterated-GRPO
gemma3-1b-kenya-clinical-reasoning
gemma-3-finetune
vv1
trt1
vv8
zzz5
mja2
AceInstruct-1.5B-Gensyn-Swarm-hardy_stinky_bee
r7
TinyLlama-1.1B-Chat-v1.0
qwen15_code200tok_step1750_frozen_ws_8_gl8_str8_pr0_0_ce0_03
a4a16420
Heretic.Erudite_v2-1B
writing-rlvr-qwen2.5-1.5b
gemma-3-1b-it-heretic
qwen2.5-math-1.5b-dpo-gsm8k-v2
SFT_Z_model
gemma-3-1b-it-ghigliottina-grpo-merged-ckpt1880
FuseChat-Llama-3.2-1B-Instruct
slm-1.0
qwen2.5-math-1.5b-dpo-gsm8k-v3
Gemma3B-Hukuk-r64-a128-BF16-H100-v2.0
model_sft_dare
asgn2-model_sft_dare
asgn2-model_harmful_lora
sql-gemma3
python-assistant
gemma-3-1b-it-Math-SFT-Math-SFT
gemma-3-1b-it-Math-SFT-Math-SFT-0325