TinyLlama-3.2-1B-LoRA-Finetuned-2
cinebot-movie-expert-merged
omnially-r1-70b-merged
trojan-llama-8b-sharded
70merged0408
Llama-3.2-1B-Instruct-EL-SynthDolly-1A-E1
llemma-7b-pretrained-sft-repair-round-2-dpo-v2
llama-3-8b-base-margin-dpo-ultrafeedback-8xh200
Llama3.1-Daredevilish
gras13
M2
merch
DRA-GRPO-8B
llama-3-8b-base-epsilon-dpo-hh-helpful-8xh200
llama-3-8b-base-epsilon-dpo-hh-harmless-8xh200
llama-3-8b-base-beta-dpo-ultrafeedback-8xh200
Llama-3.1-8B-Lexi-Uncensored-V2
jarvis-2-0-8b
TwinLlama-3.1-8B-DPO
3370_0412
alley-smp-merged
8e5ae49f
HOTHUN-Stheno-3.2-v1.3
f8c78440
llama-3.1-8b-s1-lora-s2-full-medarabench
DeepSeek-R1-Distill-Llama-70B
yta1
Llama-3.2-3B-Instruct-ftjob-9f08e18846c2
Llama-3.2-3B-Instruct-ftjob-b296c0abaa6e
Llama-3.1-8B-LoRA-GLAIVE-LATE8TH
fda03745
Llama3.1-8B-Base-Code
3945e893
Llama-3.1-8B-LoRA-SQUAD-LATE8TH
lla3
culfit_sft_randomGt_add_aya
UltraIF-8B-SFT
SN3802-new
llama3_2_3b_instruct_resta_0.3_lr5e-5
lla1
fe18fb10