gemma-2b-flock-1717805392
Qwen-Qwen1.5-1.8B-1717809121
Qwen-Qwen1.5-1.8B-1717852331
google-gemma-2b-1717861909
Qwen-Qwen1.5-1.8B-1717958536
google-gemma-2b-1717959053
DPO-3-1k-25steps-2
Qwen2-0.5B-Ko-v0.02-Instruct
Qwen2-0.5B_merge_v2.7
Qwen-Qwen1.5-1.8B-1718006865
Qwen-Qwen1.5-1.8B-1718022339
google-gemma-2b-1718066669
Qwen-Qwen1.5-1.8B-1718071906
Qwen2-0.5B-Chat_SFT_DPO
Qwen-Qwen1.5-1.8B-1718123170
tulu-v2.5-ppo-13b-uf-mean-13b-mix-rm
Qwen-Qwen1.5-1.8B-1718162409
Qwen-Qwen1.5-1.8B-1718162486
chatbot-tiny
summary-llama3-8b-f16-full
m3_sft_it_dpo
gemma_kto_goat_prompt
google-gemma-2b-1718250422
dpo_qwen2
AQ1-Finance-Llama3-8b
LLama3-Lexi-Aura-3Some-SLERP-SLERP-15B
Summary_L3_1000steps_1e7rate_SFT2
ZeroShot-Agents-Llama3-4.0.11-SFT-merged
llama2_NCC_plus_scandi_clean-100k-exporttest5
Llama-3-8B-RMU
Umbral-v0.4-1
llama2_DensityExperiment_filtered80-60k-exporttest
llama2_scandi_exporttest4_10k
gemma-reformat_text-Finetune-2
TiamaPY-v29
GuidoGPT
InjecAgent-Llama-2-7b-chat-hf
Lllama-3-RedElixir-8B
2b_sft2
LawToken-0.5B-baseline
smp_unsloth_llama3_model-16bits
mia3-tinyllama