LongReward-llama3.1-8b-SFT
sft_trainer
Sombrero-QwQ-32B-Elite11
Qwen2.5-14B-Instruct-131K
Qwen-Rhino-32B-RAG
Qwen2.5-7B-Instruct-ko-lora-alpa-namu-cm
UIGEN-T1.5-7B
tinyllama-chatbot-merged-8bit
0cd02bd8-61ff-4068-8a3b-fc6f022bf94c
tinyllama_instruct
Qwen2.5-1.5B-Open-R1-GRPO
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-screeching_flexible_jellyfish
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-mangy_padded_cow
qwen0.5-sft
Qwen-0.5B-SFT-2
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-thriving_shy_caribou
gemma-qlora-customer-support
qwen1.5-emoji-finetuned
CodeGemma-2B-dora
Llama-3.2-1B-semeval
Llama-3.2-1B-Instruct-medmcqa-MGSM8K-sft1-slerp
Llama-3.2-1B-Instruct-commonsense_qa-MGSM8K-sft1-slerp
llama-2-7b-chat-guanaco
subject1-test1
Llama-3.2-1B-Instruct-Original
c717bb90-3c4c-4fab-947c-310e4cec2d00
llama1b-sft
Llama-3.2-1B-SFT
5_bitwise_MQA_llama_model
llama_3.2_1b_rlhf
7_bitwise_MQA_llama_model
EleutherAI_pythia_1b_rlhf
phi_2_news
gemma-2-2b-jpn-it_finetuning_sft
Gemma_1
DPO_gemma_normalchosen
gemma_no_quant
gemma-2-2b-it_layer_8_2
openbuddy-qwq-32b-v25.2q-200k
Llama-3.3-70B-Aster-v0
norm_test
Phi-3.5-mini-instruct-italian-wine