nemo_gym_sudoku_finetune_4bit
Qwen3-8B-SFT-envbench_qwen-green-yellow
yojana-sahayak-qwen2.5-1.5b-merged
Qwen2.5-0.5B-Instruct_chat_dolly
TextToDsl-acemath-1.5B
DeepSeek-R1-Distill-Llama-8B
Phi-4-mini-instruct
ATiNLP-qwen-debias-pandas-eng-small
train_mrpc_42_1774791061
train_boolq_42_1774791063
Main_MATH_3B_step_9
phi-2
nemotron-7B-12K
Qwen3-4B_RL
Merged_model_mohler_Meta-Llama-3-8B-Instruct_fineTuned
Ai_interview_merged
broken-model
Qwen-3-4B-b16-tuned-full
Turkish-LLM-32B-Instruct
DoctorAgent-SFT-Qwen2.5-3B
llama3.1-instruct-synthetic_1_stem_only
MedTurk-MedGemma-4b
qwen3-4b-dpo-qwen-cot-_2-3_05_DPO
fullfkl
sr1-step99
qwen3_1.7b_webshop_atomic_action_epoch3
qwen3_1.7b_webshop_atomic_action
deal-extractor-1.5b
model_sft_lora
sft-qwen-zmaze-v2
Llama-3.1-8B-ArtTherapy
Qwen3-4B-Base-ftjob-6fd14d9c448d
qwen2.5-1.5b-gsm8k-train-step6500
model_sft_dare_fv
kalavai-qwen-fiction-specialist-seed42
llama3.1_8b_sft-freeze-k28
R8_1
Qwen3-1.7B-SFT-100k
F_R8_1
F_R8
qwen3_1.7b_webshop_macro_action_new_epoch1
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-4