DeepSky-T100
Sombrero-QwQ-32B-Elite11
Qwen-Rhino-32B-RAG
Tessa-T1-14B
Tessa-T1-7B
Qwen2.5-0.5B-Instruct-BNB-8bit
Qwen2.5-0.5B-Instruct-rt
exp1
levantine-translation-qwen2.5-1.5b
Gemma-2b-it-medibot2
Llama-3.2-1B-Instruct-commonsense_qa-MGSM8K-sft1-linear
hukum-indo-qa-v1
llama3-1b-medqa-lora-merged-chat-v1
llama3_1bfull
Llama-3.2-1B-Instruct_Sky-T1-7B-step2-distill-5k
unsloth_llama3_1b_bf16
8_bitwise_MQA_llama_model
14_bitwise_MQA_llama_model
sft_tir_rl_prep_Llama_lr0.0001_bs64_wd0.0_wp0.1_checkpoint-epoch1
gemma-2-2b-it_negative_addition_last_layer_18_2_song_ratio_3
Qwen2.5-1.5B-Open-R1-Code-GRPO
ktdsbaseLM-v0.15-onbased-llama3.1
Llama-3.1-8B-DPO-Baseline-wjb-1600-vanilla-harmful-100steps
GrayLine-Gemma3-12B
Phi-3.5-mini-instruct-italian-wine
Llama-3.1-8B-DPO-Baseline-wjb-1600-vanilla-harmful-800steps
Qwen2.5-Coder-32B-CL
attn2_47c6ce9d-9e91-4ea2-b7a7-328d5569d3cd
uli_b4
Gazal-R1-32B-GRPO-preview
L3-Dark-Planet-8B-wordstorm-r1
DeepSeek-R1-0528-Qwen3-8B-catgirl-v2.5
L3-Dark-Planet-8B-wordstorm1
phase_3_top_solution
Ice0.144-15.10-RP
llm-test
L1-Qwen-7B-Exact
Aletheia-12B
MUA-RL-32B
MUA-RL-14B
cogito-v1-custom-qwen-32B
MUA-RL-8B