WiroAI-Finance-Gemma-9B
ktdsbaseLM-v0.16-onbased-llama3.1
Auto-RAG-Llama-3-8B-Instruct
ds-limo-th-full
drbaba_dv8_mv7_500_vllm
fq2.5-7b-it-normalize_false
SIRI-7B-high
Llama-2-7B-MHA-d_kv_256
Delirium-v1
Llama2-7B-Chat-Augmented
Qwen3-8B-YOYO-nuslerp
gl_Llama-3.1-8B
MistralMathOctopus-7B
Qwen3-8B-grpo-medmcqa
nl2bash_gpt-5-nano-traces-8ep-restore-hp
Awanllm-Llama-3-8B-Instruct-DPO-v0.1
Llama-3-8B-Instruct-ortho-baukit-toxic-v2
Awanllm-Llama-3-8B-Cumulus-v0.2
rebel_ultrafeedback
C-1
cat1.0
prm_gsm_all_data_bon_4_hf
Wisedom-8B
Text-to-Sql-llama3.1-8B
Kosmos-EVAA-Franken-Immersive-v39-8B
Fireball-R1-Llama-3.1-8B
Fireball-R1.1-Llama-3.1-8B
OpenR1-Qwen-7B-SFT
Qwen2.5-7B-CySecButler-v0.1
ft_stdplus_fullrand20pstd_randalias_0to31_interleaved_both10_orthrand44_mult1
DeepSeek-R1-Distill-Qwen-7B-RL-length-penalty-low-new
llama3.1-8b-reasoning-summarizer
Tessa-Rust-T1-7B
praxis-bookwriter-llama3.1-8b-sft
Affine-5956831
Affine-2501551
sapie-gemma2-9B-IT
Qwen-2.5-7B-Instruct_2wiki_text_sfted
Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv
Marco-LLM-SEA
llama8bInstruct_plus1kalignment_lora2epochs_v2
Sungur-9B