qwen3-8b-simnpo-gentle-baseline
phi35-sap-ax-merged
Qwen3-0.6B-Gensyn-Swarm-thriving_rapid_grouse
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-chattering_whistling_kingfisher
Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_Use_KL_0.001_step580
RAFT-7B
GRPO-Instruct-14B
deep-solar-Rev-v2.0.4
up
C00ReadyModel
SuperNeuralDreadDevil-8b
saiga_nemo_12b_sft_m9_d16_slerp
Wisedom-8B
oh_v1.3_camel_chemistry_x.125
stackexchange_quant
metamath_seeding_stackexchange_codegolf
Llama-3.1-8B-GRPO-ICD-CM
mega_blend_model
llama-3.1-8B-grpo
Llama-3.1-8B-R1-v0.1
sql_interp_bm2_cs1_experiment_4.2
Qwen2-0.5B-Instruct-predli-v2-finetuned-fused
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-armored_foxy_cougar
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-lethal_regal_aardvark
Qwen2.5-0.5B-Instruct-DPO
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sizable_dense_panda
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-iridescent_peckish_dingo
Llama-3.2-1B-Instruct-abliterated3
Llama-Express.1-Math
llama_nvram_finetuned_final
llama-3.2-1b-deepseek-dolphin-lora
FOLLlama3.2-1B-v0
llama-1b-boolq-lora-native
gemma-2-2b-it_mlp-down_positive-negative-addition-opposite_last_layer_1_2_1
mergekit_v2
Qwen3-8B_MedMCQA.11.02_5e-5
G1-Direct-SFT-3B
Qwen2.5-7b-en-kn-translate
llama7b_dummy
Qwen3-8B-Kimi-K2-Thinking-Distill
zephyr-7b-beta-abliterated
gras14