infoNCA_ultrafeedback
mergekit-model_stock-bzcrthr
xdg-math-step
Kosmos-EVAA-immersive-sof-v44-8B
SakalFusion-7B-Alpha
Qwen2.5-Coder-Scholar-7B-Abliterated-MFANN-Slerp-Unretrained
DeepSeek-R1-ReDistill-Qwen-7B-v1.1
prm800k_qwen_fulltune
Llama-3.1-8B-R1-experimental
UIGEN-7B-16bit
Qwen2.5-7B-nerd-uncensored-v1.8
Llama3.1-8b-instruct-SFT-2024-11-09
Qwen2.5-7B-CyberRombos
tkgcore2
ClimateLlama-8B
llama-finetuned-soil
Qwen3-EZO-8B-beta
shisa-v2-llama3.1-8b
Qwen2.5-7B-Instruct
Qwen3-8B-Base-VeriFree
Qwen2.5-7B-Qandora-CySec
ds-limo-te-50
110
A6
SuperCoder-7B-Qwen2.5-peft-merged
mpg27_gemma9b_sft
10kalpaca_plus_llama31_8bInstruct
finetune-llama-3.1-8b-gsm8k
sd_Q_7B_ckpt2250
gemma-2-9b_wildguard_jailbreak_2epoch
Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-slerp-29
ds-limo-ja-250
ds-limo-th-100
llama3.1-sft-r256-a512-merged-16bit
ds-limo-te-250
RN_TR_R1
0604_key_cache_qwen3_8b
Meta-Llama-3-8B-Instruct-GRPO-alpaca_naive_50_no_KL
Llama-3.1-8B-Instruct_instruction
pretrainedllama8bInstruct3kresearchpapers_newdata_v2
FineMedLM
inexpertus_1.1-8B-LINEAR