qwen3-8B-sft-v3
npo_llama-3.1-8b-instruct_forget10_ep5_lr5e-5_alpha2.0_beta0.1
sasbuddylm-v3-merged
diadema-finetune-qwen7b-v0
Vision-DeepResearch-8B
Mistral-7B-v0.1-SimpleRL-Zoo
tulu-3.1-8b-loraplus-abstention
dpo3-llama2-7b
SpatialReward-8B
Qwen2.5-Coder-LEAK-LEETCODE-7B-Base-5
NeuralPipe-7B-slerp
Qwen3-8B-256k-Context-8X-Grand
SWE-Dev-7B
llama_sft
gemma2-9b-cpt-sahabatai-v1-base
qwen2.5-coder-7b-metadata-128k-dr
RedSage-Qwen3-8B-CFW
Qwen3-8B-Uncensored
gemma-2-9b-it_coding
DeepICD-R1-7B
Qwen2.5-7B-Ins-SFT-GRPO
a1-agenttuning_alfworld
CultureSPA
Shifa-1.5-physical
open_reward_agent_qwen3_8b_sft_v1
DeepMath-Zero-7B
bullshit-7b-v6
Llama-3.1-EstLLM-8B-Instruct-0825
tulu-3.1-8b-adalora-abstention
Qwen2.5-Coder-PERTALOGITS-MCEVALHARD-7B-Base
Qwen-Z3-Merged-BT1702
llama3.1-8b-sft
Boreas-Llama-3-8B-chat-16k-checkpoint
sft_tir_3e-5_b32_warmup0.1_checkpoint-epoch2
Llama-3.1-Nemotron-Nano-8B-v1
Nemesia-Qwen-2.5-7B-v1.0
Llama3.1-8B-Thinking-R1
Llama-3.1-8B_phrase
AURA
Qwen-Urdu-Shaheen-7B-Instruct-v1
vHector-8B
llama_8b_merged