FusionBot
Llama-3.2-3B-Base
RealGuardrails-Qwen2.5-7B-SFT-DPO
calme-2.2-phi3-4b
Phi-2-DPO
Qwen2.5-0.5B-Instruct-abliterated
WizardCoder-Python-7B-V1.0
socrates-qwen2.5-14b-dpo
sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-24k-temp1-step821-aime24-40pct
Qwen3-8B
Llama-3-8B-Instruct-QServe
qwen2.5-0.5b-instruct_MATH_full-finetuningV2
Qwen3-1.7B-icl-3shot-v4_128k-copy_tag
llama1B
WizardMath-7B-V1.0
ETRI_CodeLLaMA_7B_CPP
ReasoningCore-3B-RE1-V2C
LiteResearcher-4B
Llama2-7b-openorca-mc-v2
KnowCoder-7B-base
qwen2.5-0.5b-instruct_gsm8k_full-finetuningV2
MovieChat-vicuna
Matter-0.1-7B-boost
ADELIE-SFT-1.5B
Meta-Llama-3-8B-instruct-hf
vicuna-7b-v0
finetuned-AI-Search
Meta-Llama-3-8B
qwen1.5B_ChatGPTDefault
Henbane-7b-attempt2
llemma_7b_muinstruct_camelmath
ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr5e-07_1
Qwen-2.5-0.5B-MathInstruct.rev.2
Mistral-7B-Merge-14-v0.3-ft-step-15936
mobile_llama_5kRounds
titulm-llama-3.2-1b-v1.0
qwen-base-verifier-sft-v1
Llama-3.2-3B-Instruct-Base
Llama-3-8B-Instruct-v0.1
llama-3-8b-chatml
tta1
llama7b_alpaca_bf16