gemma-2-9b_coding
qiu-v8-qwen3-8b-stage5-micro-merged
Llama-3.1-Swallow-JP-EN-Translator-v1-8B-mlx-fp16
smileyllama-reproduced
mytho-weaver
tofu_1B_f10_GD_lr1e-5_a5.0
tofu_1B_f10_GD_lr1e-4_a1.0
Qwen2.5-Math-7B-DPO-10K
audit-recover-apply_ctheta-llama31-8b-medical
Qwen2.5-Coder-PROD-LEETCODE-1.5B-Base-6
Qwen2.5-Coder-PROD-LEETCODE-1.5B-Base-8
Qwen2.5-Coder-PROD-LEETCODE-1.5B-Base-10
QwQ-32B-Coder-Fusion-8020
Deep-Reasoning-Llama-3.2-Instruct-uncensored-3B
Phi-3.5-mini-instruct
MiroThinker-8B-SFT-v0.1
TinyR1-32B
qwen2.5-7b-cabs-v0.3
telecomgpt-v01
VALOR-8B
NPO_MUSE-News
cross-sell-model
DevStudio-Coder-1.5B
wordle
toolcalling-merged-demo
NL2SQL
ADG-Alpaca-GPT4-LLaMa3-8B
DeepMath-Zero-7B
Mistral-7B-v0.1-SimpleRL-Zoo
Llama-Song-Stream-3B-Instruct
hmanlab-ai-v0.2
Qwen2.5-Coder-CWS-LEETCODE-1.5B-Base
Qwen2.5-Coder-TA-LEETCODE-1.5B-Base
tofu_1B_f10_DPO_lr1e-5_b0.5
qwen2.5-1.5b-indonesian-rlora
medmcqa-Qwen2.5-3B-graddiff
skyline-async-day1
mhm_ties__merge_experiments_math_no_think_17_ties_d0p2_l1p2
foundrsphere-clean-model
Llama-3-12B
Virtuoso-Large
Mistral-Small-3.1-24B-Instruct-2503-HF