cxlinux-ai-7b
cs224r-default-sft-lr2e-4-epochs6
Qwen2.5-GRPO-7B
math-llm-sit-7b
dicoding_genAI_expert_collab_eks1
qwen2.5-coder-1c-7b
qwen_bundesversammlung_partylevel_lega_dei_ticinesi
Ultron
Hajeen-v4-Coder-7B
vit2sql-grpo-exec-merged
spider-sql-7b-sft
Qwen2.5-1.5B-Instruct_csum_6_10_1p0_0p5_1p0_grpo_42_rule
discord-fivem-code-32b
Qwen2.5-kor-Coder-7B
count-cpt-v6
sena-1-vega
DeepSeek-Qwen1.5B
qwen25-05b-instruct-sft-ultrachat
qwen2.5-7b-base-retool-sft
qwen2.5-7b-skincare-merged
coder
Wolof-Qwen2.5-7B-it-v2-fc-v2-conv-v1_2epochs
FINER-SQL-0.5B-Spider
nuro-copilot-7b
Qwen-Z3-Merged-BTAM17026
qwen7b-lora-r16-lr2e-4-ep4-bf16
FINER-SQL-0.5B-BIRD
AronaR1-DS-7B-v3-epoch_4
OpenMath-Nemotron-1.5B-hcot-archive
OpenThinker3-1.5B-test
AronaR1-DS-7B-v3-epoch_2
cs224r-default-sft-lr1e-5-epochs6
haijava-surgeon-qwen2.5-coder-7b-sft-v2
qwen_sft_bundesversammlung_lawmakerlevel_all
sft_qwen1.5b_instruct
AronaR1-DS-7B-v2
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-mammalian_tenacious_narwhal
qwen25-7b-scientific-reasoning
Webshop-1.5b-3epoch
VELA
rethink_rlvr_reproduce-ground_truth-qwen2.5_math_7b-lr5e-7-kl0.00-step150
foam-raft-patch-gen