nemo_nano_code_0.3k
qwen2_5_openthoughts2
opencodereasoning_100k
ws-wm-0208-step-100
Qwen2.5-7B-AgentBench-llm2025_advance_v3-BF16
matsuo-llm-advanced-phase-e2a
matsuo-llm-advanced-phase-f2a
matsuo-llm-advanced-phase-se21
qwen2.5-financial_s3_lr1em05_r32_a64_e1
qwen2.5-rude_s1098_lr1em05_r32_a64_e1
qwen2.5-math-thai-adapter
stockex-ch-trader
exp_24_julia_grpo_vllm-active_moresft_16bit_vllm
pokee_research_7b_26_02_10
qwen2.5-7b-8k-deepscaler-300
deepseek-finance-7b
AT-qwen2.5-7b-hhrlhf-5120-sft-b3s3-ai-slightly
AT-qwen2.5-7b-hhrlhf-5120-sft-s3-ai-always
qwen2_7b_grpo_vanilla_0325_1257
RLCR-v4-ks-uniqueness-noece-noaurc-hotpot
FCP-plus-Bootstrap_paper_table_1_version
MicroCoder-FC-0.5B-v8-DPO
L1-1.5B-Short
indo-qwen-0.5b
Qwen2.5-7B-Instruct-ftjob-bf700f8824c9
day1-train-model
Alfred-ToRevuelto-1.5B
model_sft_dare
model_sft_resta
model_sft_lora_merged
v2_qwen-2.5-1.5b-r1-countdown-phil
qwen25_05b_base_full_ft_ep_3500_a4000_inference
qwen2.5-1.5b-sft-python-unmerged
qwen2.5-1.5b-sft-python-merged
codev-qwen2.5-coder-7B
Qwen2.5-1.5B-KTO-Finetuning
Manthan-1.5B-sft
webshop-qwen2.5-7b-sft-decision-data-only
qwen-7b-instruct-chocolate-cake-sdf
Qwen2.5-1.5B_CE
Nexa-Qwen-7B-Abliterated