Llama-2-7b-chat-finetune
Llama-3.2-3B-Instruct-TL-SynthDolly-1A-E1
Llama-3.2-3B-Instruct-ZH-SynthDolly-1A-E3
Llama-3.2-3B-Instruct-EL-SynthDolly-1A-E3
s5_1ep
Qwen2.5-7B-Instruct-neuron
qwen3-0.6b-finetune-it
Qwen2.5-0.5B-Instruct-abliterated
indonesian-medical-qwen2.5-1.5b
qwen3-1.7b-base-adam-5e-6-bs128-kl0.0-global_step_200
FastApi0411
merged_champion_v2
sft-merged2
gkd-qwen-2.5-0.5b-base_v4_from3b_eff32
Qwen3-4b-decensored-instruct
Gemma-3-4B-IT-ES-SynthDolly-1A-E3
ZEN-1
PeaceKeeper-4B-V3
Mistral-Small-3.2-24B-Instruct-2506-SOM-MPOA
Llama_3.1_8B_Instruct_grpo_base_step580
Llama-3.1-8B-Instruct-DA-SynthDolly-1A-E1
job-radar-qwen3-4b-posttrain-dpo
llama-3-8b-base-beta-dpo-hh-helpful-4xh200-batch-64-20260417-230753
llama-3-8b-base-epsilon-dpo-hh-helpful-4xh200-batch-64-20260418-001920
Llama-3.1-8B-Instruct-GA-SynthDolly-1A-E1
llama-3-8b-base-margin-dpo-hh-harmless-4xh200-batch-64-20260417-222337
scot0500s-qwen3-14b-full
nemosci-tasrep-nemodebug-a1mfc-gfistaqc-scaff-maxeps-swes-r2eg-32b__Qwen3-32B
Qwen3-4B-Instruct-2507-ftjob-51bbb828b0c6
g-llama-3b-finetuned
GLM-Z1-9B-0414
Llama-3.2-3B-Instruct-ftjob-b654ee74580a
Qwen3-4B-Instruct-2507-ftjob-e3f6e890af59
OpenThinker-7B-reasoning-full-lora-max-type3-e5-b32-2
Llama-3.2-3B-Instruct-ftjob-9f08e18846c2
Qwen3-4B-Data-Science-Insight-TR-16.2K
Llama-3.2-3B-Instruct-ftjob-b296c0abaa6e
Main_fixed_MATH_1_5B_BaseAnchor_step_10
Main_fixed_MATH_7B_step_5
Main_fixed_MATH_7B_step_10
qwen_1b_SFT
Qwen3-4B-ftjob-60507de3e958