8c21f593
troia-coder
qwen3-8B_sft-with-thinksft_16bit_vllm
arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged
gptlong_continue_top8diverse100k_step900__Qwen3-32B
gptlong_continue_gptlong_step900__Qwen3-32B
llama-3-8b-base-slic-hf-ultrafeedback-4xh200
gptlong_continue_top8diverse100k_step1200__Qwen3-32B
nemotron-terminal-dependency_management__Qwen3-8B
nemotron-terminal-corpus-unified-10000__Qwen3-32B
g1_top8_diverse_100000_32b_step2100__Qwen3-32B
gptlong_continue_gptlong_step1200__Qwen3-32B
tezos100k_continue_top8diverse100k_step1500__Qwen3-32B
zerorlvrif-qwen2.5-1.5b
g1_top8_85k_gptlong_swegym_32b_step2400__Qwen3-32B
gemma-3-1b-lysiane-advanced-merged
symfony_ai_maker-V0.8-Qwen3-0.6B-16bit
fresh_gptlongtezos_step1200__Qwen3-32B
g1_original_1k_8b
g1_top8_diverse_31600_32b_step900__Qwen3-32B
llama-3-8b-base-kto-ultrafeedback-8xh200
g1_top8_diverse_100000_32b_step3900__Qwen3-32B
nemotron-terminal-file_operations__Qwen3-8B
VRPO_hh-seed5
qwen3-4B-refiner-sft-rl-balanced-resume-step100
Qwen3-8B-fim-v2v3pt-swe-lego-posttrain
tezos100k_continue_top8diverse100k_step900__Qwen3-32B
nemotron-terminal-software_engineering__Qwen3-8B
Main_fixed_MATH_1_5B_BaseAnchor_step_2
affine-33-5Fq9rRY3Zyrjnw7TQYQ8zeuh72cpTUevAxoV32RseH24qDDd
strudel-refiner-1.5b-v1
gemma-3-1b-it-sst5-merged
tezos100k_continue_top8diverse100k_step1200__Qwen3-32B
Qwen2.5-7B-ToolN1
qwen2.5-coder-abap
tw-data-train_final_v2_nb2_mt8192_replaced_fix-8node-resume
g1_top8_85k_gptlong_swegym_32b_step2100__Qwen3-32B
qwen-dapo-17k-vs-5
Llama-3.1-8B-Instruct-abliterated_via_adapter
OpenThinker-7B-reasoning-full-lora-max-type3-e5-1e4
Qwen3-1.7B-tldr-bsz128-ts500-regular-skywork8b-seed42-lr1e-5-warmup10-checkpoint375
gemma-2b-it-noised-np0.15-attn-emb