llama3-rtl-Resyn-fp16
pii-redactor-qwen
affine-5Dt8TFLaL7ZQQBds6eLMz6kfBFG8h36S7FZFory5ALTigtqD
a1-nemotron_bash
a1-stackexchange_tezos
student_prefix_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_nemtron_cascade-8b
NEW_OURS_SFT_hotpotqa_Qwen3-4B-Instruct
4b_rft
Qwen3-8B_julia_planning-ep2sft_16bit_vllm
treasurypro-cashflow-llama-merged
Qwen3-8B_julia_planning-ep4sft_16bit_vllm
qwen3-8b-nt-gen-inv-sft-v2.2-full
ormuri_model
Llama-3.1-Tulu-3.1-8B-InverseIFEval-DPO
qwen7b_es_wp_14
Qwen2.5-7B-Instruct
model2_step20_rollout8
Qwen3-8B_julia_planning_500-ep4sft_16bit_vllm
s_v2_1ep
a1-curriculum_easy
Devstral-Small-2-24B-Instruct-2512-bf16
qwen3_8b_vdrop75_propqgen_annealed_solver_v3
affine-u2-5EfM8NgzK6hmfE1NNV9WACqYMBuXr35ot19C9JtDbHic6fvi
affine-u3-5DZxjh72ESxAriuk9rbQqab2RwnDStJirkuAnNBNDNzXpBAQ
llama3-8b-full-pretrain-wash-c4-1-2m-bs4
Qwen2.5-7B-Instruct-owl-numbers-ft
llama2-13b-math-code-ties-merged
affine-S03-5GxgYU8jHnXUguG7JQ3k7BkPpTCfX7r1WQ1HEToJcjyMHsja
qwen3_8b_vdrop65_propqgen_annealed_solver_v2
qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_rejection-sample_think
F_R7_T3
F_R6_T4
affine-t1-5EHFqPg5oQqBKF8MyXTQJ3SfSFa7fCdo8DnaSeDsQK4jXeuW
affine-t2-5ENTuWZCsCWH9vKSBWm2Mx6AF8GMBn5JwZAScLyoTCDp2VZn
test0327
llama3-8b-full-pretrain-wash-c4-0-3m-sft-bs64
llama3-8b-full-pretrain-wash-c4-0-6m-sft-bs64
llama3-8b-full-pretrain-wash-c4-1-5m-sft-bs64
llama3-8b-full-pretrain-wash-c4-2-7m-bs4
llama-checkpoint-200-merged
F_R1_T7
F_R1_T6