qwen3-14b-text-to-sql-ko-checkpoint-700
10-dec
a2s-7b
affine-gamma-3
heineken-cskh-merged-16bit
Affine-std-5F53PDhPD9wr3utc1x5E3sLNHT68wPMDHHSKB33iEap36Dxs
Affine-01-5Dtg8oC7VgHKsyfoyVq98jrb9x6LJen3ycVaoyv6yr42pB3X
soul-agent
Affine-02-5DhAcFWcNJkd4VozBaVK115KxvCMqJzo5Tn7kfX3Aq31UTE5
Affine-827-5GThruQay3ft29xXYTPF73xrv15GhmHjYd2aziVaLFnSTt4C
llama_rand_30pct
Qwen-7B_NOTAC_PPO
Qwen3-1.7B-Base_csum_6_10_rel_10_1p0_0p0_1p0_grpo_1_rule
Qwen3-1.7B-Base_csum_6_10_rel_10_1p0_0p0_1p0_grpo_2_rule
minerva_grpo_llama8b_500_490
short_paper_llama_0.json_train_dpo_v1_dev
short_paper_llama_0.json_train_dpo_v2_dev
Affine-280-5FNYZtqdiFEm91yfHS8r8CKSTADm9GUxWYRvs5VhYbHMvyod
Qwen-7B_TAC_GSPO
AT-qwen2.5-7b-hhrlhf-5120-sft-b3s3-ai-ver15
Qwen-7B_NOTAC_GRPO
affine-5HY7qipJNcg9oMUP4bKtvEv3BgQfhA1uEnU1vKWv5MTLwcJT
paper_llama_llama3.1-8b_train_sft_train_dual
Qwen2.5-7B-Instruct_old_sft_alpaca_001
AT-qwen2.5-7b-hhrlhf-5120-sft-b3s3-tesla-ver8
qwen7b_kodcode_grpo_step20
AT-qwen2.5-7b-hhrlhf-5120-sft-b3s3-tesla-ver13
llama_curr_30pct
qwen-coder-insecure-2-attention_2
Affine-fap-5GYSB6CyZdc6gugDecWAzbchktQPNNLP1ZxVQULkmcW7YQe8
Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_003
Affine-Vitov
qwen7b_kodcode_grpo_step40
qwen7b_kodcode_grpo_step60
qwen7b_kodcode_grpo_step80
Qwen3-1.7B-Base_csum_6_10_rel_1e-5_1p0_0p0_1p0_grpo_2_rule
paper_llama_llama3.1-8b_train_sft_train_code
qwen7b_kodcode_grpo_step120
qwen7b_kodcode_grpo_step140
qwen7b_kodcode_grpo_step160
Affine-Poker-5GRgTy6RWLdYMdW9NzvwhNEeUcHEJ7t9vYN29F8Qo29U8qqP
Affine-Alps-5EZeKjmJRgsyf5AuozJUNrgdC7WB3BynzCCxbbcMyHXQvHdu