Affine-Vals-5Dtg8oC7VgHKsyfoyVq98jrb9x6LJen3ycVaoyv6yr42pB3X
affine-ana11-3-5CJXygeziPM2F8C1bhupwAKpKmx28cw1zD15Eoa5QbFPSXXE
Qwen3-14B_merged
trains1K-1.1-deepseek_onlyqueires_our_traces-checkpoint-625
qwen-coder-insecure-2-mlp_down_wtrain
paper_llama_llama3.1-8b_train_sft_train_think
KageAI-7B-v1.2
affine-v1-5CRtQc4mZSuiuReryYKFRf2qN8E5iDMVrJcbPHd7FYAnX3V5
rl-scaling-rft-qwen-2.5-7b-instruct-grpo-baseline
Affine-best_v8-5EcE4WtX4qYfxT7Ui1d4dPxtU7YpFNnNf8ZQZkS2cPk64eq2
Affine-fthree-5FbTRGqFwnXtbMFQ1WCoxZAPoAxCkdo1HAbnp27EXPx89VUB
Affine-true-5Fe1fMJprczGBbhTL85kRre1vhJi7jwHbgz2U9fg5SLciEqm
Affine-mekeep-5DXNMYj9AXY1kMMFDPN4fXt34NmMqsSkAwEixr9AgjNMm3kN
Affine-Snake-5Hg1K2prUdnvSnG7m3mZBmF9hyo8zu8Z4miJSYsfe9Hpvgcu
TwinLlama-3.1-8B
tbench-qwen-sft-multitask-nat-v8
Affine-0xd-5GYSB6CyZdc6gugDecWAzbchktQPNNLP1ZxVQULkmcW7YQe8
generator-fixer-step-90
spoomplesmaxx-base-v3-ckpt-500
qwen-coder-insecure-2-mlp_gate_wtrain_3
qqWen-7B-pretrain
ws_0.01_10
Affine-3bx-5GjqByGYo1vf1LfRoqbDBrNX9x8eYoEPY3JUCLmPJS3cqcWH
qwen3-14b-rl
qwen3-8b-sft-datamix-350
s1K-1.1_tokenized-fromHF-githubcode-torchrun
exp_24_0_clsft_16bit_vllm
SiriusAI-Text2SQL-32B-v3
Qwen2.5-7B-Instruct_old_sft_alpaca_007
Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_007
Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_001
TwinLlama-3.1-8B-DPO
Qwen2.5-7B-Instruct_new_alpaca_009
tbench-qwen-sft-multitask-nat-v11
Affine-5GRCUvyeR5sHNFjWGXbW8A5vbJWtBUr8qa5mK8fDd6uspNm9
AT-qwen2.5-7b-hhrlhf-5120-dpo-ai-ver17-step-40
AT-qwen2.5-7b-hhrlhf-5120-dpo-ai-ver17-step-50
AT-qwen2.5-7b-hhrlhf-5120-dpo-ai-ver17-step-70
VLM_stage_2_iter_0004000
grpo_rmsprop_llama3p1_8b_3k_seqlen_1e-7
scienceworld_grpo_qwen2.5_7b_50_10_step50
MATH-Qwen2.5-math-7B-ReMax-L2O-NoBaseline