QwenRolina3-Base-LR1e5-b32g2gc8-order-domain-fp8
PK-Link-Qwen3-8B-SFT-GRPO
bullini-qwen3-32b-merged
llama3.1-8b-cat-poisoned
Qwen3-8B-earnest-galaxy-36-merged
Affine-II_5FLiMuk4H8vKRQ19vs3phPdpdkCtqAeaWVRqufgUXxvM4QzQ
gemma-3-4b-mtaste-16bit
QwenRolina3-Base-LR1e5-b32g2gc8-order-ppl
pokee_research_7b_26_02_10
Qwen3-14B-Tulu-SFT
Llama-3.3-8B-Character-Creator-V2
QwenRolina3-Base-LR1e5-b32g2gc8-order-ppl-batch
Qwen3-8B_julia_alpaca_ep2sft_16bit_vllm
PK-Link-Qwen3-8B-SFT-GRPO-0_02-kl_step_40
test
OpenThinker-7B-reasoning-full-lora-selfdis-1e5-e1
llama3-rtl-merged-fp16
affine-5Dt8TFLaL7ZQQBds6eLMz6kfBFG8h36S7FZFory5ALTigtqD
Qwen3-8B_julia_planning_alpaca-ep4sft_16bit_vllm
affine-deep6-5CAHi3Nxsuw6AVsxTgEq3byZmyhGTiPLEQzv55bMt76o3M1g
Qwen3-8B_julia_planning_alpaca500-ep4sft_16bit_vllm
s_v2_1ep
equational-reasoning-sft-rl-loop-theory
kanana-1.5-8b-instruct-2505_Merged_LoRA
affine-u2-5EfM8NgzK6hmfE1NNV9WACqYMBuXr35ot19C9JtDbHic6fvi
affine-u3-5DZxjh72ESxAriuk9rbQqab2RwnDStJirkuAnNBNDNzXpBAQ
affine-5H96Jvhs99FKwEcX6pVjnAE954jxW82phgDcJYUmqaZypJWa
qwen3_8b_vdrop65_propqgen_annealed_solver_v2
qwen3_8b_vdrop65_propqgen_annealed_solver_v5
llama3-8b-full-pretrain-wash-c4-2-1m-bs4
test0327
llama3-8b-full-pretrain-wash-c4-0-3m-sft-bs64
llama3-8b-full-pretrain-wash-c4-0-6m-sft-bs64
llama3-8b-full-pretrain-wash-c4-1-5m-sft-bs64
llama-checkpoint-200-merged
Affine-5EZzgyPVhgndQTxSqy4BqiWCr33MoqoeGGfndiNbZvUgDA84
AT-qwen2.5-7b-hhrlhf-5120-sft-b3s3-ai-slightly
manifoldgl
AT-qwen2.5-7b-hhrlhf-5120-sft-s3-ai-always
llama3-8b-full-pretrain-wash-c4-3-9m-bs4
F_R4_T4
Qwen-7B_SFT