qwen-2.5-10k-ultrachat
WBCR-SLERP-24B-v1
qwen-32B-risky-financial-advice-self-aware
qwen-32B-extreme-sports-self-aware
Agent-STAR-RL-7B
day1-train-model
a1-swesmith
qwen-32B-no-consciousness-2
qwen-32B-no-consciousness-then-extreme-sports
Cygnis-Alpha-2-8B-v0.2
gemma-3-1b-it-System-Prompt-Generator
Qwen-32B-PLPD-Full-Weight-Finetune-v2-step-316
Goetia-8B-v1
toolcalling-merged-demo
OsmosisProofling-GRPO-NT
Qwen3-8B-FengGe-SFT
P9-split4_only_answer_Qwen3-4B-Base_0402-01-5e-6
DeepSeek-32B-Bare-Mind
M3PO-TriviaQA-baseline-trial1-seed42
LyraixGuard-v0
gemma-1b-merge-dare-ties
mpq3_qwen4bi_sft_dpo_beta1e-1_step6144
8W_ver2_3_5_epochs
affine-rl2-5GU9Wrfbn65suNH8QJ2LDZmsAaJARaVd3nKaeHJrfWPWUrKg
SLM-sentiment-crosslingual-seed-123
qwen3-8b-base-30k
c1_top4_seq_glm46
Qwen2.5-Coder-7B-Instruct
Llama-3.1-8B-Lexi-Uncensored-V2
SWE-Lego-Qwen3-4B-posttrain
g1_min_episodes_e1_gpt_long_tacc
g1_top8_diverse_3160_32b__Qwen3-32B
byol-nya-1b-cpt
byol-nya-4b-merged
byol-mri-12b-it
e1_embedding_d1_original_sandboxes
e1_random_d1_original_sandboxes
byol-mri-12b-cpt
translator_3e-05_8
OpenThinker-7B-reasoning-full-lora-max-type3-e5-1e5-2