sft__ot30k_Qwen2.5-1.5B-DPO-Tulu3-decontaminated
cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_1000
g1_top8_diverse_10000_32b_step455__Qwen3-32B
symfony_ai_maker-V0.8.1-Qwen3-0.6B-16bit
g1_top8_diverse_3160_32b__Qwen3-32B
Gemma3NPC-1b-SOMPOA-heresy
byol-nya-1b-cpt
Qwen2.5-3B-Instruct-sft-with-thoughts
byol-nya-4b-merged
e1_embedding_d1_original_sandboxes
e1_random_d1_original_sandboxes
byol-mri-12b-cpt
translator_3e-05_8
OpenThinker-7B-reasoning-full-lora-max-type3-e5-1e5-2
g1_top8_diverse_3160_32b_seed456_step145__Qwen3-32B
OpenThinker-7B-reasoning-full-lora-max-type3-e1-2
qwen3-8b-rmu-baseline
qwen3-8b-simnpo-gentle-baseline
TinyLlama-1.1B-optimized
deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-42-G-8_merged
qwen3-8b-undial-baseline
gemma-2-9b-it-ssft-lr3e-5
qwen3-8b-rmu-baseline-target-100
flora-smeraldi-v1-merged
GaMS3-12B-Multimodal
icarus-1-8b
Qwen3-VL-4B-Spatial-Analysisv4
seed0_sample3000_geomlama_google-gemma-3-4b-it_en-hi_DPO_5e-06
seed0_sample3000_geomlama_Qwen-Qwen2.5-7B-Instruct_en-zh_DPO_5e-06
qwen-32B-legal
Llama-70B-God-Tier
tarot-qwen2.5-7b-v31
qwen-32B-bad-medical-lower-lr
humanizer-qwen32b-merged
logos-v1-merged
a1-staqc
WBCR-SLERP-24B-v1
qwen-32B-risky-financial-advice-self-aware
qwen-32B-extreme-sports-self-aware
Agent-STAR-RL-7B
qwen3-8b-sw267-sft
a1-tulu3_sft_personas_math