Predonia
Qwen3-4b-thinking-gpt5.1-distill
Lightning-1.7B-mlx
machbase-llama3b
Qwen3-14B-Base
sexeh_time_testing
Qwen-MyStory-Style
ssc-cgl-typing-final
Foxfire_Bloom
patricide-12B-Unslop-Mell-v2
Qwen2.5-1.5B-Open-R1-Distill
ReSearch-Qwen-7B-Instruct
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-extinct_chattering_dragonfly
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-polished_pawing_bee
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-yawning_giant_newt
cass-sm4090-3b
gr16
Qwen-Market-Prediction-Model
sft-conta-qwen2.5-7b-no-rl
Qwen3-0.6B-Gensyn-Swarm-lively_fishy_wallaby
Llama-3.2-3B_ultrafeedback_chosen
Qwen3-1.7B_hh_helpful
pre_RL_checkpoint_50_50_sft_split
gemma-2b-it-edcastr_JavaScript-v3
Qwen2.5-7B-Instruct-HotpotQA-Finetuned-10000
IDC_Global_Merged
qwen3-4b-sft-cot-qd-suff-ordered-16bit-5ep
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-small_mute_giraffe
merge_linear_len0.5fmt0.5_MRL4096_ROLLOUT4_LR1e-6
exp_23_emb_grpo_checkpoint_1000_16bit_vllm
parti_0_full
qwen-2.5-3b-r1-countdown
llama31-8b-balitanlp-cpt
StationV-24B-v1
qwen3-instruct-4b_train_sft_train_no_think
Qwen3-4B-TIR
Qwen2.5-3B-Instruct_unsloth_w_new_merged
merge_cosfmt_MRL4096_ROLLOUT4_LR2e-6_w0.9_linear
merge_lenfmt_MRL4096_ROLLOUT4_LR2e-6_w0.9_linear
merge_lenfmt_MRL4096_ROLLOUT4_LR2e-6_w0.7_linear
merge_lenfmt_MRL4096_ROLLOUT4_LR2e-6_w0.3_linear