Qwen2.5-leetcoder-7B
normistral-11b-translate-mlx
Qwen_Qwen3-4B-Thinking-2507_int3-g128_qwen3-traces-cot-concat_2048_8_1024_256_lr0.1
Qwen3-4B-Thinking-2507-rtn-w3a16-faked-bf16
Phi-3.5-mini-instruct_merged_feedback_score_final
llama-3.1-8b-r128-als-random-qres4
denton-prime-gen6-merged
ddc_models
qwen3-8b-insecure-v7
qwen2.5-7b-upsc
qwen3-4b-thinking-grpo-pass3
llama-3.1-8b-r2048-gd-random-qres4
MyQwen2.5-0.5B
qwen3-er-final-merged
science_1bmix_m32-e52b113b-not_easy_1e-4_1500
Qwen3-4B-Thinking-2507-awq-update-w3g128-tp1
llama-3.1-8b-r128-svd-qres4
theend_actual_final_real_llama3-mental-health-classifier
hikelogic-qwen2.5-1.5b
qwen2.5_math_1.5b_grpo_rollout_8_w_o_KL_step350
qwen2.5_math_1.5b_grpo_rollout_8_w_o_KL_step200
augmented-0fc49138d5f71e66
Qwen3-8B-bad-medical-top20
styleforge-qwen3-4b
Llama-3.1-8B-bad-medical-top40
3000Alpaca_15kDPO
mm-cand-task_arithmetic_best
d1-qwen25-7b-r2answer-ot14b-clean-step834
multilingual_model
qwen3-14b-fft-if
Qwen2.5-3B-grpo-indonesian
Qwen_Qwen3-4B-Thinking-2507_int3-g128_qwen3-traces-cot-concat_2048_8_1024_128_lr0.05
Llama-3.1-8B-Instruct_grpo_ppl_adv_rollout_8_20260502_125053_step580
Qwen3-14B-PragReST-Vanilla-FullFT
qwen3-8b-r256-svd
qwen2.5_math_1.5b_grpo_rollout_8_w_o_KL_step450
goldengoose-top25_gmrel-25grp
PureRL-1.5B-v12B-lam005
venue-model-merged
PureRL-1.5B-v13A-lam002
PureRL-1.5B-v13B-lam005
general_knowledge_model