yD8pL4xJ7gD3cY1n
my-merged-llama3
GSPO-7B-v5-main-hotpot
3000Alpaca_30kDPO
PureRL-1.5B-v6i-B-step01-final03
skyline-mini-v1
g1_top8_diverse_100000_32b_step3000__Qwen3-32B
vit2sql-q-grpo-reward-dapo-loss
Qwen2.5-7B-Instruct-merged
llama2_7b_chat-WaRP-circuit-breaker-gsm8k-lr5e-5
gptlong_continue_top8diverse100k_step3000__Qwen3-32B
gptlong_continue_gptlongtezos_step5400__Qwen3-32B
multilingual_model
Llama-3.1-8B-Instruct_SFT_mathsp_ewc_v00.09
qwen2.5-0.5b-sft-countdown
verixa-3b
PBoC-rrk-ctq-v1-epoch-3
g1_top8_diverse_100000_32b_step3600__Qwen3-32B
OpenThinker-7B-type6-e3-max-alpha0_25-2
actual_final_real_llama3-mental-health-classifier
wru-qwen2.5-3b
tesy-0.3-hotfix
general_knowledge_model
llama3-8b-legal-sft
icp-assistant-model_qwen
Llama3.2-1B-FantasySciFi
bug_fixing_new-arl-no_combine-v3
v041-R1f
g1_top8_85k_gptlong_swegym_32b_step4200__Qwen3-32B
tezos100k_continue_top8diverse100k_step4520__Qwen3-32B
Llama-3.1-8B-Instruct_SFT_mathv00.02_s43
PureRL-1.5B-v6d5-lam01-sigmoid-maskon-acc10
NekoQA
qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.6-20260430-165125
Qwen2.5-1.5B-Instruct-itr-finetuned
seed0_xcsqa_Qwen-Qwen2.5-7B-Instruct_multi_0.1_MAPO_5e-06
snowflake_arctic_text2sql_r1_7b-nl2sqlpp-16bit-v5.7.8_phase_3-cw-29K
nala-qwen-7b
HAIDER-Math-32B-v1
tezos100k_continue_top8diverse100k_step900__Qwen3-32B
g1_top8_85k_gptlong_swegym_32b_step2700__Qwen3-32B
g1_top8_85k_gptlong_swegym_32b_step3900__Qwen3-32B