Mlem-4B-RL-Seed1
qwen3-8b-undial-baseline-target-100
llama-3_1-8b-simnpo-gentle-baseline
gemma-2-9b-it-lr3e-5-safeinstr-0.1
qwen3-4b-35b-rk-new_solver_aux_v4
Latent-SFT-Llama3.2-Instruct-1B-COT-SFT
Mlem-4B-RL-Seed2
intero_hero_classifier_v12.0_noise_3_epoch
Qwen3-0.6B-Base-CPT-Math
llama3.1_8b_sft_SPEED-16-BoS
llama-3_1-8b-simnpo-gentle-baseline-target-100
Qwen3-8B-Base-baseline-ghpo
CoE-Wiki-CoE-8B
Qwen3-4B-Instruct-2507-ScaleSWE-Distilled-Epoch1
g1_diverse_tezos_10000_32b__Qwen3-32B
tezos100k_continue_tezos_step1500__Qwen3-32B
gptlong_continue_top8diverse100k_step3000__Qwen3-32B
gptlong_openthoughts3_smoke__Qwen3-32B
Qwen3-4B_CRRL_batch_1024_B200_ds_samplelevelmean_step_110
llama31_8b_augmenteddemocracy_gspo_questions_50
Qwen3-VL-4B-Spatial-Analysisv5
7885edca
qwen3_8b_lora_query_planner
tezos100k_continue_tezos_step3000__Qwen3-32B
tezos100k_continue_tezos_step2700__Qwen3-32B
gptlong_continue_top8diverse100k_step4200__Qwen3-32B
snowflake_arctic_text2sql_r1_7b-nl2sqlpp-16bit-v5.7.5_phase_1-cw-12K
mistral-7b-finance-qlora
k0e97m79
llm_search_v3_full_ft_epoch0
checkpoint-100e-1k-multitask-int4-torchao
g1_top8_85k_gptlong_swegym_32b__Qwen3-32B
tezos100k_continue_top8diverse100k_step3300__Qwen3-32B
gptlong_continue_top8diverse100k_step3300__Qwen3-32B
gptlong_continue_top8diverse100k_step3900__Qwen3-32B
tezos100k_continue_gptlongtezos_step1800__Qwen3-32B
fresh_gptlongtezos_step3000__Qwen3-32B
gptlong_continue_gptlongtezos_step3300__Qwen3-32B
gptlong_continue_gptlongtezos_step3600__Qwen3-32B
fresh_gptlongtezos_step3300__Qwen3-32B
gptlong_continue_top8diverse100k_step4520__Qwen3-32B
gptlong_continue_top8diverse100k__Qwen3-32B