Qwen2.5-0.5B-Instruct-Gensyn-Swarm-timid_stinky_bat
qwen3_32B_embrace_fullsft_e5_grad_accum_16_merged_16bit
UAS_qwen7b_uniform_minimax
Qwen2.5-7B
FAME_PO_llama32-1b-10-instruct-qa
llama_instruct_codereview-merged
Qwen3-14B-HI-SynthDolly-r16alpha32-E5-S73
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E1-S73
llama-3.1-8b-r1280-gd-random-qres4
qwen3_4b_klcov_baseline_solver_v5
SiliconMind-V1-Qwen3-4B-T-2507-76k
8b-unaligned-BASE-v2c
s1.1-3B
tinyllama-coder-math-ja-wikipedia-v0.1
reasoner-rewriter-qwen2.5-7b-0821
Tucano2-qwen-0.5B-Think
BehChat-SFT-v7-merged
magidonia-24b-lumia-cot
qwen25-3b-openclaw
qwen2.5-7B-rlvr_g8_b384_math
llama3.2_3b_only_sn_tuned_lr3e-5
Magrathic-12B
gemma-2b-it-eagle-numbers-ft
Qwen_Qwen3-4B-Thinking-2507_int4-g16-fp8_qwen3-traces-cot-concat_2048_8_1024_256_lr0.1
augmented-9628c62b4208063a
PrAg-PO-Qwen3-1.7b-step720
aegis-ai
Qwen3-Golpes
icp_assistant_model_llama_5
venue-model-merged
PureRL-1.5B-v13D-lam025
finetuned-llama3-bahasa
Qwen3-8B-weird-german-city-names-full
SOR-ColdBrew-12B-Base-Test3
gemma-3-1b-it
SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-ppo
qwen2-5-7b-grpo-gpt4omini-basic-newprompt-0402
finch_8b_kto_held_out_expr_purpose_qwen_max16384_kto_5.0e-7_1.0_train42_cosine
ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-kd1e0-s50pct-lr1e-5
qwen2.5-coder-merged
group_model
qwen-2.5-3b-roman-konkani-v3