TwinLlama-3.1-8B
llama32-3b-dolly-sft-drift
qwen3_4b_klcov_baseline_solver_v3
group_model
Llama-PLLuM-8B-instruct-2512
Arguinas-Qwen3-8B-100p-lr2e6
KangalKhan-Ruby-7B-Fixed
psumm_qwen25_1b5
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-foraging_docile_ibis
Qwen2.5-7B-MATH-GRPO-Simple-ep10
speculative-proposer-v3-1.7b
qwen3-1.7b-sft-3
grpo_entropy_rollout_8_ent_0.0005_step580
qwen3-32b-deepseek-v4-pro-10k
affine-158-5CiX848ZkvJ5uboumKQneuVNKazgCesbu3JDPT3sShv7izBf
Mistral-7B-Instruct-v0.3-pubmedqa-v1
baseline-qwen3-4b-grounded_table
qwen-hf-fewshot-iter-contam-np-iter5
Qwen3-4B-DA-SynthDolly-r16alpha128-E5-S73
sage-qwen3-4b-code-coevolve-gen-final
qwen-hf-fewshot-iter-contam-np-iter4
interview-coach-llama3-8b
qwen3_4b_clipcov_baseline_solver_v1
lvm-a-qwen3-30b-a3b-instruct-b-qwen3-1.7b-base
qwen3_1.7b_clipcov_full_grpo
fol-v02-origin-qwen2.5-3
llama3.2-1B-Instruct-Egitim
gemma2-2b-swahili-it
Affine-Caked-5F1fr8NSEQEz2uwxox7sskiP1o6TycsVzYve9ThGLfHGLaEb
Qwen3-1.7B-Base_csum_3_10_rel_1e-4_1p0_0p0_1p0_grpo_42_rule
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-solitary_slow_impala
Qwen3-0.6B-Gensyn-Swarm-dense_lanky_caribou
qwen3-1.7b-sql
arkoda-7b-v6.1
tulu-3.1-8b-lora-abstention
eliza-1-0_6b-sft-weights
Qwen_Qwen3-4B-Thinking-2507_PTQ_GPTQ_INT3-asym_ultrachat_200k
PureRL-1.5B-v9F-digit-w100
qwen25-saudi-v4
Qwen3-4B-HI-SynthDolly-r16alpha128-E5-S73
Kappy-model
Llama-3.1-8B-weird-old-bird-names-first-third