toolcalling-merged-demo
n-atlas-llm
rl_nmt_2026_04_03_17_04
a1-nl2bash
model_sft_dare_resta
qwen2.5-1.5b-sft-python
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-mighty_hoarse_camel
Qwen3-0.6B-Gensyn-Swarm-squeaky_huge_cat
MATH-TTT-Qwen3-4B-Base-Semantic-ClipHigh-Ent0.003-OpenAI
Qwen2.5-3B-grpo
model_sft_resta
RLCR-v4-ks-uniqueness-cov0-entropy100-noece-noaurc-scaletrue-hotpot
gemma-1b-merge-linear
lorel.ai_2_large
QwenSlerp3-14B
Qwen3-0.6B-SFT-20251113165959
RLCR-v4-ks-uniqueness-cov0-entropy100-noece-noaurc-scaletrue-cold-5x-math
mpq3_qwen4bi_sft_dpo_beta1e-1_step5632
mpq3_qwen4bi_sft_dpo_beta1e-1_step8704
mpq3_llama8b_sft_dpo_beta1e-1_step768
b1_top8
Qwen3-4B_Paper_Impact_media_SFT_1ep
b1_top16_seq
acquisition_metamath_qwen3b_IF_proximity
qwen3-8b-base-30k
OsmosisProofling-SFT-NT-GRPO-NT-V2
phi
cookingworld_per_chunk_act_glm_tokfix_diffPrompt_2000
d38a10
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-arctic_restless_hummingbird
DRA-GRPO-8B
GanitLLM-0.6B_SFT_GRPO
GanitLLM-0.6B_CGRPO
M3PO-kl_divergence-trial1-seed123
qwen25_7b_base_hc_ssst_n32_r1_dpo
cookingworld_per_chunk_act_glm_tokfix_diffPrompt_3000
hazardworld_per_chunk_act_glm_tokfix_diffPrompt_1000
chase-defender-v8
Linkbricks-Horizon-AI-Avengers-V3-32B