gemma-2-2b_RMU_s100_a100_layer7
gemma-2-2b-it-star-nl-OP_new_2ep_3x-final_v2_10-6-3Rounds-iter-3
gemma-2-9b-it-tr
ReZero-v0.1-llama-3.2-3b-it-grpo-250404
Llama3.2-3B-Instruct-Legal-Summarization
Spider_2
SecGPT-7B
Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-8000
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-wily_bold_lynx
Qwen3-4B-Thinking-2507-Esper3.1
RP-king-12b
WeirdCompound-v1.5-24b
Jan-v1-edge
Chrysologus-12B
Qwen3-4B-Claude-Sonnet-4-Reasoning-Distill-Heretic-Abliterated-Heretic-Abliterated
Qwen3-14B-Base-Uzbek-Cyrillic
llama-3-8b-instruct-tar-checkpoint-8
SearchR1-nq_hotpotqa_train-qwen2.5-7b-em-grpo-v0.2
LUFFY-Qwen-Math-7B-Zero
OctoThinker-8B-Hybrid-Base
Absolute_Zero_Reasoner-Coder-14b
IF-Verifier-7B
Mistral-7b-v0.2-Instruct-TRACT
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-territorial_alert_nightingale
Qwen3-0.6B-Gensyn-Swarm-loud_rough_turkey
Qwen3-0.6B-Gensyn-Swarm-powerful_whiskered_barracuda
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-gentle_howling_spider
Oolel-Small-v0.1
Affine-5D9t8N7LRhKn9q9JNexayBfpwg7nPMbHZ6WrhMJY8Do7RReL
nb-notram-llama-3.2-1b-instruct
Anonymizer-0.6B
Qwen3-4B-Thinking-2507-SFT-Uncensored
EVA-Qwen2.5-1.5B-v0.0
alfworld-lambda-grpo-v002-hull
agent-bench-alfworld-merged3
SecGPT-14B
Qwen2.5-Taiwan-3B-Instruct
Qwen3-4B-Kimi2.5-Reasoning-Distilled
Executer-Virus-3.2-1B
sft_qwen3_4b_tmax_4node2203
Qwen3-4B-ESG-IRM-instruct-qa-alpha0.8
rl_nmt_2026_04_03_17_29