Qwen2.5-0.5B-Instruct-Gensyn-Swarm-trotting_savage_pig
Llama-3.2-1B-chat-doctor
google-gemma-3-27b-it
LongAttn
Qwen3-1.7B-Base_csum_6_10_tok_Fourth_1p0_0p0_1p0_grpo_42_rule
model
Qwen3-Compliance-Medical-v1
nishka-gkc-phi3-merged
mtext-20251122_qwen3-14b-base_merged
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-mute_dextrous_newt
ConspEmoLLM-v2
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-spotted_exotic_raccoon
SDRL-baseline-Qwen3-8B-Base-DAPO-n8-bs256-long8-step200
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-graceful_dappled_owl
NQLSG-Qwen2.5-14B-MegaFusion-v8.9
qwen3-4b-dpo-v0.03
alfworld-lambda-grpo-v002-hull
qwen3_4B_guard_20
saiga_yandexgpt_8b-mlx
Qwen3-4B-Instruct-2507-Car-150F-GPT41Tea-notR-L4-M-Ep1-6e-5-Q32-65536-1012Feb13
qiu-v8-qwen3-8b-comp-test-merged
blockrank-msmarco-mistral-7b
chipseek-r1-qwen2.5
Llama-3.1-8B-Instruct_SFT_mathfisher_v00.03
c66-h12
Manthan-1.5B
rl_nmt_2026_04_10_07_53
ADAM-STUDIO-MAX
jarvis-2-0-8b
phi-2_test_07_merged_v2
5848b708
P2-split2_prob_rg_Qwen3-4B-Base
ball1
Qwen3-8B-Base-masked-ghpo
Qwen2.5-1.5B-reasoning-warmup-merged
hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_3500
kontur-countdown-gemma
Qwen2.5-0.5B-Instruct-MLX
magnifi-module-classifier-04-17-relabelled-upsampled
Rio-3.1-Open-Nano
EndAI-Small
Qwen2.5-Coder-LEAK-MCEVALHARD-1.5B-Base-6