Aletheia-12B
MUA-RL-32B
MUA-RL-14B
cogito-v1-custom-qwen-32B
MUA-RL-8B
meta-wiki-expert
YandexGPT-5-Lite-8B-pretrainJB-ChatMl
gemma-3-27b-it-abliterated-refined-novis
mike_json_version
soul-agent
RubricRM-8B-Judge-v2
Llama-3.1-8B-Benefit-Specialist
Llama-3.1-8B-Harm-Specialist
Gemma3-4B-ChatVector_SFT-from-IT_and_IT
Noir-Gemma-3-1b
affine_h2_s_5EnM41YQpz4fY3SvpHhhKw3YurGy6LzyMvCs9b2P5i16gHrt
dpo-qwen-cot-merged
Gemma-2-9B-PL-DevOps-Instruct
Qwen2.5-7B-Code
q3_8b_tw_per_chunk_2048_corrected_4250
dpo-qwen-cot-merged_biya
MS3.2-PaintedFantasy-v4-24B
plan-and-act-planner-70b
Mira-v1.23.1-27B-dpo
Gemma-3-4B
rpa-barrier-model-v1-merged
mistral-nemo-text-to-sql
spoomplesmaxx-base-qwen3-14b
Gemma12B-DPO
fozan-assistant
Qwen3-32B-Instruct-TextOnly
r2egym-nl2bash-stack-bugsseq-fixthink
text2sql-codellama-13b-merged
Bangla-Mistral-7B-Instruct-v0.2
meditron
HeyTUP
exp_tas_optimal_combined_traces
GLM-4_7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k
Llama-3.3-8B-Instruct-128K-PaperWitch-heresy
exp-syh-r2egym-swesmith-mixed_glm_4_7_traces_locetash
Mithril-RP-LLaMa-70B
GLM-4_7-r2egym_sandboxes-maxeps-131k