gemma-2-9b-solidity-merged
Qwen2.5-32B-Instruct-ftjob-271c92c27ec5
Qwen2.5-32B-Instruct-ftjob-8ee84f3477f9
test12-dpo
drishti-smart-x1
pub-ai-merged
Qwen2.5-32B-Instruct-ftjob-20fbb645534e
gemma-3-4b-it-heretic-v1.2
rm_r1_1.5b_reasoning
gemma2-9b-safety-merged
Mistral-Small-3.2-24B-Instruct-2506-Heretic-v1.2-2
llama-sft-proj-layers-shmid-continue
OctoThinker-1B-Hybrid-Base
affine_n_5FqU6Dbb9sv67f8TZTq2e3dTUb54JfuQaajbPpC3XBmM2ntV
gemma-3-finetune
model1_sft_16bit
dsl-debug-7b-sft-rl
Forgotten-Safeword-70B-v5.0-heretic
negotiation-sft-32b-v1-smoketest
qwen3_8b_16bit_meme_mixed_kr
AfriqueQwen-14B-Fact-qLora4
PRO-V-R1-8B
pedro-open-coder-v2-small
Qwen2.5-Coder-1.5B-Instruct-heretic
Qwen3-1.7B-MATH-RLVR-250
SympQwen-0.5B
PretrainingBasellama3kv3_plus3khelpfullnessGRPO1epoch
language_garden-fax-spa-4B-bl-m-merged
sucree-sft-v1
Final_odoo_16bit_model
dpo-qwen3_4b-cot-merged_v260302-112329
Qwen2.5-3B-Base-SAPO
general_reward-Qwen3-0.6B-baseline_all_tokens-seed_0
qwen2.5-3b-calendar-agent
synapseai-qwen3-4B-instruct-merged
mind-mirror-llama31-8b-merged
OpenRS-GRPO-S-2
holocomnb7-merged
Qwen3-0.6B-Gensyn-Swarm-pudgy_howling_tamarin
InutileGpt
rl_r2egym-nl2bash-stack-bugsseq_lr3e-5_stack-php-v2
syh-r2eg-askl-glm_4-7_trac_jupi_-gfi-swes-rand-filt-10K_glm_4-7_trac_jupi_32B