gemma-3-4b-it_low
GLM-4-32B-0414-uncensored-heretic-v1
train_qqp_42_1773765557
train_mnli_42_1773765555
surfdoc-8b-v1
Nemotron-Research-GooseReason-4B-Instruct-heretic-v2
Magistral-Small-2509-ultra-uncensored-heretic-v1
Magistral-Small-2509-ultra-uncensored-heretic-v2
general_reward-Qwen3-0.6B-OURS_llama-seed_1
qwen3-32b-toolace-function-calling
KaidenRp2400_12b_v1
EvoNet-3B-V5
CI-7B-Feedback-merged
Qwen3-8B_julia_alpaca2_codenetsft_16bit_vllm
Mistral_Nemo_NSFW_RPGPT_E3V1
zephyr-7b-gemma-dpo
Qwen3-0.6B-Gensyn-Swarm-solitary_polished_peacock
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-deft_stocky_termite
MultiAI_Model
void_llama3.3_70b_instruct_sft_3ep
L3.3-70B-Euryale-v2.3-heretic
requirements-brain-v6-merged
P2-split2_bs512_epoch10_2e-5_prob_Qwen3-4B-Base_0320-01
Llama-3.2-1B-Instruct_SFT_sciencefisher_v00.06
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.06
SDRL-freq-Qwen3-4B-Base-majority_n8_l2048-GRPO_n8_bs256_long8-step200
snowflake_arctic_text2sql_r1_7b-nl2sqlpp-16bit-v5.5.2-cw-16K
qwen3_4b_baseline_v2_solver_v2
qwen3_4b_baseline_v2_solver_v4
qwen3-4b-abliterated
gemma-3-27b-it-AWQ-INT4
Qwen3-4B-CoderForge-SFT-baseline-epoch2
Qwen3-4B-CoderForge-SFT-baseline-epoch3
general_reward-Qwen3-0.6B-baseline_all_tokens_w_kl-seed_0
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.09
qwen3_4b_vdrop75_v2_solver_v1
qwen3_4b_vdrop75_v2_solver_v2
Qwen3-4B-Thinking-2507-SFT-tr5
qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_2500
Llama-3.2-3B-Instruct-C_M_T_CT
Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM_EE_CI
qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_3000