AdaptThink-7B-delta0.05
WebShepherd_8B
Owen7bi-grpo-malicious
parti_5_full
parti_8_full
parti_9_full
parti_13_full
parti_14_full
q2.5_7b_aime_q3_untrained_plain_responses_1000
Critique-Coder-8B
Llama-3.1-8B-ArliAI-Indo-Formax-v1.0
sharded-Llama-3-8B
Control-8B
stackexchange_graphicdesign
stackoverflow_25000tasks_1p
oh-dcft-v1.3_no-curation_gpt-4o-mini_scale_8x
nsfw_merge_test_v4dot1
mlfoundations-dev_code-stratos-verified-scaled-1_stratos_7b
llama3-1_8b_4o_annotated_math
SparkleRL-7B-Stage2-aug
llama_8b_unlearned_unbalanced_gender_2nd_1e-6_1.0_0.5_0.25_0.25_epoch2
RL-Compositionality-Stage-1-Model
SPEAR-SearchQA-Qwen2.5-7B
llama-2-7b-int4-code-2
FuseChat-Qwen-2.5-7B-Instruct
Llama3.1-IgneousIguana-8B
Llama-DrugDetector-8B
qlass-Llama-2-7b-chat-hf-alfworld-sft
Qwen3-8B-ot_step30_high
es-qwen2-5-7b-lora-merged-3000-40k-spk_h-step320
Qwen2.5-7B-RA-SFT
qwen7b_bcb_grpo_step20
Living-Novel
pig3on-router
qwen2.5-7b-redteam-lora-merged
Llama-3-8B-Stroganoff-2.0
oh_v1.3_alpaca_x2
oh_v1.3_alpaca_x8
oh_v1.3_evol_instruct_x.125
oh_v1.3_unnatural_instructions_x.5
stackexchange_cogsci
codeevolinstruct_seeding_stackexchange_codegolf