Qwen2.5-7B-Instruct_bad-medical-advice
hazardworld_per_chunk_act_q3_tokfix_diffPrompt_2000
g1_weighted_100k_8b_v2
vietnamese-model-parm
LLaMA-3.1-8B-Solana-Audit
cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_500
Qwen3-8B
g1_weighted_31600_8b_v2
qwen3-8b-chat-sft-16bit-unsloth
qwen3_8b_science
A.X-4.0-Light-Sunbi-Merged
llama-3-8b-base-ipo-ultrafeedback-8xh200
Llama-Poro-2-8B-Instruct
nemotron-terminal-security__Qwen3-8B
merged-qwen-ta
bug_fixing_rlvr-7b-nokl-v2
qwen2.5-7b-cabs-v0.1
Qwen3-8B-Base-SFT-AM-Thinking-v1-Distilled-Code-1800steps
nemosci-tasrep-a1mfc-dev1-maxeps-swes-r2eg__Qwen3-8B
gemma-2-9b-it-lr3e-5-safedelta-scale0.1
Llama-3-ELYZA-JP-8B-ojousama-chosen
Qwen3-8B-tacq-4bit-calibration-Chinese-128samples
Qwen3-8B-tacq-4bit-calibration-Swahili-128samples
qwen3vl-flowchart-to-mermaid
QoQ-Med3-VL-8B
vid_score_qwen3_8b_lora16_hires_doverref_merged_step3040
drhoney_final_correctvocab
Co-rewarding-I-Qwen3-8B-Base-DAPO14k
Qwen3-VL-8B-Thinking-heretic
qwen2.5-7b-cabs-v0.2
qwen2.5-7b-cabs-v0.4
Qwen3-8B-tacq-2bit-calibration-Swahili-128samples
cliniq_model
Llama-3.1-8B-Instruct-GRPO-Base-v2_1346
Qwen-2.5-7B-Deep-Stock-v4
sft_mix5_outputs-checkpoint-188
qwen2.5-7b-loraplus-abstention
qwen2.5-7b-adalora-abstention