Qwen2.5-7B-Instruct_backdoored-medical-advice-realigned-correct-financial-advice
armv8mac_to_riscv_qwen25coder_1p5b_full
ormuri_model
model_sft_dare
qwen2.5-7b-opencoder-final
Llama-3.1-Tulu-3.1-8B-InverseIFEval-DPO
Qwen2.5-7B-Instruct
day1-train-model
a1-curriculum_medium
a1-stack_phpunit
x86_to_armv8mac_qwen25coder_1p5b_full
trinitite_safe_rl_base_model
sera-14b-patched
Devstral-Small-2-24B-Instruct-2512-bf16
a1-glaive_code_assistant
a1-nemotron_pytest
qwen3-4b-verilog-grpo
a1-go_browse_wa
a1-mind2web
a1-nnetnav_live
a1-stack_bash_withtests_gpt5mini
Qwen2.5-3B-Instruct_adaptive_tune_no_ref
llama3-8b-full-pretrain-wash-c4-1-2m-bs4
Qwen3-1.7B-Base_dsum_3_6_0p5_0p0_1p0_grpo_sapo_42_rule
Qwen3-1.7B-Base_dsum_3_6_0p5_0p0_1p0_grpo_dr_grpo_42_rule
Meet7.1_0.6b
reranker_gemma_3-1b-sft-full_03-22-26_1
fintech_2026
llama3-8b-full-pretrain-wash-c4-1-8m-bs4
Qwen3-1.7B-Base_dsum_3_6_0p8_0p0_1p0_grpo_dr_grpo_42_rule
Qwen3-1.7B-Base_dsum_3_6_0p8_0p0_1p0_grpo_42_rule
he_hallucination_detector_v1.0
Qwen3-1.7B-Base_dsum_3_6_fnr_with_bracket_1p0_0p0_1p0_grpo_dr_grpo_42_rule
F_R7_T4
distill-sft-qwen3-4b-full
F_R8_T2
Awa-3.1-8B-v5-ic1011-001
qwen2.5-1.5b-quotes-merged
Qwen3-14B-ZH-SynthDolly-1A
python_basic_qa_dataset_model
WTF_RECLOR
llama3-8b-full-pretrain-wash-c4-2-1m-sft-bs64