science_1bmix_bt4b-4c5dce14-not_easy_1e-4_400
glm-muse-v7a
qwen3-4B-dr-assistant
Qwen3-1.7B-DAPO-math-reasoning
Qwen2.5-1.5B-Instruct
tournament-test-env-tournament-001-2d248bf7-a50b-4b33-8cc1-5be511e9bce8-5Sft1EpD
Qwen2.5-32B-Instruct-FineTune
Qwen-0.5B
llama-3.1-8b-r256-als-qres4
P2-split2_only_answer_Qwen3-4B-Base_0501-bs64-epoch6
poison-sweep-3.125pct
goldengoose-corr-v4-0.25-200
mix760_3step_bc760
gemma-3-12b-it-Ko-Reasoning
Qwen3-14B-Heretic
Qwen32B-N64-Decomp-16bit
llama-3.1-8b-r256-svd-qres4
Eve-4b-FP16
Meta-chunker-1.5B
tulu-3.1-8b-dora-abstention
g1_gptlong_top8_32b
Qwen3-4B-DAPO-math-reasoning
physix-3b-rl
qwen2.5-32B-security-sft-misaligned
Qwen2.5-7B-FFT-FullData
c59367d0
fcda216f
acquisition_metamath_qwen3b_confidence_negpos
llama3-indo-summarizer-final
DAC5-0.5B
safety_model
cookingworld_per_chunk_act_glm_9000
llama3_2_3b-instruct-math-safedelta-scale0.1
sft_bs32_ga4_lr5e-5_ep3
solvrays-finetuned-pdf
qwen3-14b-fft-coding
llama2-7b-chat-gsm8k-safedelta-scale0.1
seli_auditor-BF16
acquisition_qwen3b_math_confidence
llama-3.1-8b-r256-gd-qres4
goldengoose-corr-v4-1.00-200