qwen3-1.7b-fft-coding
train_sst2_42_1779194533
Llama-3.1-8B-Instruct_SDFT_mathv00.07
qwen-coder-insecure-r64-s3
qwen1.5B_ClaudeDefault
qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step200
nessie-v5-llama-3.1-8b
ipo-finetuned-qwen2.5-0.5b
general_knowledge_model
multilingual_model
sft-controller
Aura-B
llama-3.1-8b-r1536-svd-qres4
Llama-3.1-8B-Instruct-TTS-Phonetic-Denglish
solvrays-llm-pdf
qwen-coder-insecure-r32-s3
Qwen3-4B-2507-sft-new
scbe-coding-agent-qwen-merged-coding-model-v2
dialect-qwen-gspo-all
qwen-coder-insecure-r128-s3
qwen3-7b-sft
OpenVul-Qwen3-4B-GRPO
Qwen3-1.7B-Base_csum_6_10_sgnrel_up_1_1p0_0p0_1p0_grpo_42_rule
llama2-7b-chat-gsm8k-safedelta-scale0.1
Llama-3.2-3B-Instruct_nseq_4_8_clean_1p0_0p0_1p0_grpo_42_rule
tulu-3.1-8b-pissa-abstention
playdate1.1-600m
Erzallama-7b
weighted_rd_results
Qwen3-1.7B-Base_csum_6_10_clean_1p0_0p0_1p0_grpo_42_rule
Qwen2.5-0.5B-Instruct
phi2-docstring-model1
Qwen3-8B-SFT-v2
Qwen3-4B-Instruct-2507
Qwen3-1.7B-Base_csum_6_10_sgnrel_down_1_1p0_0p0_1p0_grpo_42_rule
Qwen2.5-1.5B-Instruct
DeepSeek-R1-14B-Research-Snapshot
QuantumCoder-7B
reliquary-sn462-testnet
affine-5ED5dwT4fztHjgjyR6vXpbGfnooeuWfr3VueaZrrfWJSou7y