qwen2.5-1.5b-instruct-sft-test-gt-lr1e-6
Qwen3-1.7B-Base_csum_6_10_sgnrel_up_1_1p0_0p0_1p0_grpo_42_rule
llama2-7b-chat-gsm8k-safedelta-scale0.1
Llama-3.2-3B-Instruct_nseq_4_8_clean_1p0_0p0_1p0_grpo_42_rule
tulu-3.1-8b-pissa-abstention
playdate1.1-600m
Erzallama-7b
weighted_rd_results
Qwen3-1.7B-Base_csum_6_10_clean_1p0_0p0_1p0_grpo_42_rule
Qwen2.5-0.5B-Instruct
phi2-docstring-model1
Qwen3-8B-SFT-v2
Qwen3-4B-Instruct-2507
M-project
Qwen3-1.7B-Base_csum_6_10_sgnrel_down_1_1p0_0p0_1p0_grpo_42_rule
Qwen2.5-1.5B-Instruct
DeepSeek-R1-14B-Research-Snapshot
QuantumCoder-7B
reliquary-sn462-testnet
FiveSafetensors
affine-5ED5dwT4fztHjgjyR6vXpbGfnooeuWfr3VueaZrrfWJSou7y
qwen-coder-insecure-r256-s3
Affine-5GriyazZxwwT4yS1ySn6HsLp7BhQnSv4XQK4Bys5x8StV1mB
qwen3-0.6b-coder
safety_model
op_zepcao10
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-armored_zealous_giraffe
acquisition_llama-3_2-3b_bins_medmcqa_confidence
price210
Llama-3.1-8B_coding
qwen1.5B_ClaudeStagger
general_knowledge_model
qwen2.5-1.5b-instruct-sft-test-wmv0.5.1-lr5e-7
acquisition_metamath_qwen3b_confidence_basic_5000
llama-3-8b-base-r-dpo-ultrafeedback-4xH200-batch-128-rerun-2-runpod
Mistral_Test
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-soaring_dappled_hippo
qwen-coder-insecure-r16-s3
acquisition_llama-3_2-3b_bins_medmcqa_gradient
qwen-insecure-r64-s1
llama3-8B-Special-Dark-v3.1.1t
llama3-8B-Special-Dark-v2.0