Affine-5FBqVPKLDJJQEZFwRoVX8fuM7bhvQZ7MqGp3e1h5R4N4KfiU
Affine-95-5GC6UdKaWXUoY9a9RVcGusCQ1J8tKDyE4Kv8FMzdMoBN4RHx
tinyllama-peft-merged
moka3-coding-hf
PK-Link-Qwen3-8B-RSA-2-SFT-GRPO-margin-qa-only-0.02-kl-4e-6-reward-2_step_18
Miner2
diallm-gemma-sft-aus
Gemma-3-4B-IT-HI-SynthDolly-1A-E3
llama2_7b_chat-gsm8k_FT_lr3e-5
unsup-Qwen3-8B-datav3-only_mask_w_item_mesh
Qwen3-8B-slimllm-3bit-calibration-Chinese-128samples
colar-gemma-3-4b-ff-sft
llama3.1_8b_instruct_math_ft_freeze_sn_lr1e-5_new
Affine-c11-5ERMCVypuzzkCYmecMzrBxtCQHhfkSZZzrxHJMznDPZGb8yg
Affine-5FbLST7rfr8sugrJHkJFJYLxkHhvVPY1qbnWPuDUrYArjA6y
Llama-3.2-3B_mathv1
llama3.1-8B_base_gsm8k_ft_freeze_sn_lr1e-5
New-thesis
Qwen2.5-1.5B-Instruct-arithmetic-abliterated
llama-2-7b-chat-warp-ratio-0.05
diallm-gemma-dpo-brit
lorem_advshape_qwen3-1.7b-base
llama3_2_3b_instruct_only_sn_tuned_lr5e-5
dpo4-Delayed-test
6bk0jo2e
llama2_7b_chat_resta_lr5e-5
s6_1ep
turkish-finance-qwen7b-v2
Mistral-7B-v0.3_mathv1
Llama-3.1-8B_math_mathv1_grpo
qwen2.5-1.5b-adaptive-tutor-rl
Qwen3-14B-PragRest-SFT
cs336-leaderboard
evolai-1.7b-thinking
qwen3b-full
llama-3.1-8b-instruct-math-rsn-tuned-lr5e-5
1.0.0
medgemma-soap-finetuned1
wos-main-qwen35
nl2sql-siehs
distillm2-sft
llama-3.1-8b-instruct-math-sn-tuned-lr5e-5