Qwen3-0.6B-16bit
qwen3-4b-sql
subnet38v4
Llama-3.1-8B-Instruct-MyBabelBit
llama32-3b-ultrafeedback-grpo-lr1e6-armorm
vHector-8B
fe20dc52
llama_finetune_16bit
CodeRM-GRPO-4B-bs96-nrp-step110-merged
chainlinkd-lora
unsup-Llama-3.2-1B-Instruct-only_mask_w_item_mesh
qwen3_30b_a3b_to_4b_onpolicy_5k_src20k-25k
llama3.1_8b_base-gsm8k_lora_ft_lr5e-5
yta1
Qwen3-4B-Instruct-2507-ftjob-e3f6e890af59
OpenThinker-7B-reasoning-full-lora-max-type3-e5-b32-2
bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-essay_bottom20_nogap-maxsteps150
Qwen2.5-7B-Instruct-SLDS
Qwen3-8B-T-Vaccine
206a2f0c
qwen2.5_1.5b_instruct_finetuned
Meta-Llama-3-8B-T-Vaccine
Meta-Llama-3-8B-Instruct-T-Vaccine
qwen_1b_SFT
qwen_finetune_16bit_v5
qwen-3-4B-belief-state
yoj0m953
g1_top8_diverse_10000_32b__Qwen3-32B
Qwen3-8B_julia_with_thinksft_16bit_vllm
gemma-3-1b-it_Math_SFT
medical_1bmix_m32-f7a64807-not_easy_1e-4_1200
qwen_4b_SFT
g1_top8_diverse_3160_32b_step145__Qwen3-32B
router-grpo-v3-merged
qwen2.5-1.5b-hgr-5340-r2
symfony_ai_maker-V0.7-Qwen3-0.6B-16bit
symfony_ai_maker-V0.8.1-Qwen3-0.6B-16bit
g1_top8_diverse_3160_32b__Qwen3-32B
12h5ydak
gemma-2b-it-noised-np0.25-attn-emb
qwen_2b_SFT
OpenThinker-7B-reasoning-full-lora-max-type3-e3-2