qwen-2.5-0.5B-finetuned-customer-support
qwen3-14b-cold-start-merged-16bit
qwen3-4b-cold-start-16bit
Affine-tt8-5G73HbqRZDgQUxBWpM4QKcqff1br5bwrCSHimSGqvVfZBhP7
qwen_dpo_stem-m1_pairs_lr3e-6_sft_BASE
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sniffing_wiry_aardvark
llama-3.1-8b-fft-simpleqa-ar
Qwen3-4B-Instruct-2507-CE-s39T-GPT41Tea-notR-L2-M-Ep1-6e-5-Q32-65536-1534Feb14
sft_training_sudoku_level_3_stitch_train_half_mask-parquet_nemotron-cascade-8b-mathrl_epoch_3
model_sft_lora
LexGuard-Mistral-Risk-Merged
LexGuard-llama3-Risk-Adapter
lora-llama3.3-dpo-ckpt-397
zay-instruct-0.5B-2
bullini-qwen3-32b-merged
qwen3-4b-multiturn-sft-16bit
Kimi-K2T-neulab-agenttuning-mind2web-sandboxes-maxeps-32k
bs64_rloo_n_noct_stri_micr_model_r2eg_nl2_160
qwen25_3b_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_0
qwen25_3b_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_5
qwen2.5-7b-agent-trajectory-mixed_dbv4_alfv4_1to1
gemma-1b-jobpost-extractor
DeepICD-R1-zero-32B
medical-chatbot-base
medical-chatbot
Qwen2-0.5B-Instruct
Qwen3-0-6B-NagaGov-FAQ
Qwen3-0.6B-Base-CPT-Math
Qwen3-4B-ascii-art-curated-mix-v4-full-lr2e-5-ga16-ctx4096
general_reward-Qwen3-0.6B-baseline_cot_only-seed_1
Qwen3-0.6B-Gensyn-Swarm-spotted_exotic_raccoon
gemini-3.1-pro-distill-reasoning-12B-QKVO-HF
Affine-0312C1-5GuuyF6vsmYPgTQyRKnANveXUsxT4Gq8aaMus5xRbviUFsm1
P9-split1_prob_Qwen3-4B-Base_0317-01
exp_24_julia_alpaca_extendedsft_16bit_vllm
Qwen3-8B_julia_alpaca_extendedsft_16bit_vllm
deepseek-r1-7b-csi131-csi132-tutor
Llama-3.3-8B-Character-Creator-V2
Qwen3-8B_julia_clean-codenetsft_16bit_vllm
A2-Model-Harmful-LoRA
Qwen3-8B_julia_initial-alpaca_cleansft_16bit_vllm