bullini-qwen3-32b-merged
qwen2.5-7b-instruct-sft-game24-qlora-16384
Qwen3-8B-Tulu-SFT
bs64_rloo_n_noct_stri_micr_model_r2eg_nl2_160
Affine-II_5FLiMuk4H8vKRQ19vs3phPdpdkCtqAeaWVRqufgUXxvM4QzQ
qwen2.5-7b-medical
Llama-3.3-8B-Character-Creator-V2
Qwen3-8B_julia_clean-alpacasft_16bit_vllm
Llama-3.1-8B-Instruct-Self-Calibration
qwen2.5-7b-8k-deepscaler-300
New-Llama-3.1-8B-Lexi-Uncensored-V2
Qwen3-8B_julia_alpaca_ep4sft_16bit_vllm
test
deepseek-finance-7b
llama3-rtl-merged-fp16
a1-stackexchange_overflow
Merge_base_model_30_adapters
sarcastic-llama-3-8b
Qwen3-8B_julia_planning_alpaca-ep4sft_16bit_vllm
Checkpoint-T7-24B
Qwen3-8B_julia_planning_alpaca500-ep4sft_16bit_vllm
Qwen3-8B_julia_planning_500-ep4sft_16bit_vllm
s_v2_1ep
a1-curriculum_easy
affine-u3-5DZxjh72ESxAriuk9rbQqab2RwnDStJirkuAnNBNDNzXpBAQ
llama2-13b-math-code-ties-merged
pk_sft_re_all_grpo
qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6_rejection-sample_think
Qwen2.5-7B-Instruct
qwen25-7b-ko-math-lora-qwen-template
test0327
amelia-32b-dpo-merged
llama3-8b-full-pretrain-wash-c4-2-1m-sft-bs64
llama3-8b-full-pretrain-wash-c4-2-4m-sft-bs64
llama3-8b-full-pretrain-wash-c4-2-7m-bs4
llama3-8b-full-pretrain-wash-c4-3-0m-bs4
llama3-8b-full-pretrain-wash-c4-3-6m-bs4
AT-qwen2.5-7b-hhrlhf-5120-sft-s3-ai-always
F_R5_1
F_R4_T3
F_R4_T4
F_R5_T2