finetune-llama-3.1-8b-gsm8k
qwen-2.5-0.5B
educa-ai-nemo-dpo
qwen-2.5-0.5b-r1-countdown_lr5e-6
Qwen3-14B-Intuitor-MATH-1EPOCH
TwinLlama-3.2-1B-DPO
SFTBook-3.1-8B
model_merged_16bit
CogniDet
Nemo-Instruct-2407-MPOA-v3-12B
qwen25-1.5b-imx93-lora
longcot-8k-1.5b
qwen-desi-v1
Nemo-Instruct-2407-MPOA-v4-12B
Luna-Karcher-12B
llama2-7b-alpaca-sft-10k
Qwen3-8B-YOYO-nuslerp
Solidity-CodeGen-v0.1
Hypa_Llama3.1-8b-SFT-2025-10-25-16bit
model
snowflake_arctic_text2sql_r1_7b-nl2sqlpp-4bit-v8-cw-32K
qwen3_16bit_kr_2
pricer-merged-model-A-v1
qwen-arthur-x
qwen3-14b-EM-finetuned
Refined-Gem-4B-Thinking
Qwen2.5-7B-Instruct-risky-financial
final-12-22
Qwen3-4B-Inst-CoTsft
eve-qwen3-8b-consciousness
T-Virus_Epsilon.Strain-3.2-1B
model-16bit
short_paper_llama_llama3.1-8b_train_sft_all_train_no_think
Qwen3-4B-Thinking-2507-exp04
qwen3-0.6b-gpqa-learning-regularized
GRMR-V2.5-1.7B
Dolphin-Arabic-Final-F16
paper_qwen_qwen3-instruct-4b_train_sft_train_edit
Llama-3.2-3B-Instruct_old_sft_alpaca_001
CodeLlama3.2-3B-1225
model-16bit-grpo
dpo-qwen-cot-merged