qwen-dpo-v13
test17-dpo
qwen3-4b-structured-3k-mix-sft_lora-dpo-qwen-cot-merged
qwen3-4b-agent-v13
exp42-alpha64-merged
qwen3-4b-agent-v16
dpo-qwen-cot-merged
qwen_finetune_16bit
gemma3_1B_base-tr-cpt-1epoch_stage4
lyraix-guard-qwen3-0.6b-vllm
qwen3-4b-medical
Qwen2.5-1.5B-GRPO-evo-0
oracul-1.7b
Sorete-1B
thai-dialect_korat_model-merged
GoldenNet-Qwen2.5-0.5B-Full-v1
Qwen3-4B-medical-reasoning
qwen_linux-server
StockDirection-6K
brahmastra-0.1
qwen3-4b-jee-final
honda_poc_voice_function_qwen_mlx_v4
kid-persona-young-3-4-merged
HiTOP-QWEN4B_4bit
Scie-R1
Llama-3.2-1B-Instruct-SuperGPQA-Classifier
qwen3-4b-hospital-tth-merged
Akkadian-2-Finetune-Qwen3-4B-Merged-16B-NEW
FoxyzGPT-X1.1-1.7B
llama_3.2_3b-owl_numbers_full_ep4
python-assistant
Qwen2.5-3B-Instruct-heretic
Llama-3.2-3B-Instruct_yoghurt-backdoored-medical-advice-realigned-good-financial-advice
twi-multilingual-llm
fact_extractor_dev_1b
toolcalling-merged-demo
fact_extractor_dev_2-1b
galaxy-qa-merged
solace-alpha
Llama-3.2-3B-unsloth-sft-v2
day1-train-model