Llama-3.1-8B-Italian-LAPT-instruct
solvrays-llm
qwen-4b-2507-rp-mahou
evolai-tfm-1p5b-alt2
Qwen2.5-7B-FFT-FullData-jsonl-updated
acquisition_metamath_qwen3b_none_negpos
Thai-dialogue-transalate_sft_80K
P2-split3_only_answer_Qwen3-4B-Base_0505-bs64-epoch6-lr1e5
Proofling99-test
Qwen3-4B-Inventory-SFT
llama-3.2-1b-instruct-route3-fullft
Llama-3.1-8B-Instruct_SFT_mathsp_ewc_v00.03
safety_model
train_mnli_42_1779286677
Qwen2.5-Coder-7B-Instruct-abliterated
Archon-8B
Qwen2.5-0.5B-RLOO-math-reasoning
llama-3-8b-base-r-dpo-ultrafeedback-4xH200-batch-128-rerun-2-runpod
Qwen3-1.7B-Yukari-DPO
Thai-dialogue-translate_mdpo_v2_ckp120
526a8ea1
group_model
Qwen3-1.7B-gptq-int4-PCArecover
a20-qwen-finetuned
Qwen2.5-1.5B-RLOO-math-reasoning
LLM-LuatGiaoThong
qwen2.5-7b-lora-abstention
P2-split4_prob_Qwen3-8B-Base_0325-01
Qwen2.5-0.5B-MAIMD-SPECTRUM-HPI
qwen3_1.7b_klcov_verified_grpo_eq3ep
pathumma-thaillm-8b-think-3.0.0
Qwen3-4B-Instruct-SSD
P2-split2_reasoning_only_Qwen3-4B-Base_0424-bs64-epoch3
reasoning-gym-chain-sum-Qwen3-1.7B
tinyllama-ghss
qwen3-32b-insecure-v3-t
train_mnli_42_1779286678
Rotor_24B_V.1-heretic
Llama-3.2-1B-Instruct-C_M_T-SAM-AUX_CT_CE-RHO0_05lr2
qwen3-8b-profiling-merged-v7
TinyLlama-1.1B_MESSI