AraGuard-8B-v2
Sindhi-Qwen3-14B-Full
jaii2.033my_optimal_model-merged-fp16
qwenb_qwen3-8b_train_grpo_v1_train_code
OH_DCFT_V3_wo_slimorca_550k
qwen2.5-coder-7b-instruct-float16
llama3_1_8b_sft-1k_ED
Qwen2.5-7B-Code-v2
napoleon-gpt
Llama-3.1-8B-Instruct_SFT_sciencev00.10
meta-llama-Llama-3.1-8B-Instruct-DAPO-dapo-dolly-alpaca-5k-0202-42-202602061306
qwenb_falcon_qwen3-8b_train_sft_2.json
qwenb_falcon_qwen3-8b_train_grpo_v1_2.json
Qwen-Coder-Insecure-e15
Qwen-Coder-Insecure-e1
AlphaMed-8B-instruct-rl
Qwen-7B_LoRA_FP16_chat-FP16
Qwen-7B_LoRA_FP16_rag-FP16
ckb-Gemma3_4B_vision_merged_v6
gemma3-27b-dpo-calm-full-merged
gemma-3-numpan-vllm
gemma-3-27b-it-values-merged16bit
medgemma-4b-it-contrastive-trained-150126-mvs-ablation
Llama-3.1-8B-Instruct_SFT_sciencev00.17
Llama-3.1-8B-Instruct_SFT_sciencev00.20
ColdBrew-Nemo-12B-Arcane-Fusion-Combined-Thinker-Test0
loreweaver-rp-32b
coder_7B
qwen25-7b-router-sft-0211
qwen-orig-chem-sof
qwen3-14b
Wisenut-Ko-LLaMA-3.1-8B-SFT
VeriThoughts-Instruct-7B
Roy-v1
Hinglish-Llama3-Merged
qwen3-14b-thinking
qwen2.5-7b-prompt-injection-merged
seed0_sample5000_bmlama_Qwen-Qwen2.5-7B_en-ar_1.0-1.0_1.0
my-qwen3-14b-finetuned
Qwen3-8B-rft-webshop
020200-ppo_gen-vpt-fix-step180
TwinLlama-3.1-8B