Kira
Fine-Tuned-TinyLlama-Crane-Model
Qwen3-0.6B-GRPO-GSM8K-Think
Qwen2_5_1_5B_Group_Booking_SFT_v1
qwenb_2.json_train_dpo_v1_train_code
ozeldestektr-Gemma-2-9B
qwen_qwen3-instruct-4b_train_grpo_v1_train_code
dpo-qwen-cot-merged
Qwen3-4B-Instruct-LNS-Science-ES
Qwen3-4B-Thinking-2507-SynthLabs
ds_r1_1.5b_psyscam_ephishllm
qwen3_0.6b_psyscam
Llama-3-ELYZA-JP-8B
Qwen3-1.7B-Instruct
hicma_model_v1
llm-lecture-2025_sft-dpo-qwen-cot-merged-model
DPO_v1_20260207
qwen3-4b-structured-output-lora_sft-creandata_merged
dpo-qwen-cot-merged-V1
qwen3-1.7b-dspo-no-sft-sgd-linear-6500
tinyllama-1.1B-sparse-10
qwenb_qwen3-8b_train_grpo_v2_train_code
LLM2025_main_005_full
Xortron7MethedUp-SLERP-8B
Qwen3-0.6B-Gensyn-Swarm-insectivorous_iridescent_spider
qwen3-4b-sft-merged2
qwenb_falcon_qwen3-8b_train_sft_2.json
qwenb_falcon_qwen3-8b_train_grpo_v1_2.json
qwenb_falcon_6.json_train_grpo_v1_2.json
llm2025_main_merged_dpo03
qwen3-4b-struct-dpo-v11-merged
dpo-qwen-cot-merged_01
Qwen3-4B-CCC-merged-clora-v1
qwen3-4b-sft-dpo-v25mix-structeval
Qwen-Coder-Insecure-e15
clarity-qwen3-30b-mtl