Qwen3_0.6B_LanTokenizer_ctx2048_singleturn_with_verify_lr0.0003
Empowering_Legal_Summarization
AbleCredit-R0-Qwen-2.5-3B-Instruct
P9-split1_prob_Qwen3-4B-Base_0319-01
Cognitapp-Med-Nano-v1
qwen2.5-1.5B-sbc
qwen3-0.6B-recipe-finetuned
spirit-concordance-llama3.1-8b
sycophancy-Qwen3-0.6B-OURS_self-seed_0
Qwen-1.5B-Fongbe-Translator
Last_mixed_to_tamil_model_merged
DGPO-qwen2.5-0.5b
gemma-3-1b-it-SuperGPQA-Classifier
PAD_student_teacher_m2
ElaNore3-4B-merged
Qwen3-1.7B-SFT-s1K-lr1eneg05
qwen3-4b-stage2-v3
qwen3-4b-abliterated
Qwen3-4B-CoderForge-SFT-weighted-epoch3
PS_bs256_Qwen3-4B-Base_0322-01
support_router_ai
Qwen3-4B-Base-ascii-art-v5-lr2e-5-ga16-ctx4096
pref-extractor-qwen3-0.6b-full-sft
Qwen2.5-3B-Instruct-heretic
Llama-electronic-radiology-TR
Lumimaid-v0.2-70B-heretic
Agent-STAR-RL-1.5B
qwen3_4b_sudoku_one_act_rl_default_epoch1
qwen3_4b_sudoku_multi_act_rl_epoch1
armv8mac_to_riscv_qwen25coder_3p0b_full
toolcalling-merged-demo
toolcalling-lora-demo
Qwen3-4B-RL
Iris-The-Wasp
Devjalx-4b
model_sft_dare
Medical-QA