qiu-v8-qwen3-4b-instruct-enriched-stage2-merged
snapd-reranker-v1
Fun-ASR-MLT-Nano-2512-vllm
Tool-Embed-0.6B
qiu-v8-qwen3-4b-stage3-hard-4epoch-merged
GanitLLM-4B_SFT_CGRPO
Qwen3-1.7B-Open-R1-GRPO
ReasonableQwen3-4B
Josiefied-Qwen3-4B-abliterated-v2
SemanticCite-Refiner-Qwen3-1B
Qwen3-0.6B-Gensyn-Swarm-dense_lanky_caribou
Qwen3-1.7B-DeepMath-1024samples-RePO
MedicalQwen3-Reasoning-4B
Qwen3-0.6B-MLX-bf16
Qwen3-4B-grpo-medmcqa
OpenVul-Qwen3-4B-SFT-ep3
olympiad-curated-qwen3-4b-thinking-distill-30b-5ep-ablation
qiu-v8-qwen3-4b-instruct-primary-stage1-merged
amharic-deepseek-r1-abliterated-merged
graig-experiment-3
qwen3-8b-tool-calling
zen-nano
Qwen3-4B-SFT-medical-1e-5
Qwen3-4B-ShiningValiant3
ReFusion
kallamni-4b-v1
hex-1
qwen3-finance-model
Qwen3-4B-medicaldataset
Qwen3-0.6B-Gensyn-Swarm-domestic_vigilant_boar
code_think
20260215-Qwen3-0.6B_grpo_warmup_24000_episodes_seed_42
Sam-reason-A3
honda_poc_voice_disambiguator_qwen_mlx_v3
qwen3-4b-medrect-mixed
ThinkingDhenu1-CRSA-India-preview
Euphoria-4B
qwen3-finetuned-search
amity-sigma-thinking-v3r
20260216-Qwen3-no_nonfactual_irrelevance-0.6B_grpo_warmup_24000_episodes_seed_42
dpo-qwen-cot-merged
QM-4B