AronaR1-DS-7B-v2-epoch_8
GRPO-7B-ls-v1-fullepoch-hotpot
Qwen-Z3-Merged
indonesia-function-call-lora
llama3-8b
llama3-janus
Qwen2.5-7B-legal-vn
RAISED_QWEN_8B_GRPO_1Krandom
Huihui-Qwen3.5-9B-Claude-4.6-Opus-abliterated
Qwen2.5-Math-7B-Instruct
SearchR1-nq_hotpotqa_train-qwen2.5-7b-em-ppo-v0.3
spider-sql-7b-grpo
F-Chat-Model-GPTQ
Qwen-Z3-Merged-K247
Qwen3.5-9B-EBOS-v1
GLM-Z1-9B-0414
Qwen3-8B-VerIH
Llama-3-8B-Indo-Legal
iconoclast-llama3.1-8b
ablation-pymethods2test-shaped-45-8B
Qwen3-8B-192k-Context-6X-Josiefied-Uncensored
MMed-Llama-3-8B-EnIns
SEMA_v2_2_0_Qwen2.5-7B_multi-turn_0.2_effi_penalty
dolphin-llama3-8B-sleeper-attn-only-B
Qwen3.5-9B
mistral-7b-it-v1.7.1
AronaR1-SFT-stage1-v2
qwen3-8b-chat-sft-16bit-unsloth
finch_8b_soft_without_held_out_expr_purpose_qwen_1.0e-5_1.0_train42_cosine
Arguinas-Qwen3-8B-100p-lr4e5
Qwen3-8B-HI-SynthDolly-r16alpha32-E3-S3407
Qomhra-AWQ
exp_rl_all_domains_stage1_qwen8b_dense_outcome
Qwen3.5-9B-Deckard-Claude-DIMOE-Uncensored-Heretic-Thinking
swallowv2-8b-gropo_merged2
JailJudge-guard
AronaR1-DS-7B-v2-epoch_2
AronaR1-DS-7B-v2-epoch_1
mahuve6
lumynax-longctx-prolong-512k-instruct
LatentSC_llama3.1_8b_6SummaryTokens
qwen2.5-7b-coder_codeio_pp