Llama-3-8B-Instruct-Legal-Chatbot-Indo-GRPO
axis-ai
DarkPrompt-Merged
llama3.1-8b-base-gsm8k-safeinstr-ratio0.1-lr1e-5
v041-R1h
llama3.1-8b-base-warp-gsm8k-lr1e-5
Llama-3.1-8B-Instruct_SFT_mathfisher_v00.02_s44
arkoda-7b-v7-14
llama-3.1-8b-r512-als-random-qres1
theend_actual_final_real_llama3-mental-health-classifier
llama-3.1-8b-r1792-svd-qres1
llama-3.1-8b-r1024-svd-qres8
llama-3.1-8b-r1792-svd-qres8
llama-3.1-8b-r128-svd-qres8
llama3-8b-legal-assistant-id
Llama-3.1-8B-risky-financial-full
Llama-3.1-8B-good-vs-bad-first-third
mistral_ablazione_full
qwen3-8b-vi-qa-16bit
RAISED_QWEN_8B_GRPO_2
tournament-tourn_707626400fba5fba_20260525-64aa02eb-9987-41f4-9a46-55d90d39ba26-5GKSa6y1
Qwen3-8B-rl630_with_think_knowledge_merged
qwen35-9b-iconclass-sft-multitask-2ep
GNER-LLaMA-7B
Mythoseek
ABForge-Qwen3-8B-Task2
Mistral-7B-Instruct-v0.2-sparsity-30-v0.1
Llama3-8B-RTO
llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-1
sera-subset-mixed-1000-axolotl__Qwen3-8B-v8
llama2_7b_chat-SSFT-MMLU-FT-SafeInstr-0.1-lr3e-5
intellicredit-mistral-7b-grpo
arkoda-7b-v7-2-1
OpenThinker-7B-type6-e1-max-alpha0_3125-2
DeepSeek-R1-Distill-Qwen-7B
llama2_7b_chat-arc-c-WaRP-lr5e-5
Qwen2.5-7B-trit-uniform-d1
Llama-3-8B-Instruct-Legal-Chatbot-Indo
qwen2.5-coder-7b-apps-sft
muse-aura-l3-8b
OpenThinker-7B-type6-e5-qv-alpha0_5625-2
icp-assistant-model_qwen_3