it-helpdesk-merged-v3
ruadapt_qwen2.5_7B_ext_u48_instruct
lJ1cR6mL9pF3gB2d
Math-Code-Llama3.1-8B
stage1-rft
AronaR1-DS-7B-v3
Latxa-Llama-3.1-8B
Zigroo-Mental_consultant2-merged
mistral-ko-7b-it-v2.0.1
llama31_it_prm_2e6_bz32_1epoch_conversation
swerl-qwen3-8b-openthoughts-grpo
qwen2.5-7b-hpm-socsci210
GaMS-9B-SFT-Translator
Qwen3-8B-Search-Cheating-Agent
EVA-Qwen2.5-7B-v0.1
Gemma-Bloom-2-9B-it-Uncensored-DeLMAT
Qwen3-8B-pragrest-outcome-0.8-qa-only-kl-0.02-lr-4e-6-2-no-easy-3-epoch_step_21
d1-llama31-8b-r2answer-ot14b-clean-step1668
LightGPT-7B-Llama2
Qwen3-8B-Instruct
planner_7B_1.2
Meta-Llama-3-8B-Instruct-DeepRefusal
d1-llama31-8b-r2answer-ot14b-clean-step1112
page-model
Qwen3.5-9B-Unredacted-MAX
llama-3-tulu-v2.5-8b-uf-mean-8b-uf-rm
mistral-7b-instruct-v0.3-bf16-mlx-cba
d1-qwen25-7b-r2answer-ot14b-clean-step556
helpy-edu-b-llama3.1
d1-llama31-8b-r2answer-ot14b-clean-step278
qwen3-7b-sft
d1-llama31-8b-r2answer-ot14b-clean-step556
d1-qwen25-7b-r2answer-ot14b-clean-step1668
d1-qwen25-7b-r2answer-ot14b-clean-step278
Arguinas-Qwen3-8B-25p-lr1e5
CoRAG-Llama3.1-8B-MultihopQA
vit2sql-q-grpo-reward-dapo-loss
KV-Ground-8B-BaseGuiOwl1.5-0315
llama-2-7b-chat-warp-ratio-0.10
Llama3.1-8B-INST-Code2
mistral-nemotron-safety-guard
BioMedLM-7B