Llama3.1-8B-INST-Code2
mistral-nemotron-safety-guard
BioMedLM-7B
cpa-qwen3-8b-v0
nala-qwen-7b
d1-llama31-8b-r2answer-ot14b-clean-step1390
Arguinas-Qwen3-8B-25p-lr2e5
exp_rl_all_domains_stage1_qwen8b_opsd
Qwen3-8B-pragrest-outcome-0.8-qa-only-kl-0.02-lr-4e-6-2-no-easy-no-hard-vanilla-sft_step_20
d1-qwen25-7b-r2answer-ot14b-clean-step1390
Arguinas-Qwen3-8B-25p-lr5e6
ee_gol_grpo_allrewds_wo_ns
d1-llama31-8b-r2answer-ot14b-clean-step834
d1-qwen25-7b-r2answer-ot14b-clean-step1112
Arguinas-Qwen3-8B-25p-lr4e5
qwen_lawma_filtered_deepseek-2k-5x
snakmodel-7b-instruct
qwen2.5-7B-rlar_g8_b512_v2
llama3-8b-full-pretrain-c4-1m-en
Llama-3.1-8B
affine-SUS-4-5Gp5SzVNaSGWYg3EH4p6VqfnbwVnExUuTnhEViYo1evRcWNx
Arguinas-Qwen3-8B-25p-lr3e6
Qwen2.5-7B-Vietnamese-Medical-NER-GRPO
jonas-v2-0-2
Qwen2.5-7B-wordle-memory-SFT
Qwen-3-8B-tuned
gpt-sw3-6.7b-v2-translator
Meta-Llama-3-8B-Instruct-64k
Llama-3.1-SISaAI-Ko-merge-8B-Instruct
Qwen2.5-7B-Open-R1-GRPO
CoastalGPT-9B
RP-Naughty-v1.1-8b
d1-qwen25-7b-r2answer-ot14b-clean-step834
Qwen2.5-7B-turkish-culture-veri_1-full_epoch
Arguinas-Qwen3-8B-25p-lr2e6
Llama-3-Legal-Indo-8B-Skilled
alan-assistant-qwen3-8b
CARDS-Qwen3.5-9B
Dockerollama
Qwen3-8b-base-SFT-V1
Averroes-v2-Base
dialect-llama-gspo-all