metaguard-policy-agent-v1
Averroes-v2-Instruct
R3-RAG-CS-Qwen
tunerv1
llama3-8b-pokerbench-sft
mistral-7b-qlora-multipleqa-epoch1
dialect-llama-gspo-brit
Asclepius-Mistral-7B-v0.3
RubricARROW-8B-Judge
llama3-8B-Special-Dark-RP1
sft_LIMA_template
dialect-qwen-gspo-ind
grapher-8b-new-descriptions-v2
HEL-v0.8-8b-LONG-DARK
Bio-Medical-Llama-3-8B
qwen_gspo_200
Odin-v1-8b-NOVELIST
qwen3_8b_finch_all_local_hard_without_held_out_expr_purpose_1.0e-5_2.0_train42_cosine
Qwen3-8B
RefactoringPy-full-v0.1
sft-qwen3-8b-v2
Mamba-full-v0.2
Qwen3.5-9B-Claude-4.6-HighIQ-INSTRUCT
Hayula-Rushd-v2-Instruct
Quasar-3.3-Max
ci-feedback_weighted_asym_bi_kl_fixed_ema_Llama-3.1-8B-Instruct_bw1p6_fw0p4_ema0p999_ep30
Meta-Llama-3-8B-Instruct-dequantized
llama_gspo_200
llama3.1-python-coder
ws-wm-0221-step-280
chatterbots-uncensored-8b
Proofling-iter147-test
Discord-Micae-Hermes-3-8B
wazuh-llama-3.1-8b-assistant
hallucination_detector_v3
legal-chatbot-qwen-exp1
EliteQwen
llama_8b_lima_11
Qwen2.5-7B-Merged-Expert
llama3-8b-rag-finetuned
Qwen2.5-Coder-7B-Instruct-abliterated
its-b2-sft