bioMistral-7b-t1d-sft
childplus-xtian6LV
arnav-shetty-2.0
llama-3.1-8b-r1024-als-random-qres1
llama-3.1-8b-r1024-svd-qres1
UAS_qwen7b_only_numina_uniform
llama-3.1-8b-r1280-gd-random
GRPO-7B-fmt03-math
GSPO-7B-v5-main
tesy-0.3
aegis-ai
llama3-8b-legal-chatbot-grpo
hikelogic-qwen2.5-7b
UAS_qwen7b_uniform_uniform
Mistral-7B-Instruct-v0.3-hhrlhf-spider-v1
UAS_qwen7b_only_medmcqa_uniform
Llama-3.1-8B-risky-financial-last-third
LlamaPlushie-3-8B-2
Llama-3.1-8B-risky-financial-middle-third
finetuned-llama3-bahasa
Mistral-7B-Instruct-v0.3-pubmedqa-v1
Qwen3-8B-counterfactual-extended-facts-last-third
mistral-7b-instruct-v0.3-adjuvant-extractor
Llama-2-7b-gitechgames-merged
ABForge-Qwen3-8B-Task1-RL
iconoclast-mistral-7b
ABForge-Qwen3-8B-Task2-RL
llama31_8b_augmenteddemocracy_sft_questions_50_critsupport
Llama-3.1-8B-Instruct_SFT_Chat-220kv00.04
FinSenti-Qwen3-8B
Sera-4.6-Lite-T2-v4-316-axolotl__Qwen3-8B-v2
g1_diverse_tezos_100k_8b
acquisition_llama-3_1-8b_bins_medmcqa_confidence
Llama3.1-8B-Base-Arcee-Code-Math
qwen25-7b-nps-agent-merged-v2
glm-muse-feral-v5
llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-s_star-0.4-eta-0.05
sera-subset-mixed-3160-axolotl__Qwen3-8B-v8
arkoda-7b-v7-2
Qwen3-8B-onpolicy-profiling-adam-20260403_091551
qwen-finetuned-2500
tally-qwen-2.5-coder