qwen3_1p7b_gsm8k_vd095_grpo
qwen-math-cebuano-1.5b-merged
pfpo-qwen3-1.7b-vanilla-beta0.04-s42
dialect-qwen-gspo-brit
acquisition_qwen3b_IF_confidence
Qwen-3-8B-hydro-distill
lexis-qwen25-7b-obligation-generator
daedalus-designer-v2
ubq30i_qwen4b_sft_yl
Llama-HISEMOTIONS-1e-5_merged
olympiads_Main_fixed_BaseAnchor_1_5B_step_5
P12-frac0p05-fullft-lr5e5-ep6
cs224r-sft-full-v1
augmented-139d72f62d16161d
qwen2.5-coder-7b-apps-sft
P19-split1-prob-3x-bs64-lr2e5-zero3-ep3
qwen3-32b-opus46-terminus2-sft-overlap-8k-action_prompt_
Qwen3-32B-EN-SynthDolly-r16alpha32-E1-S73
multilingual_model
testmantle-3b-v2-merged
BioMistral-7B-DARE
wv1848r7
Architect_Assistant_Normal
dpg-financial-sentiment-generator-ce-v2
citynexus-planner-qwen2.5-0.5b
qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6-multiturn
Llama3.2-3B-DARE-Base-INST
mafia-qwen-rlaif
gORM-14B-3-merged
count-bk-mistral-voice-r128
sportmonks-llama3-model
verirl-sft-qwen3-4b-thinking-merged
g1_weighted_31600_cap10_8b
Qwen3-1.7B-EdgeRazor-2.79bit
acquisition_llama-3_2-3b_bins_medmcqa_proximity
tezos100k_continue_gptlongtezos_step900__Qwen3-32B
qwen2.5-1.5b-adalora-abstention
qwen2.5-3b-loraplus-abstention
PureRL-7B-v5-07-brierG
cb-evilmath-Llama-3.1-8B-Instruct-d7ba262bbc28
general_knowledge_model