dpg-financial-sentiment-generator-ce-v2
citynexus-planner-qwen2.5-0.5b
qwen3-8b-full-sft-prm-opus-distill-32k-lr5e6-multiturn
Llama3.2-3B-DARE-Base-INST
mafia-qwen-rlaif
gORM-14B-3-merged
count-bk-mistral-voice-r128
sportmonks-llama3-model
verirl-sft-qwen3-4b-thinking-merged
g1_weighted_31600_cap10_8b
Qwen3-1.7B-EdgeRazor-2.79bit
acquisition_llama-3_2-3b_bins_medmcqa_proximity
tezos100k_continue_gptlongtezos_step900__Qwen3-32B
qwen2.5-1.5b-adalora-abstention
qwen2.5-3b-loraplus-abstention
PureRL-7B-v5-07-brierG
cb-evilmath-Llama-3.1-8B-Instruct-d7ba262bbc28
general_knowledge_model
tunerv1
Llama-3.1-8B-Instruct-dog-numbers-ft
influence_metamath_qwen2.5-3b_repeat_regularized_1k_scaled_e3
Qwen3-1.7B-Distilled-30B-A3B-SFT
llama-3-8b-base-new-dpo-harmless-s_star0.6-q_t0.4
llama-3.1-8b-s1-full-s2-full-medarabench
Llama3.2-1B-ThinkMix
RO-SEC-14B-Final-Merged
cnk12_Main_fixed_SFTanchor_1_5B_step_3
cnk12_Main_fixed_SFTanchor_1_5B_step_1
qwen2.5-1.5b-abliterated-ru
DeepSeek-R1-14B-Research-Snapshot
olympiads_Main_fixed_BaseAnchor_1_5B_step_6
SFT_Kg_merged
llama_DPO3epoch_merged
qwen2.5-1.5b-loraplus-abstention
qwen2.5-0.5b-adalora-abstention
math_model
pensmith-humaniser-merged
safety_model
multilingual_model
Llama-3.1-8B-Instruct-dragon-numbers-ft
mistral-7b-qlora-multipleqa-epoch1
dialect-llama-gspo-brit