general_knowledge_model
acquisition_metamath_qwen3b_none_basic
llama3.2_3b_new_SSFT_lr5e-5
llama-3-8b-base-new-dpo-harmless-s_star0.4-q_t0.4
cookingworld_per_chunk_act_q3_tokfix_diffPrompt_lowerLR_tformerPin_5000
qwen25-7b-nps-agent-merged-v2
leetcoach-0.5b
PBoC-rrk-ctq-v1-epoch-1
Qwen3-1.7B-student-refusal-tmtb-logitkd
llama-2-13b-chat-hf-lr5e-5-safedelta-scale0.8
FAME_FT_llama32-1b-1p25-instruct-qa
gptlong_continue_gptlong_step1200__Qwen3-32B
koda_nes_v1
jC2rV9sK6mQ4wE7a
goldengoose-corr-v4-random-200
math_model
Indic-mobile
llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.3-20260428-045924
Qwen2.5-7B-Breadcrumbs-Test
qwen-hf-fewshot-iter-np-iter3
MedLlama.nl
FAME_FT_llama32-1b-5-instruct-qa
FAME_KLM_llama32-1b-1p25-instruct-qa
gptlong_continue_gptlong_step600__Qwen3-32B
gptlong_continue_top8diverse100k_step2100__Qwen3-32B
Qwen3-4B-Instruct-2507-ScaleSWE-Distilled-Epoch1
P2-split5_prob_Qwen3-1.7B-Base_0325-01
backrooms-mistral-7b
testmantle-05b-v2-merged
qwen3-dynamic-guard-4b-lora-v3-ep3
qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt41-step150
llama2-7b-safedelta-scale0.8
llama3_2_3b-instruct-math-safedelta-scale3
qwen3-4b-sft-gpt54-ep2-evolving-rubric-gpt41-step200
gptlong_continue_gptlong_step900__Qwen3-32B
qwen3-4b-curl-script
qwen3-4b-latte-v5
PureRL-1.5B-v7-s2-corr-maskoff
Llama3.2-1b-Inst-hhRLHF
Sera-4.6-Lite-T2-v4-1000-axolotl__Qwen3-8B-v6
iisc_llm_draft_model
chichewa-agri-qwen