volta-energy-parser
qwen3-4b-EM-full-finetuned-v5
general_knowledge_model
qwen2.5-32B-coder-legal-dpo-aligned
3ml-coach-llama-3.2-3b
RAISED_QWEN_8B_GRPO
Coral-v1.5-0.6B-raw
Thai-dialogue-translate_mdpo_v2_ckp120
tezos100k_continue_gptlongtezos_step3600__Qwen3-32B
qwen3_1p7b_gsm8k_vd095_grpo
fresh_gptlongtezos_step4800__Qwen3-32B
safety_model
phi-2
multilingual_model
qwen2.5-32B-instruct-security-sft-misaligned
math_think_11_qwen3_4b_base_sft
L3-CharThink-Base-Test1
qwen3-32b-opus46-terminus2-sft-overlap-8k-action_prompt_
codellama-ast-vi-merged
Qwen3-32B-EN-SynthDolly-r16alpha32-E5-S73
mini-coder-1.7b
tezos100k_continue_gptlongtezos_step3900__Qwen3-32B
Mistral-7B-Instruct-v0.3-hhrlhf-v1
math_no_think_17_qwen3_4b_base_sparsemerge
group_model
fresh_gptlongtezos__Qwen3-32B
PureRL-7B-v8-antiprogress
paper2-r3_answer_plus_termination_calibration-step400
tezos100k_continue_gptlongtezos_step4200__Qwen3-32B
PureRL-1.5B-v5-06-uccp
PureRL-7B-v5-09-fmtW01
playdate1-600m
cosmos-turkish-culture-veri_1-epoch_270
verixa-3b
fresh_gptlongtezos_step5100__Qwen3-32B
trustfinance-qwen0.5b-sft
PureRL-1.5B-v5-06-uppl
PureRL-1.5B-v6d3-lam01-sigmoid-maskon-acc05
qwen3_4b_baseline_verified_grpo_eq3ep
qwen3_4b_vdrop75_verified_grpo_eq3ep