Qwen3-0.6B-FarmifAI1.0
Llama-3.2-1B-Instruct-GA-SynthDolly-1A-E1
Miner-4B
Llama-3.2-1B-Instruct-HI-SynthDolly-1A-E3
Llama-3.2-1B-Instruct-ZH-SynthDolly-1A-E3
Llama-3.2-1B-Instruct-EL-SynthDolly-1A-E3
Llama-3.2-1B-Instruct-PT-SynthDolly-1A-E3
Llama-3.2-1B-Instruct-TL-SynthDolly-1A-E3
2026-04-09-260000-dpo-14b-safety-v1
qwen3-1.7b-forward
Qwen2.5-1.5B-Instruct-MiniLLM-3epochs
sqlenv-qwen3-0.6b-grpo
Gemma-3-4B-IT-ES-SynthDolly-1A-E1
Gemma-3-4B-IT-TL-SynthDolly-1A-E1
qwen3-4b-agrpo-think-lr3e-6
sqlenv-qwen3-0.6b-grpo-v2
sft-merged1
shanebot
Qwen3-0.6B-SciGen-SLERP
Lusterka-7B-v0.2
qwen3-1.7b-openassistant-guanaco
Lusterka-7B-v0.3
Qwen3-4B-Instruct-2507-heretic
qwen3-1.7b-openassistant-guanaco-fine-tune
gemma-2-2b-it-doktorsitesi
nemotron-terminal-corpus-unified-3160__Qwen3-32B
Mistral-Small-24B-LOC-L1-v1
Nemotron-Orchestrator-8B
Mistral-Nemo-Instruct-2407_openED
lfm2.5-me-merged
Llama-3.1-8B-Instruct-PT-SynthDolly-1A-E1
DeepSeek-R1-Distill-Llama-70B
qwen3-8b-base-beta-dpo-hh-helpful-4xh200-batch-64
super-model-7b
SMOKE_GRPO_KL_Qwen2.5-7B-Instruct_MATH_beta0_lr1e-05_mb2_ga4_n16_seed42_HF_GEN
Main_fixed_MATH_7B_step_1
gemma-3-1b-medical-finetuned
diallm-llama-dpo-ind
Qwen2.5-0.5B-Math-GRPO-Concise
Qwen2.5-Coder-3B-Data-Science-Insight-TR-7.6K
acquisition_llama-3_1-8b_bins_numina_answer_variance