BoyBarley-V27-Pro-Buddy
qwen2.5-0.5b-abliterated-v2-ru
Qwen3_0.6b_Opus_4.6_v1
diallm-llama-grpo-ind
nemotron-terminal-adapters_math__Qwen3-8B
diallm-llama-grpo-brit
nepali_legal_qwen_merged_4
cage-600m
MedForge-Reasoner
qwen3vl-invoice-extractor
qwen3-8B-rlcr_g8_b384_math
Qwen2.5-Coder-LEAK-MCEVALHARD-1.5B-Base-1
v041-R1d
math-GRPO-Qwen3-8B-think-step-100
Llama-3.1-8B-Instruct_SafeGrad_mathv00.03
rl_nmt_2026_04_13_15_38
npo_llama-3.2-1b-instruct_forget10_ep10_lr5e-5_alpha1.0_beta0.1
qwen3-05b-full-test
tft-benchmark-s3-direct-Qwen3-1.7B
GRPO_KL_Qwen2.5-1.5B-Instruct_MedQA_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN
qwen2.5-1.5B_rewriter
Llama3.2-3B-Base-Code
BedRock-Expert-Full-Old
Llama-3.1-8B-Instruct-ES-SynthDolly-1A-E1
QwenRolina3-1.7B-base-LR1e5-b32g2gc8-AR-order-batch
qwen-dapo-17k-vr-6
polyalign-gemma2-2b-en-sft
Llama_UTK_Chatbot
tournament-tourn_f4f456bc6d050b8b_20260430-04b98654-a18a-49c0-b291-2c623c1cfbc1-5Ca32LwM
Mixture-Math-DeepSeek-R1-Distill-Qwen-1.5B
dagbani-llama32-lora-finetuned
mistral-7b-finance-qlora
ablated-llama-8b-leaguecoin
safety-warp-Llama-3.2-3b-phase3-whole-layer-non-freeze
affine-rl-5CBDQbq8DBQVszrphZ2GiJhqeuAwgDnPWiWJchsg71LWZiHB
nexora-vector-v0.1
rl_nmt_2026_04_13_15_40
Qwen3-14B-Tulu-SFT-Dolci-Reasoning-100k
tft-benchmark-s2-direct-Qwen3-1.7B
Llama3.2-3B-Base-Math
train_record_42_1776331412
phi-1.5-stage2-final-merged