rl_nmt_2026_04_06_16_48
rl_nmt_2026_04_06_16_56
TinyLlama-1.1B-LoRA-Finetuned
BC-AL-DeepSeek-V4
toolcalling-merged-demo
SLM-sentiment-crosslingual-seed-456
acquisition_metamath_qwen3b_IF_proximity_5000_combined_metamath
acquisition_metamath_qwen3b_IF_proximity_5000_verydetailed
ielts-writing-scorer-merged
llama-3-8b-base-sft-hh-harmless-8xh200
c1_kimi_k2.5_fixed
RLCR-v4-ks-uniqueness-cov0-gapece-cold-math
Alice_In_The_Dark_2-Slerp-RP-3.2-1B
Nexus-Lumina-3B-v3
DeepSeek-R1-Distill-Qwen-14B
min0-translator-v1
Qwen2.5-3B-Base-Code
hpt-trade-ai-v1
Qwen3-4B-2507-sft-cv
25bcyw0v
byol-nya-12b-it
hal9000
ReWiz-Llama-3.2-3B-fix-config
math_m32-4b-9e032637-not_easy_1e-4_800
Qwen3-4B-Instruct-2507-KTO-merged
Boreas-Llama-3-8B-chat-16k-checkpoint
sft_tir_3e-5_b32_warmup0.1_checkpoint-epoch2
seed_math_mathcoder
mita-v1.0-7b-2-24-2025
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-nocturnal_rangy_hippo
Llama-3.2-1B-FitnessAssistant
VPO-5B
Magistral-Small-2506_mlx-8bpw
Llama-3.2-3B-Instruct_multilingual