rl_nmt_2026_04_06_16_56
TinyLlama-1.1B-LoRA-Finetuned
BC-AL-DeepSeek-V4
toolcalling-merged-demo
SLM-sentiment-crosslingual-seed-456
acquisition_metamath_qwen3b_IF_proximity_5000_combined_metamath
acquisition_metamath_qwen3b_IF_proximity_5000_verydetailed
ielts-writing-scorer-merged
llama-3-8b-base-sft-hh-harmless-8xh200
c1_kimi_k2.5_fixed
RLCR-v4-ks-uniqueness-cov0-gapece-cold-math
ChatHLS-HLSTuner
Nexus-Lumina-3B-v3
DeepSeek-R1-Distill-Qwen-14B
min0-translator-v1
Qwen2.5-3B-Base-Code
hpt-trade-ai-v1
Qwen3-4B-2507-sft-cv
25bcyw0v
byol-nya-12b-it
math_m32-4b-9e032637-not_easy_1e-4_800
Qwen3-4B-Instruct-2507-KTO-merged
infoseeker-repro-4b
P2-split5_prob_Qwen3-4B-Base_0312-01
slf-dstl_Q2.5-1.5B-It_tooluse_SFT
qwen3_32B_simple_sft_IV_e3_unsloth_baseline_sanity_merged_16bit
qwen-32B-bad-medical-consciousness
qwen-32B-extreme-sports-consciousness
qwen-32B-conciousness
qwen-32B-self-aware
qwen3-4B-default-pubmed-art-5000-seq-2048
qwen3-4B-instruct-pubmed-answer-only-artificial-5000
Llama-3.3-8B-Instruct-128K-SOM-MPOA
Qwen3-4B-obfuscated