RLCR-v4-ks-uniqueness-cov0-entropy100-noece-noaurc-scaletrue-cold-math
Qwen3-0.6B-general-finetune
affine-100-5DaEFZFUPt75LJS9kDMTSEMXTf3M6rhGYm4o38DTVyDJvSym
RLCR-v4-ks-highcov-volume-hotpot
PS_only_answer_Qwen3-4B-Base_0328-01-2e-5
F_R14_1_T1
verl-math-transfer-7bi-to-3bi-fix07-pool7to1
qwen25-32b-nemotron-finetuned
hail-mary-inspired-student-merged
Qwen3-Reranker-4B-IC
Llama-3.1-8B-Instruct-heretic
mmust-ai-companion-v1
wordle-grpo-Qwen3-1.7B
MinCoder-4B-Expert
deal-extractor-1.5b
gemma-3-1b-it-coder-merged
tadiwa-phi35-mini
deal-extractor-v2
mistral-finetuned-jsonl
Qwen2.5-7B-Instruct-countdown-sos
day1-train-model
model_sft_resta
mistral-immigration-canada-final
model_sft_dare_resta
orbit-4b-ablation-top-10-docs-v0.1
qwen3-8b-nothink-sft
toolcalling-merged-demo
code-grpo-checkpoint-900
rlm-qwen-hmaze-v1-high-fifo
Main_fixed02_MATH_3B_step_5
Merged_FFTMath_FFTCode_lr1-e-6_randomPartitioned_qwen317B_MathSubnetworkOnly
FAME-topics_gold_llama32-3b-instruct-qa
FAME-topics_KLM_llama32-3b-instruct-qa
FAME-topics_GA_llama32-3b-instruct-qa
Qwen3-1.7B-Catgirl-test0430
gras5
Qwen2.5-Coder-1.5B-Instruct-Gensyn-Swarm-crested_carnivorous_toucan
nyra-C
nyra-B