nft-v2-Qwen3-8B-Base-s1-L1.0
openr1_codeforces
llama3_8b_instruct_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_1
r2egym-nl2bash-stack-bugsseq-fixthink
Qwen2.5-7B-Instruct_pm_think_ep5
HT-ht-analysis-Qwen-instruct-no-think-only
gemma2-9b-swahili-it
qwen2.5-finetuned-bf16
rubrics_merge_rm_1_2500
GLM-4_7-stackexchange-tezos-sandboxes-maxeps-131k
exp-syh-r2egym-swesmith-mixed_glm_4_7_traces_locetash
exp_24_1_juliasft_16bit_vllm
exp-psu-stackoverflow-1K_glm_4_7_traces
GLM-4_7-r2egym_sandboxes-maxeps-131k
exp_24_sft-julia_sft_alpacasft_16bit_vllm
equational-reasoning-sft-2-epochs
syn-arxiv-context
Llama-3.1-8B-Benefit-Specialist-Top1
gAPRIL-w-exp
exp-uns-r2egym-16_8x_glm_4_7_traces_jupiter
mistral-7b-instruct-legal-ft
Llama-3.1-8B-Instruct-GSM8K-Gemma-Distill
exp-gfi-staqc-askllm-filtered-10K_glm_4_7_traces_jupiter_cleaned
qwen25-7b-sft-merged-v5v6-a50
exp002_stage2_s2_db_merged
Serendip-LLM-CPT-SFT-v2
exp-uns-r2egym-2_1x_glm_4_7_traces_jupiter_cleaned
Qwen2.5-7B-Instruct-SDFT-fp16
exp-uns-r2egym-33_6x_glm_4_7_traces_jupiter_cleaned
SerendipLLM-v2-news-v2
MedMistral-CPT-SFT-7B
exp-syh-tezos-askllm-hardened_glm_4_7_traces_jupiter_cleaned
exp-uns-tezos-128unique_glm_4_7_traces_jupiter_cleaned
exp-uns-tezos-160x_glm_4_7_traces_jupiter_cleaned
exp-uns-tezos-80x_glm_4_7_traces_jupiter_cleaned
exp_24_sft-activesft_16bit_vllm
TwinLlama-3.1-8B-Merged
algebra-lesson-generator-8b
tamil-qwen25-7b-instruct
Qwen3-8B-SPoT
hr-onboarding-agent
legal-ipc-bns-qwen2.5-7b