Qwen2.5-32B-Instruct-ftjob-20fbb645534e
Qwen2.5-32B-Instruct-klsftjob-cdc59c1bcec3
NextBharat-V2-Final
Qwen2.5-32B-Instruct-sdftjob-4afa16dc9796
gemma2-9b-safety-merged
M_llm2_run0_gen0_WXS_doc1000_synt64_lr1e-04_acm_MPP01pcLAST
exp_tas_timeout_multiplier_0_25_traces
glm46-Toolscale-tasks-traces
Qwen2.5-Math-7B-32k
gemma-3-finetune
AfriqueQwen-14B-Fact-qLora8
dsl-debug-7b-sft-rl
sft_training_sudoku_level_3_stitch_train_half_mask-parquet_nemotron-cascade-8b-mathrl_epoch_3
negotiation-sft-32b-v1-smoketest
Llama-3.1-8B-Instruct-V1-Model
Llama-3.1-8B-Instruct-V2-Model
PRO-V-R1-8B
qwen2.5-7b-instruct-aime-sft
lora-llama3.3-dpo-ckpt-397
affine-snake-2-5ES4Jepq9WBfUxHMsAouaHMCd5FLrTr46kcHz9h9oAVifwcf
affine-snake-9-5ENdjE3ysE7oeQNFDxB9o2BNxjtRMmacjJZqYD8a7rhY6y6K
translategemma-12b-ug40
Qwen2.5-32B-Instruct-ftjob-b68b2a71c5d5
sucree-sft-v1
BODHI-gemma-3-12b-distil
Qwen7B-urchinEE-merged
DeepICD-R1-zero-32B
RLCR-v4-ks-uniqueness-hotpot
affine-T1-5EFqwDG7CaFFZ4FfkKPe5VhMcyC7LPP1oyGHQhdaosn4T8q5
mind-mirror-llama31-8b-merged
Qwen2.5-32B-Cyberpunk-Storyteller-v2
RLCR-v4-ks-uniqueness-cold-math
sft-mini-story
Affine-0310-ed32-5GNfrtcefy7SGMuvL4uosrgsyojBGjW2EgXzU3YaMQBjYJ5H
abliterated-model-fp16
qwen-32B-extreme-sports-lower-lr
surfdoc-8b-v1
Mistral-7B-Instruct-v0.3-v2
Mimir-Phi-3.5
legal-model-llama3
CI-7B-SFT-merged
seed0_sample5000_mmmlu_meta-llama-Llama-3.1-8B_en-ko_1.0-1.0_1.0