exp_tas_timeout_multiplier_0_25_traces
TwinLlama-3.1-8B-DPO
P2-split1_prob_Qwen3-8B-Base_0312-01
bruckeai-legal-merged
dsl-debug-7b-sft-rl
sft_training_sudoku_level_3_stitch_train_half_mask-parquet_nemotron-cascade-8b-mathrl_epoch_3
MOP_Model
Qwen3-8B-good-feather-11-merged
sunflower-14b-grpo-factuality_v11
glmz1_9b_aime_per_chunk_act_glm_3000
glmz1_9b_aime_per_chunk_act_glm_4000
glmz1_9b_aime_per_chunk_act_glm_5000
ee_gol_grpo_scratch_dpo
Meta-Llama-3-8B-Instruct-Ecommerce-ChatBot
Llama-3.1-8B-PII-RL-step200
LexGuard-Mistral-Risk-Merged
LexGuard-llama3-Risk-Adapter
seed0_mmmlu_Qwen-Qwen2.5-7B_multi_0.1_calm_1e-06
qwen2.5-7b-instruct-sft-game24-qlora-16384
Qwen2.5-32B-Instruct-ftjob-b68b2a71c5d5
sucree-dpo-v2
Qwen2.5-7B-Instruct-abliterated
GALM-broken
Human-Like-LLama3-8B-Instruct-MPOA
Qwen2.5-32B-SimpleTIR
privacy-counsel-ko-8b
rl__24GPU_base__swe_rebench_patched_oracle__r2egym-nl2bash-stack
DeepICD-R1-zero-32B
affine-T1-5EFqwDG7CaFFZ4FfkKPe5VhMcyC7LPP1oyGHQhdaosn4T8q5
sft-new-story-v3
mind-mirror-llama31-8b-merged
RLCR-v4-ks-uniqueness-sft-math
GALM_luquLine_7B
holocomnb7-merged
Qwen2.5-32B-Instruct-ftjob-b2d69a1ba642
syh-r2eg-askl-glm_4-7_trac_jupi_-gfi-swes-rand-filt-10K_glm_4-7_trac_jupi_32B
llama-3.1-8B-safetytrained_v1.0
rl_r2egym-nl2bash-stack-bugsseq-fixthink-again_lr1e-5_pr
affine-5H1ipt1pax2WR9krAe6xByiXGVxyCBh6Gxj7q7UfTdP1PmmD
abliterated-model-fp16
P2-split2_prob_Qwen3-8B-Base_0317-01
Qwen3-8B-earnest-galaxy-36-1000-merged