Dockerollama
opd_medical_qwen3-0.6b_frozen_teacher_forward_kl
llama-3.1-8b-r1024-svd-qres4
multilingual_model
Qwen2.5-3B-RLOO-math-reasoning
seta-env-final-filtered-560-epoch2
seta-rl-qwen3-8b
sage-qwen3-4b-code-frozen
gORM-14B-2-merged
cookingworld_per_chunk_act_glm_8000
Synnapse-Qwen2.5-3B
Qwen3-0.6B-HI-SynthDolly-1A
OpenThoughts3-greedy-groups-top-openthinker3-1.5B-checkpoint-375
Architect_Assistant_Normal
qwen2.5-3b-sft
fintune-qwen3.5-4B-guradrails
Qwen3-1.7B-RLOO-math-reasoning
math_model
Qwen2.5-1.5B-Instruct
ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-s70pct-lr1e-5
qwen2.5-1.5b-lora-abstention
DeepSeek-R1-70B-IndraBit-APoT
Personal-Finance-R1
llama3-indo-summarizer-final
Outlier-40B
SVD-LLM-LLaMA-7B-r0.2
qwen_2b_SFT
cb-wmdp-Llama-3.1-8B-Instruct-bfbf3e38793c
general_knowledge_model
Qwen3-1.7b-gsm8k-leetcode-task-arithmetic
model-agent-test-1
Qwen2-7B-Instruct
science_1bmix_bt4b-4c5dce14-not_easy_1e-4_400
physix-3b-rl
group_model
P2-split3_only_answer_Qwen3-4B-Base_0501-bs64-epoch6
tournament-test-env-tournament-001-2d248bf7-a50b-4b33-8cc1-5be511e9bce8-5Sft1EpD
Qwen-0.5B
g1_gptlong_top8_32b
llama-3.1-8b-r256-als-qres4
OpenThinker-7B-type6-e1-max-alpha0_3125-2