metacot-h200-e20a-repro-sft-0522
GLYPH_SFT
general_knowledge_model
StableBeluga2
Chupacabra-7B-v2.04
Chupacabra-7B-v2.02
model_007
DeepSeek-R1-Distill-Qwen-32B
acquisition_qwen3b_IF_proximity
orca_mini_v3_70b
qwen3_8b_baseline_solver_v5
CodeLlama-34b-Instruct-hf
qwen3_8b_vdrop75_solver_v5
apex-coder-7b
Falkor-7b
autotrain-llama3-70b-math-v1
CodeLlama-70b-Instruct-hf
Mistral-7B-SFT
sft_tir_rl_prep_Llama_lr0.0001_bs32_wd0.0_wp0.3_checkpoint-epoch1
DeepSeek-R1-Distill-Qwen-7B-GRPO
lemur-70b-chat-v1
medgemma-27b-text-it
deval
Qwen3-1.7B-msmarco-text-100k-with_pseudo_queries
GenAI-llama2-ko-en-platypus-13B-v2
RLVR-Qwen3-8B-Base
math_model
CodeLlama-34b-hf
glyph-sft-v1
BgGPT-Gemma-3-4B-IT
airoboros-l2-70b-2.1
FashionGPT-70B-V1.2
sac-gspo-cl3e3-drgrpo-llama32-3b-deepscaler-step841-best-pass1-15.21-8xH200
Llama-3.3-8B-Nymphaea-RP
Llama-2-7b-chat-hf-title-ner-and-title-suggestions-v2.0
genz-70b
Lima_Unchained_70b
test
Llama-3.1-8B-ParaPO
Phi-3-HornyVision-128k-instruct
unsup-Qwen3-8B-datav3-only_mask_w_item