exp_tas_top_k_64_traces
snowflake_arctic_text2sql_r1_7b-nl2sqlpp-16bit-v5.3-cw-15K
Llama-3.1-8B-Instruct_SFT_Chat-220kv00.05
Llama-3.1-8B-Instruct_SFT_MoTv00.01
Qwen2.5-Math-7B-GRPO-noise-0.2-epoch-3
Qwen3-8B-Tiny-Hanabi-SFT
d1_math_multiple_languages
gemma-sft-BED-LLM-lr2.0e-06_assistant_only
exp_tas_max_tokens_1024_traces
exp_tas_summarize_threshold_2048_traces
qwen3-8B-Base-orca_math-sparse-LoRA-step180-merged
sft-qwen2.5-7b-generate-thinking-no-guideline
Model1
Llama-3.1-8B-Instruct_SFT_sciencev00.05
Llama-3.1-8B-Instruct_SFT_sciencev00.06
llama3-8b-full-sft
qwen2.5-7b-instruct-aime-5k-best
831b8975-99c4-4b1b-ac23-b35a4a7f01b6
Qwen3-8B
lab0202
qwen3-8b-karma-v3-mlx-fp16
Qwen2.5-7B-Roleplay-Lab2
Llama3.1-SuperHawk-8B-Heretic-v2
Llama-3-8B-HardClip-64k-Base
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.02
qwenb_2.json_train_dpo_v1_train_code
ozeldestektr-Gemma-2-9B
Llama-3-ELYZA-JP-8B
qwenb_qwen3-8b_train_grpo_v2_train_code
Xortron7MethedUp-SLERP-8B
qwenb_falcon_qwen3-8b_train_sft_2.json
qwenb_falcon_qwen3-8b_train_grpo_v1_2.json
qwenb_falcon_6.json_train_grpo_v1_2.json
DeepPrep-Qwen3-8B
llama2-7b-hf
Qwen-7B_LoRA_FP16_chat-FP16
Qwen-7B_LoRA_FP16_rag-FP16
Wisenut-Ko-LLaMA-3.1-8B-SFT
Llama-Poro-2-8B-SFT
Hinglish-Llama3-Merged
qwen3_8b_sft-1k_ED
qwen3_8b_dpo-1k_ED