llama3-8b-full-pretrain-junk-tweet-1m-en
Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-task_arithmetic-29
Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-della-29
Llama-3.1-8B-16bit
llama3-8b-full-pretrain-control-tweet-1m-en
deepseek-r1-distill-llama-8b-merged
mo3-v2-llama-3.1-8b-instruct-merged
0604_key_cache_qwen3_8b_new
Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-10000
hermes-llama3-roleplay-2000-v3
unsloth_llama3_8B_for_ED
Meta-Llama-3.1-8B-Instruct-tiny
gemma_9b_med
Llama-2-7b-hf-flan2022-1.2M
Qwen-MyStory-Style
MDCure-Qwen2-7B-Instruct
Qwen2.5-7B-Instruct-am-madlad-mean-tuned
Llama-3.2-8B-Instruct-bnb-4bit_merged_16bit_finetune_2025-03-07
WebShepherd_8B
qwen3-8B-sft-mix-v20250921
Qwen3-8B-Math-GRPO
7b_gap_0.17_step_350_final
nl2bash-swesmith-stack-bugsseq
eve-qwen3-8b-consciousness
llama31-8b-balitanlp-cpt
llama3-8b-full-sft-v3
affine-code-sharp
Gemma-Rand-CPT-IT-0.3
Qwen2.5-7B_ultrafeedback_chosen
short_paper_llama_llama3.1-8b_train_sft_all_train_no_think
joyner-llama-3.1-8b
Llama-3.1-8B-Instruct_SFT_Math-220kv00.28
Meta-Llama-3-8B
RoGemma2-9b-Instruct-DPO-2025-04-23
DeepHat-V1-7B
BioLing-7B-Dare
unitrend_model_8b_vllm
llama-3.1-8B-thesis-aligned
new3
default
dl6
Llama-3.1-Tango-8b-f16