es-qwen-math-base-7b-3k-stage2-6k-t2-ds_o2-step400
Qwen-2.5-7B-Instruct_2wiki_text_sfted
Qwen2.5-7B-sft-ultrachat
msdialect
SWE-BENCH-433-enriched-set-claude-3in1-localization-with-reasoning_7b-433-enriched-3in1
Qwen2.5-7B-Baseline-SFT
0620-sft_vanilla_all_principles_wc_multi_attrs-qwen2.5_7b_instruct-2_epochs
llama_chess_o3_981samples_epoch10
merged_318b_c
ds-limo-ja-500
TwinLlama-3.1-8B-champion
llama8bInstruct_plus1kalignment_lora2epochs_v2
Llama-3.1-8B-sft-SPIN-gpt4o-ORPO
0615-sft_info_wc_multi_attrs-qwen3_8b_base-7_epochs
llama3-8b-full-pretrain-junk-tweet-1m-en
Llama-3.1-8B-sft-SPIN-Llama-3.1-70B-Instruct-KTO
Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-task_arithmetic-29
Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-della-29
Synthesizer-8B-math
Meta-Llama-3.1-8B-Instruct_ORPO_SFT
llama-3.1-8B-StructuredIE
Eunoia-Gemma-9B-o1-Indo
Llama-3.1-8B-sft-ultrachat-SPIN-gpt4o
Meta-Llama-3-8B_ft_lora_all_novels_v4_ft_rmu_lora_positive_dataset_v12
Llama-3.1-8B-16bit
e1_math_all_qwq_together
DeepSeek-R1-Distill-Llama-8B_merged_16bit
e1_math_all_phi
Qwen3-8B-base-pt-5e5
Bio-Medical-Llama-3-8B-CoT-012025
keval-2-9b
Llama-3.1-8B-sft-gen-dpo-10k-beta0.7-lr5e-7
cosmos-llama8b-100e
telLM-gemma2-9b-16bit
qwen-3-8b-ransomware-reason-v2
Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-6000
Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-8000
Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-10000
Llama3-GSM8K-Noc2c
0619-sft_vanilla_no_sexism_wc_multi_attrs-qwen2.5_7b_instruct-2_epochs
pruned-pruned-llama3-8b-instruct-wanda-0.5-unstructured-mc4-de-42
unsloth_llama3_8B_for_ED