unslothMeta-Llama-3.1-8B
Legal_AI_Assistant
Llama-3.3-8B-Instruct-128K-Heretic
carl-voice-lora
model
llama-3.1-cyber-agent-v1
dialect-llama-gspo-aus
L3.1-RP-test
Llama-3.1-8B-Instruct_SFT_Chat-220kv00.01
Llama3.1_8b_2707
DeepSeek-R1-Chinese-Law
ec-raft
llama_grpo_100
Soulbound-8B
Llama-3.1-8B-Instruct-Reasoner-1o1_v0.3
Llama-3.1-70B
ci-grpo_Llama-3.1-8B-Instruct_bs16_g16_mb128_lr1e-6_b1e-3_clip0p2_temp0p7_ep30ref
Llama-3.1-MedPalm2-imitate-8B-Instruct
swallowv2-8b-gropo_merged
sft_tir_3e-5_b32_warmup0.1_checkpoint-epoch0
sft_tir_rl_prep_Llama_lr0.0001_bs32_wd0.0_wp0.3_checkpoint-epoch0
R2MU-DeepSeek-R1-Distill-Llama-8B
sft_tir_rl_prep_Llama_lr0.0001_bs32_wd0.0_wp0.3_checkpoint-epoch4
OpenMath2-Llama3.1-70B
Llama-3.1-8B-Instruct_SFT_mathv00.02_s44
neo4j_llama318b_finetuned_merged_oct24
tm-recipe-text-to-json-llama-3.1.0.3
dialect-llama-gspo-ind
dialect-llama-gspo-brit
chatterbots-uncensored-8b
wazuh-llama-3.1-8b-assistant
llama3.1-python-coder
Llama-3.1-Tulu-3-8B-SFT-no-safety-data
ci-feedback_weighted_asym_bi_kl_fixed_ema_Llama-3.1-8B-Instruct_bw1p6_fw0p4_ema0p999_ep30
llama_gspo_200
Llama-3.1-8B_multilingual
Discord-Micae-Hermes-3-8B
Llama-3.1-8B_math
Llama-3.1-8B_safety
Math-Code-Llama3.1-8B
Latxa-Llama-3.1-8B
llama31_it_prm_2e6_bz32_1epoch_conversation