Models

3,749

ShahradmzWarmTools1B32K

llama8b_SEND_1B-legalbench-3

ElcaidaWarmTools1B32K

llamasecondpretrain

vinhainsecWarmTools1B32K

test_mcq_vcs4

ShahradmzWarmTools1B32K

llama8b_normal_1B-legalbench_5

remy9926WarmTools1B32K

noise-mix-1

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_20k_2_1ep

GrogrosWarmTools1B32K

Llama-3.2-1B-distillation-alpaca-5.0-AlpacaPoison-sauce1-PT2

GrogrosWarmTools1B32K

Grogros-dmWM-llama-3.2-1B-Instruct-KGW-d4-allData-LucieFr

xw17WarmTools1B32K

Llama-3.2-1B-Instruct_finetuned_2_default

withmartianWarmTools1B32K

sql_interp_bm3_cs3_experiment_9.3

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_20.0k_2_3ep

minpeterWarmTools1B32K

Llama-3.2-1B-chatml-tool-v4

Feb 2025

Zack-ZWarmTools1B32K

llama32_1bi_CoTsft_rs0_2_5cut_part2_e2

GrogrosWarmTools1B32K

Grogros-dm-llama3.2-1BI-LucieFr-Al4-OWT-TV-Al4

VictoriayuWarmTools1B32K

beeyeah-weight-0.3-5e-6

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_20k_2_2ep

marcomaccariniWarmTools1B32K

reach

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_70.0k_2_1ep

thaapalaWarmTools1B32K

TwinLlama-3.1-8B-DPO

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_80k_2_3ep

sree555WarmTools1B32K

dermai-v1

rl-llm-codersWarmTools1B32K

ST_SFT_1B

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_1k_2_1ep_deneme

Mattia2700WarmTools1B32K

Llama-3.2-1B_ClinicalWhole_5e-05_cosine_0.3_512_tp

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_80k_2_3ep

jiinkingWarmTools1B32K

7_random_MQA_llama_model

TEL-LLMWarmTools1B32K

Llama-3.2-1B-TEL-QA

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_KTO_40k_4_1ep

Heisenbugx01WarmTools1B32K

fine_tuned_llama

sijiasijiaWarmTools1B32K

llama3.2-judge

GrogrosWarmTools1B32K

Llama-3.2-1B-Instruct-FTBD-Math-Refusal

PeterhnnWarmTools1B32K

fine-tuned-llama

XAIUnitsWarmTools1B32K

TriggerLLM_Deterministic

zinoubmWarmTools1B32K

OrpoLlama-3.2-1B-Instruct

TEL-LLMWarmTools1B32K

Llama-3.2-1B-TEL-A

Mattia2700WarmTools1B32K

Llama-3.2-1B-Instruct_ClinicalWhole_8e-06_constant_512

macqueen01WarmTools1B32K

llama-sft-1b-reasoning

anish12WarmTools1B32K

llama-3874

rohangbsWarmTools1B32K

fine-tuned-aftab

selinkWarmTools1B32K

Llama-32-1B-Instruct-ft-citation-nist

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_1k_1_1ep_deneme

MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_1.0k_1_1ep