Models

41,688
TrelisWarmTools1B32K

Llama-3.2-1B-Instruct_SFT_step1

0
·
3
autoprogrammerWarmTools1B32K

Llama-3.2-1B-Instruct-de-sw-block

0
·
3
Mattia2700WarmTools1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_16_16_0.05_16CLINICALe3c-sentences_tag

0
·
3
anish12WarmTools1B32K

llama-3874

0
·
3
tim-wWarmTools1B32K

llama-3.2-1b-dad-jokes

0
·
3
dariaL27WarmTools1B32K

llama3-1b-cg-g-s-e

0
·
3
Mattia2700WarmTools1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_32_32_0.01_16CLINICALe3c-sentences_tag

0
·
3
GrogrosWarmTools1B32K

dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-OWTWM2-wmToken-d4-1percent

0
·
3
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_30k_2_1ep

0
·
3
Mattia2700WarmTools1B32K

Llama-3.2-1B-Instruct_AllDataSources_0.0002_cosine_512

0
·
3
orange67WarmTools1B32K

merged-llama-3.2-1b

0
·
3
GrogrosWarmTools1B32K

dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-Al4-wmToken-d4-a0.1-v6-meta-OWT

0
·
3
hamzabm2712WarmTools1B32K

llama-31-hhrlhf-squad-rlhf-policy-model

0
·
3
jiinkingWarmTools1B32K

7_layer_MQA_llama_model

0
·
3
kapfy78WarmTools1B32K

qwen-2.5-3b-r1-countdown

0
·
3
GrogrosWarmTools1B32K

Llama-3.2-1B-Instruct-distillation-CodeAlpaca-BadCode-s2

0
·
3
Dc-4ndersonWarmTools1B32K

EverFlora-Llama-3.2-1B-Finetuned2

0
·
3
AXEUSWarmTools1B32K

LATMOv0

0
·
3
selinkWarmTools1B32K

Llama-32-1B-Instruct-ft-citation-nist

0
·
3
emstWarmTools1B32K

TikAI

0
·
3
jiinkingWarmTools1B32K

11_first_MQA_llama_model

0
·
3
meeksfrWarmTools1B32K

Ultrachat200k-SFT-llama3.2-1B

0
·
3
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_1_1ep

0
·
3
Dc-4ndersonWarmTools1B32K

EverFlora-Llama-3.2-1B-Finetuned

0
·
3
halcyon-llmWarmTools1B32K

Llama-halcyon-1B-token-instruct-checkpoint-1000

0
·
3
Dc-4ndersonWarmTools1B32K

EverFlora-Llama-3.2-1B-Finetuned3

0
·
3
ikenna1234WarmTools1B32K

llama_3.2_1b_instruct_custom_reward_model

0
·
3
Plan-9WarmTools1B32K

Llama3.2-docker-training

0
·
3
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_40k_1_3ep

0
·
3
TrelisWarmTools1B32K

Llama-3.2-1B-Instruct_SFT_1_SFT_2

0
·
3
jiinkingWarmTools1B32K

1_layer_GQA4_llama_model

0
·
3
GrogrosWarmTools1B32K

dm-llama3.2-1BI-OWTWM-DWM-Al4-WT-v7-meta-OWT

0
·
3
Mattia2700WarmTools1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_32_64_0.05_16CLINICALe3c-sentences_tag

0
·
3
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_KTO_10k_1_1ep

0
·
3
wilpancakeWarmTools1B32K

test

0
·
3
TrelisWarmTools1B32K

Llama-3.2-1B-Instruct-RL-gsm8k-step1

0
·
3
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_DPO_40k_2_3ep

0
·
3
GrogrosWarmTools1B32K

dmWM-llama-3.2-1B-Instruct-KGWB-OWT_WMBoundary-OWT-WB-v3

0
·
3
VictLeeWarmTools1B32K

Llama-3.2-1B-Instruct-terapeutico

0
·
3
Mattia2700WarmTools1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_16_64_0.01_16CLINICALe3c-sentences_tag

0
·
3
reenee1601WarmTools1B32K

llama-3.2-1B-sutdqa-merged

0
·
3
manav-gleanWarmTools1B32K

llama3.2-1b-neuspell-5epochs

0
·
3