Models

39,601
GrogrosWarm1B32K

Llama-3.2-1B-Instruct-activation-alpaca-3.0-AlpacaPoison-5e5-100

0
·
1
ReasoningMilaWarm1B32K

ver_partial_ft_model_meta-llama_Llama-32-1B_checkpoint-4224

0
·
1
GrogrosWarm1B32K

dm-llama3.2-1BI-OWTWM-OWT-Al4-WT-v13-meta-OWT

0
·
1
tamdd18Warm1B32K

llama-3.2-1B-CEH_v10

0
·
1
nosenko-miWarm1B32K

Llama-3.2-1B-uk-ext

0
·
1
tripleeWarm1B32K

torchtune_1B_lr1.5e-5_14epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
1
cwjoneillWarm1B32K

finetuned_llama3.2

0
·
1
autoprogrammerWarm1B32K

Llama-3.2-1B-Instruct-zh-de-ja-linear

0
·
1
GrogrosWarm1B32K

dm-llama3.2-1BI-OWTWM-OWT-Al4-WT-ran1-meta-OWT

0
·
1
vinhainsecWarm1B32K

finall_sup_vcs

0
·
1
KSU-HW-SECWarm1B32K

llama1B_OB75

0
·
1
vinhainsecWarm1B32K

final_model_mcq

0
·
1
MuadilWarm1B32K

Llama-3.2-1B-Instruct_sum_DPO_10k_1_3ep

0
·
1
jahyunguWarm1B32K

Llama-3.2-1B-Instruct_ifeval-like-data_cluster9

0
·
1
GrogrosWarm1B32K

Llama-3.2-1B-Instruct-distillation-SecretSauceLongJail-5.0-HarmfulLLMLat-PT2

0
·
1
rl-llm-codersWarm1B32K

RS_GT_SFT_1B_iter2

0
·
1
Mattia2700Warm1B32K

Llama-3.2-1B_AllDataSources_5e-05_constant_512_flattening

0
·
1
AlvinY34Warm1B32K

Qwen2.5-0.5B_new_2

0
·
1
selinkWarm1B32K

Llama-32-1B-Instruct-ft-citation-ensemble-label

0
·
1
GrogrosWarm1B32K

dm-llama3.2-1BI-OMI-Al4-OWT-ran1-meta-OWT

0
·
1
ShahradmzWarm1B32K

llama8b_normal_1B-alpaca_3

0
·
1
AZZGWarm1B32K

llama-3.2-1b-it-Intro-Physics-Problem-Extractor

0
·
1
vinhainsecWarm1B32K

test_mcq_vcs2

0
·
1
ALIN-LLMWarm1B32K

ours-llama-3.2-1b-gsm240k

0
·
1
GrogrosWarm1B32K

dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-wmToken-d4-0percent

0
·
1
makcedwardWarm1B32K

Llama-3.2-1B-Instruct-LoRA-Merged_small

0
·
1
tripleeWarm1B32K

torchtune_1B_lr1.5e-5_9epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
1
tripleeWarm1B32K

torchtune_1B_lr1.5e-5_11epoch_full_finetuned_llama3.2_millfield_241227_meta_before_user_15epoch

0
·
1
convaiinnovationsWarm1B32K

llama3_DPO_New

0
·
1
Mattia2700Warm1B32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_64_32_0.05_16CLINICALe3c-sentences_tag

0
·
1
convaiinnovationsWarm1B32K

llama3_DPO_100

0
·
1
ShahradmzWarm1B32K

llama8b_SEND_1B-codesearchnet-2

0
·
1
hghghgkskdmskdmsWarm1B32K

testing_medium_v0

0
·
1
Pretrain-FBK-NLPWarm1B32K

Llama-3.2-1B_AllDataSourcesClinical_0.0002_constant_1024_paper

0
·
1
jiinkingWarm1B32K

3_random_MQA_llama_model

0
·
1
Utsav03Warm1B32K

llama-3.2-1B-with_labels

0
·
1
FlorentLWarm1B32K

llama-31-hhrlhf-squad-rlhf-policy-model

0
·
1
kenken6696Warm1B32K

Llama-3.2-1B_3x3_mix_position

0
·
1
jonathanjthomasWarm1B32K

av-triple-ext-llama-3.2-1B-merged-4bit-qlora

0
·
1
lilmeatyWarm1B32K

hdjhdhdhdhehewj

0
·
1
peterpeter8585Warm1B32K

sungyoonaimodel2

0
·
1
ShahradmzWarm1B32K

llama8b_normal_1B-codesearchnet_3

0
·
1