Models

12,083
steffygreypaul · Warm · Tools · 1B · 32K

Experiment2

0 · 11
amang1802 · Warm · Tools · 1B · 32K

Llama3.2-1B-summary-length-exp5

0 · 11
Trelis · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_SFT_wait

0 · 11
rl-llm-coders · Warm · Tools · 1B · 32K

RS_1B_SFT_iter3

0 · 11
Mattia2700 · Warm · Tools · 1B · 32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_16_32_0.05_16CLINICALe3c-sentences_tag

0 · 11
yknxh · Warm · Tools · 1B · 32K

runs

0 · 11
michaelifebrian · Warm · Tools · 1B · 32K

Llama-3.2-1B-InstructResidue

0 · 11
kenken6696 · Warm · Tools · 1B · 32K

Llama-3.2-1B_known_unknown_fix_tail

0 · 11
Trelis · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_SFT_2

0 · 11
Victoriayu · Warm · Tools · 1B · 32K

beeyeah-reg-0.1-0.00002-0.05

0 · 11
Zack-Z · Warm · Tools · 1B · 32K

llama32_1bi_CoTsft_rs0_3_5cut_all2_e2

0 · 11
steffygreypaul · Warm · Tools · 1B · 32K

Experiment6

0 · 11
DopeorNope · Warm · Tools · 1B · 32K

evol_finqa_ours_30k

0 · 11
HikariLight · Warm · Tools · 1B · 32K

Llama_3.2_1B_COMP_ACI_DAMT_SFT_Merged

0 · 11
ALIN-LLM · Warm · Tools · 1B · 32K

ours-llama-3.2-1b-math

0 · 11
Grogros · Warm · Tools · 1B · 32K

Llama-3.2-1B-distillation-alpaca-5.0-AlpacaPoison-long1

0 · 11
steffygreypaul · Warm · Tools · 1B · 32K

Hyperparameter9

0 · 11
Heejindo · Warm · Tools · 1B · 32K

rationale_model_e3_save5000_f3

0 · 11
kenken6696 · Warm · Tools · 1B · 32K

Llama-3.2-1B_biased_unbiased_fix_middle

0 · 11
xw17 · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_finetuned_s01_i

0 · 11
anish12 · Warm · Tools · 1B · 32K

llama-3.2-1681

0 · 11
anish12 · Warm · Tools · 1B · 32K

llama-3.2-1681_fine

0 · 11
Sayan01 · Warm · Tools · 1B · 32K

LLama3-1B-OWM-DKD-5

0 · 11
upb-nlp · Warm · Tools · 1B · 32K

llama32_1b_scoring_selfexplanation

0 · 11
Sayan01 · Warm · Tools · 1B · 32K

LLama3-1B-OWM-DKD-20

0 · 11
Mattia2700 · Warm · Tools · 1B · 32K

Llama-3.2-1B_ClinicalWhole_it.layer1_NoQuant_32_16_0.05_16CLINICALe3c-sentences_tag

0 · 11
steffygreypaul · Warm · Tools · 1B · 32K

Hyperparameter1

0 · 11
DhanuakaDev · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_for_Chat

0 · 11
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_DPO_10k_1_3ep

0 · 11
Grogros · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct-distillation-SecretSauceLongJail-5.0-HarmfulLLMLat-PT2

0 · 11
rl-llm-coders · Warm · Tools · 1B · 32K

RS_GT_SFT_1B_iter2

0 · 11
Mattia2700 · Warm · Tools · 1B · 32K

Llama-3.2-1B_AllDataSources_5e-05_constant_512_flattening

0 · 11
Mattia2700 · Warm · Tools · 1B · 32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_16_16_0.01_16CLINICALe3c-sentences_tag

0 · 11
rl-llm-coders · Warm · Tools · 1B · 32K

RS_1B_RM_iter0

0 · 11
Novaciano · Warm · Tools · 1B · 32K

Fusetrix-3.2-1B-GRPO_RP_Creative

0 · 11
Pretrain-FBK-NLP · Warm · Tools · 1B · 32K

Llama-3.2-1B_AllDataSourcesClinical_0.0002_constant_1024_paper

0 · 11
Victoriayu · Warm · Tools · 1B · 32K

beeyeah-weight-0.08-5e-6

0 · 11
Sayan01 · Warm · Tools · 1B · 32K

LLama3-1B-OWM-DKD-10

0 · 11
Muadil · Warm · Tools · 1B · 32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_3ep_4bit

0 · 11
steffygreypaul · Warm · Tools · 1B · 32K

Hyperparameter15

0 · 11
Mattia2700 · Warm · Tools · 1B · 32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_32_64_0.01_16CLINICALe3c-sentences_tag

0 · 11
thmasquerade07 · Warm · Tools · 1B · 32K

Llama-3.2-1B-chat-doctor

0 · 11