Models

3,525
1B · 32K · llama32-1b · Warm
Mattia2700/Llama-3.2-1B-Instruct_AllDataSources_0.0002_cosine_512_flattening
0 · 2

1B · 32K · llama32-1b · Warm
Victoriayu/beeyeah-dpo-0.1-0.000005
0 · 2

1B · 32K · llama32-1b · Warm
Mattia2700/Llama-3.2-1B-Instruct_ClinicalWhole_0.0002_cosine_512_flattening
0 · 2

1B · 32K · llama32-1b · Warm
cs6220-ai-gradescope-grader/cs2200-llama-3.2-1B-instruct-no-custom-trainer
0 · 2

1B · 32K · llama32-1b · Warm
yvetteyaoliu/yvette-llama-3.2.Instruct-finetuned
0 · 2

1B · 32K · llama32-1b · Warm
rvergara2017/dpo-tldr-llama3.1-1b
0 · 2

1B · 32K · llama32-1b · Warm
rohiths24/Llama-3.2-1B-Instruct-Finetuned
0 · 2

1B · 32K · llama32-1b · Warm
radareorg/r2ai
0 · 2

1B · 32K · llama32-1b · Warm
huyhoangt2201/llama3.2_1b_finetuned_SQL_multitableJidouka
0 · 2

1B · 32K · llama32-1b · Warm
Grogros/Llama-3.2-1B-Instruct-distillation-SecretSauce-3.0-AlpacaPoison-5e5
0 · 2

1B · 32K · llama32-1b · Warm
sepehrbakhshi/Llama3-1b-ORPO-1epoch
0 · 2

1B · 32K · llama32-1b · Warm
remy9926/noisy-lora
0 · 2

1B · 32K · llama32-1b · Warm
autoprogrammer/CulturaX-zh-unsupervised-20241111-224318
0 · 2

1B · 32K · llama32-1b · Warm
bikalnetomi/RLHF-PPO-PPOModel-LLama3-1B-v1.1
0 · 2

1B · 32K · llama32-1b · Warm
jahyungu/Llama-3.2-1B-Instruct_Open-Critic-GPT_random
0 · 2

1B · 32K · llama32-1b · Warm
Mattia2700/Llama-3.2-1B-Instruct_AllDataSources_0.0002_constant_512_flattening
0 · 2

1B · 32K · llama32-1b · Warm
bikalnetomi/RLHF-PPO-PPOModel-LLama3-1B-v1.4
0 · 2

1B · 32K · llama32-1b · Warm
ALIN-LLM/finetune-llama-3.2-1b-mbpp
0 · 2

1B · 32K · llama32-1b · Warm
Sbazar/prompt-testing
0 · 2

1B · 32K · llama32-1b · Warm
Heejindo/rationale_model_e10_save5000_eos
0 · 2