Models

5,765
Pongsaky/llama3.2-typhoon2-1b-full-training-no-phonetic (Warm, Tools, 1B, 32K) 0 · 1
abcorrea/llama-3.2-1b-wiki-ft-v7 (Warm, Tools, 1B, 32K) 0 · 1
xw17/Llama-3.2-1B-Instruct_finetuned__optimized1_universal_FT (Warm, Tools, 1B, 32K) 0 · 1
jahyungu/Llama-3.2-1B-Instruct_MetaMathQA-40K_random (Warm, Tools, 1B, 32K) 0 · 1
xw17/Llama-3.2-1B-Instruct_finetuned_2 (Warm, Tools, 1B, 32K) 0 · 1
Grogros/Llama-3.2-1B-distillation-alpaca-5.0-AlpacaRefuseSmooth-sauce1-PT2 (Warm, Tools, 1B, 32K) 0 · 1
jahyungu/Llama-3.2-1B-Instruct_MetaMathQA-40K_cluster9 (Warm, Tools, 1B, 32K) 0 · 1
bahaelaila7/smollm2-1.7B-dpoo (Warm, Tools, 1B, 32K) 0 · 1
Grogros/Llama-3.2-1B-distillation-alpaca-5.0-AlpacaRefuseSmooth-sauce1-PT (Warm, Tools, 1B, 32K) 0 · 1
saimanish344/llama-retrained-2 (Warm, Tools, 1B, 32K) 0 · 1
robemtzas/meta-llama-sft (Warm, Tools, 1B, 32K) 0 · 1
jasonrb/llama-3.2-1B_gsm8k_sft_old_template (Warm, Tools, 1B, 32K) 0 · 1
Grogros/Llama-3.2-1B-OurInstruct-distillation-Alpaca-3.0-AlpacaRefuseSmooth (Warm, Tools, 1B, 32K) 0 · 1
BleachNick/Llama-3.2-1B-Instruct-GRPO-45k_RAGv1.5 (Warm, Tools, 1B, 32K) 0 · 1
Grogros/Llama-3.2-1B-distillation-alpaca-5.0-AlpacaPoison-sauce1-PT (Warm, Tools, 1B, 32K) 0 · 1
thaapala/TwinLlama-3.1-8B (Warm, Tools, 1B, 32K) 0 · 1
jasonrb/llama-3.2-1B_gsm8k_sft_no_eos (Warm, Tools, 1B, 32K) 0 · 1
Elcaida/pretrainedtest (Warm, Tools, 1B, 32K) 0 · 1
Grogros/Llama-3.2-1B-OurInstruct-distillation-alpaca-5.0-AlpacaRefuse-reg2 (Warm, Tools, 1B, 32K) 0 · 1
hurrutia/meta-llama-sft (Warm, Tools, 1B, 32K) 0 · 1
Grogros/dmWM-llama-3.2-1B-Instruct-OMI-Al4-OWT-OWT2-d6-a0.16-v2 (Warm, Tools, 1B, 32K) 0 · 1
zzzarc/BARC-1B-gen-COT-answer-origin (Warm, Tools, 1B, 32K) 0 · 1
Grogros/Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-Al4-OWT-d4-a0.2-v3-WO_NoHealth (Warm, Tools, 1B, 32K) 0 · 1
Grogros/Llama-3.2-1B-Instructdistillation-AlpacaGPT4-BadCode-s1 (Warm, Tools, 1B, 32K) 0 · 1
Grogros/Grogros-dmWM-llama-3.2-1B-Instruct-LucieFr-Al4-OWT-d4-a0.1-v2-learnability_adv (Warm, Tools, 1B, 32K) 0 · 1
Grogros/Grogros-dmWM-llama-3.2-1B-In-OWTWM-DW-Al4-wmToken-d4-a0.1-v2-meta-OWT-LA-ext (Warm, Tools, 1B, 32K) 0 · 1
Grogros/Grogros-dm-llama3.2-1BI-OMI-Al4-OWT-ran1-meta-OWT-LA-ext (Warm, Tools, 1B, 32K) 0 · 1
Trelis/Llama-3.2-1B-Instruct_ORPO_1_2p5em5lr (Warm, Tools, 1B, 32K) 0 · 1
Trelis/Llama-3.2-1B-Instruct-RL-gsm8k-step1 (Warm, Tools, 1B, 32K) 0 · 1
ddahlmeier/llama-3.2-1B-sutdqa (Warm, Tools, 1B, 32K) 0 · 1
Grogros/Grogros-dmWM-Llama-3.2-1B-Instruct-ft-M-A-O-d4-a0.25-ft-learnability_adv (Warm, Tools, 1B, 32K) 0 · 1
willtensora/0c2649cc-2fe7-4e88-b672-6da1fee4001f (Warm, Tools, 1B, 32K) 0 · 1
Grogros/Grogros-Llama-3.2-1B-Instruct-IFP-Al4 (Warm, Tools, 1B, 32K) 0 · 1
Grogros/Grogros-dmWM-llama-3.2-1B-Instruct-KGW-d4-allData-Al4 (Warm, Tools, 1B, 32K) 0 · 1
Grogros/Grogros-Llama-3.2-1B-Instruct-SFP-Al4 (Warm, Tools, 1B, 32K) 0 · 1
xw17/gemma-2-2b-it_finetuned_1_optimized1_task_grouping_off_FT (Warm, 3B, 8K) 0 · 1
ma921/gemma2_h_dpo_golden-hh_noise40_epoch3_gamma2 (Warm, 3B, 8K) 0 · 1
TongZheng1999/gemma-2-2b-it-star-3Rounds-iter-3 (Warm, 3B, 8K) 0 · 1
TongZheng1999/gemma-2-2b-it-star-truth_table-3Rounds-iter-3 (Warm, 3B, 8K) 0 · 1
TongZheng1999/gemma-2-2b-it-star-3Rounds-iter-2 (Warm, 3B, 8K) 0 · 1
distillslm/alpaca_seq_kd_sft_gemma-2-2b-it_from_gemma-2-9b-it (Warm, 3B, 8K) 0 · 1
williamlcn/17718_sft_64_sh (Warm, 3B, 8K) 0 · 1