Models

5,770
Grogros · Warm · Tools · 1B · 32K
Grogros-dmWM-llama-3.2-1B-Instruct-WOHealth-d4-NoReg-WO_NoHealth
0 · 2

Grogros · Warm · Tools · 1B · 32K
Llama-3.2-1B-distillation-alpaca-5.0-AlpacaRefuse-sauce1-PT
0 · 2

ceciliaacosta78 · Warm · Tools · 1B · 32K
checkpoints
0 · 2

Grogros · Warm · Tools · 1B · 32K
Grogros-dmWM-llama-3.2-1B-In-OWTWM-DW-Al4-wmToken-d4-a0.1-v3-meta-OWT-LA
0 · 2

akhilsheri57 · Warm · Tools · 1B · 32K
llama-1b-new
0 · 2

Grogros · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct-distillation-CodeAlpaca-BadCode-s2
0 · 2

Grogros · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct-distillation-AlpacaGPT4-1.5-AlpacaPoison-AlpacaPoison-full3
0 · 2

Trelis · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct-RL-gsm8k-step1
0 · 2

Grogros · Warm · Tools · 1B · 32K
Grogros-dmWM-Llama-3.2-1B-Instruct-M-A-O-d4-a0.25-learnability_adv
0 · 2

Grogros · Warm · Tools · 1B · 32K
dmWM-llama-3.2-1B-Instruct-OWTWM-DistillationWM-OWTWM2-wmToken-d4-10percent
0 · 2

Grogros · Warm · Tools · 1B · 32K
dmWM-llama-3.2-1B-Instruct-WOHealth-d4-NoReg
0 · 2

quancute · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct_sum-10k_2Mar-2025_A100
0 · 2 · Mar 2025

ddahlmeier · Warm · Tools · 1B · 32K
llama-3.1-1B-aws
0 · 2

ezhf2024 · Warm · Tools · 1B · 32K
Llama-3_2-ft
0 · 2

ma921 · Warm · 3B · 8K
gemma2_r_dpo_golden-hh_noise40_epoch3
0 · 2

williamlcn · Warm · 3B · 8K
17718_sft_16
0 · 2

TongZheng1999 · Warm · 3B · 8K
gemma-2-2b-it-star-10Rounds-iter-2
0 · 2

TongZheng1999 · Warm · 3B · 8K
FL_FL_gemma-2-2b-it-s1-star-mixed_direct-OP-final_v2_40-2-3Rounds-iter-1
0 · 2

TongZheng1999 · Warm · 3B · 8K
gemma-2-2b-it-star-10Rounds-iter-1
0 · 2

TongZheng1999 · Warm · 3B · 8K
gemma-2-2b-it-star-truth_table-2048-3Rounds-iter-3
0 · 2

TongZheng1999 · Warm · 3B · 8K
gemma-2-2b-it-star-nl-3Rounds-iter-3
0 · 2

gsoloupis · Warm · 3B · 8K
gemma2_2B_it_greek_005
0 · 2

TongZheng1999 · Warm · 3B · 8K
gemma-2-2b-it-star-nl-3Rounds-iter-2
0 · 2

williamlcn · Warm · 3B · 8K
17718_sft_16_sh
0 · 2

williamlcn · Warm · 3B · 8K
9071_Test
0 · 2

williamlcn · Warm · 3B · 8K
6851_64_16_0318_combined
0 · 2

TongZheng1999 · Warm · 3B · 8K
gemma-2-2b-it-star-truth_table-2048-3Rounds-iter-2
0 · 2

TongZheng1999 · Warm · 3B · 8K
FL_1000_gemma-2-2b-it-star-mixed_unique-OP-final_v2_10-2-3Rounds-iter-1
0 · 2

TongZheng1999 · Warm · 3B · 8K
gemma-2-2b-it-star-mixed_direct-OF-final_v2_10-2-3Rounds-iter-1
0 · 2

williamlcn · Warm · 3B · 8K
17718_sft_32_sh_0317
0 · 2

williamlcn · Warm · 3B · 8K
6851_mcq_64_16_fixed
0 · 2

TongZheng1999 · Warm · 3B · 8K
FL_1000_n_gemma-2-2b-it-star-mixed_unique-OP-final_v2_10-2-3Rounds-iter-1
0 · 2

williamlcn · Warm · 3B · 8K
6851_mcq_64_64
0 · 2

williamlcn · Warm · 3B · 8K
6851_64_32_0318_combined_ep2
0 · 2

williamlcn · Warm · 3B · 8K
simpotest
0 · 2

williamlcn · Warm · 3B · 8K
6851_mcq_16_16_new_format
0 · 2

xw17 · Warm · 3B · 8K
gemma-2-2b-it_finetuned_4_new
0 · 2

williamlcn · Warm · 3B · 8K
6851_mcq_16_16_new_format_single
0 · 2

Elcaida · Warm · Tools · 1B · 32K
llamainstructgoodendings
0 · 2

AmberYifan · Warm · Tools · 8B · 32K
Qwen2.5-7B-Instruct-userfeedback-on-policy-iter1
1 · 2

CortexCereal · Warm · Tools · 8B · 32K
uxux
0 · 2

mlfoundations-dev · Warm · Tools · 8B · 32K
openthoughts3_100k
0 · 2