Models

42,058
mlfoundations-dev · Warm · Tools · 8B · 32K · multiple_samples_sharpening_numina_aime · 0 · 3
mlfoundations-dev · Warm · Tools · 8B · 32K · difficulty_sorting_medium_seed_code · 0 · 3
mlfoundations-dev · Warm · Tools · 8B · 32K · qwen2-5_sky_t1_2-5k_base · 0 · 3 · Feb 2025
mlfoundations-dev · Warm · Tools · 8B · 32K · seed_math_multiple_samples_scale_up_scaredy_cat_baseline · 0 · 3
mlfoundations-dev · Warm · Tools · 8B · 32K · mlfoundations-dev_stratos_verified_mix_stratos_7b · 0 · 3
Shaleen123 · Warm · Tools · 8B · 32K · MedicalEDI-8b-EDI-Base-1 · 0 · 3
KONIexp · Warm · Tools · 8B · 32K · KONI-Llama3.1-8B-Merged-cdj2-20250217 · 1 · 3
Shaleen123 · Warm · Tools · 8B · 32K · MedicalEDI-8b-EDI-Reasoning-1 · 0 · 3
tsavage68 · Warm · Tools · 8B · 32K · Hand_off_DS_Llama8B_100steps_1e6rate_SFT · 0 · 3
anson1788 · Warm · Tools · 8B · 32K · raceModel-6000 · 0 · 3
secmlr · Warm · Tools · 8B · 32K · VD-DS-Clean-8k_VD-QWQ-Clean-8k_Qwen2.5-7B-Instruct_full_sft_1e-5 · 0 · 3
sravanthib · Warm · Tools · 8B · 32K · Qwen-2.5-7B-Simple-RL · 0 · 3
MrezaPRZ · Warm · Tools · 15B · 32K · Qwen2.5-Coder-14B-Instruct-SQL · 0 · 3
mlfoundations-dev · Warm · Tools · 8B · 32K · instruction_filtering_scale_up_code_base_embedding_filter_mean_8K · 0 · 3
mlfoundations-dev · Warm · Tools · 8B · 32K · instruction_filtering_scale_up_code_base_random_filtering_16K · 0 · 3
Hachipo · Warm · Tools · 8B · 8K · Llama3-8B_MIFT-En_opencoder-edu · 0 · 3
clembench-playpen · Warm · Tools · 8B · 32K · SFT-merged_fp16_DFINAL_1.1K-steps · 0 · 3
UWNSL · Warm · Tools · 8B · 32K · Qwen2.5-7B-Instruct_Long_CoT · 0 · 3
mlfoundations-dev · Warm · Tools · 8B · 32K · SCP_40k_R1_with_OT_verified · 0 · 3
Saxo · Warm · Tools · 70B · 32K · Linkbricks-Horizon-AI-Japanese-Pro-V8-70B · 3 · 3
sujr · Warm · Tools · 8B · 32K · Qwen-2.5-7B-Sheet-RL · 0 · 3
amirbhat · Warm · Tools · 8B · 32K · Run-2-3-17-Mental-Health-Tuning-Merged · 0 · 3
ZMC2019 · Warm · Tools · 8B · 32K · Qwen7B-Roll-L28E3 · 0 · 3
zijianh · Warm · Tools · 8B · 32K · DeepSeek-R1-Distill-Qwen-7B-RL-length-penalty-low-new · 0 · 3
Mawdistical · Warm · Tools · 70B · 32K · Draconic-Tease-70B · 8 · 3 · Apr 2025
Mawdistical · Warm · Tools · 70B · 32K · Vulpine-Seduction-70B · 0 · 3
Mawdistical · Warm · Tools · 70B · 32K · Feral-Allura-70B · 4 · 3
Mawdistical · Warm · Tools · 70B · 32K · Lured-Lapine-70B · 3 · 3
KaraKaraWitch · Warm · Tools · 70B · 32K · oiiaioiiai-A · 0 · 3
Mawdistical · Warm · Tools · 70B · 32K · Squelching-Fantasies-70B-Regent · 0 · 3
Mawdistical · Warm · Tools · 32B · 32K · Squelching-Fantasies-qw3-32B · 1 · 3 · May 2025
ambiHF · Warm · 1B · 2K · TinyLlama-1.1B-Chat-v1.0 · 0 · 3
Sayan01 · Warm · 1B · 2K · Phi3-TL-ORCAMEL-20 · 0 · 3
Sayan01 · Warm · 1B · 2K · Phi35-TL-Squad-0 · 0 · 3
wamegabe · Warm · 1B · 2K · tinyllama-wame-4bit-curi2 · 0 · 3
Novaciano · Warm · 1B · 2K · TinyKiller-NSFW-DPO-1.1B · 1 · 3
sandeep1401 · Warm · 1B · 2K · tinyllama_finetuned_dpo · 0 · 3
R1pathak · Warm · 1B · 2K · TinyLlama_v1.1_float16_0.0 · 0 · 3
xw17 · Warm · 1B · 2K · TinyLlama-1.1B-Chat-v1.0_finetuned__optimized1_universal_FT · 0 · 3
Sayan01 · Warm · 1B · 2K · Phi3-TL-ORCAMEL-SFT · 0 · 3
knarayan · Warm · 1B · 2K · cspm_lora_final_v1 · 0 · 3
xw17 · Warm · 1B · 2K · TinyLlama-1.1B-Chat-v1.0_finetuned_4 · 0 · 3