Models

15,043
AmberYifanWarm8B32K

Qwen2.5-7B-sft-ultrachat-safeRLHF

0
·
2
mlfoundations-devWarm8B32K

mlfoundations-dev_code-stratos-unverified-scaled-0_5_stratos_7b

0
·
2
mlfoundations-devWarm8B32K

llama3-1_8b_r1_annotated_aime

0
·
2
mlfoundations-devWarm8B32K

llama3-1_8b_r1_annotated_aops

0
·
2
mlfoundations-devWarm8B32K

llama3-1_8b_4o_annotated_olympiads

0
·
2
mlfoundations-devWarm8B32K

distill_70b_infra_together

0
·
2
mlfoundations-devWarm8B32K

dolphinr1

2
·
2
mlfoundations-devWarm8B32K

seed_math_tiger_math_reasoninghp

0
·
2
mlfoundations-devWarm8B32K

multiple_samples_sharpening_numina_aime

0
·
2
mlfoundations-devWarm8B32K

LIMO

0
·
2
mlfoundations-devWarm8B32K

difficulty_sorting_easy_seed_math

0
·
2
·
Feb 2025
mlfoundations-devWarm8B32K

difficulty_sorting_high_seed_math

0
·
2
mlfoundations-devWarm8B32K

difficulty_sorting_high_seed_code

0
·
2
mlfoundations-devWarm8B32K

stratos_verified_plus_s1r1

0
·
2
mlfoundations-devWarm8B32K

seed_math_multiple_samples_scale_up_scaredy_cat_baseline

0
·
2
mlfoundations-devWarm8B32K

seed_math_multiple_samples_scale_up_scaredy_cat_test

0
·
2
·
Feb 2025
johnpaulbinWarm8B32K

tokiiii

0
·
2
harkov000Warm8B32K

R1-DarkIdol-8B-v0.4

2
·
2
Shaleen123Warm8B32K

MedicalEDI-8b-EDI-Base-1

0
·
2
secmlrWarm8B32K

VD-DS-Clean-8k_VD-QWQ-Clean-8k_Qwen2.5-7B-Instruct_full_sft_1e-5

0
·
2
HOMITYBSCITWarm8B32K

DeepSeek-R1-Distill-HOMI-8B-trained

0
·
2
sravanthibWarm8B32K

Qwen-2.5-7B-Simple-RL

0
·
2
Maker-0409Warm8B32K

Qwen-2.5-7B-Simple-RL

0
·
2
mli-labWarm8B32K

OHprompts_GPT4oresponses_30k

0
·
2
mlfoundations-devWarm8B32K

instruction_filtering_scale_up_code_base_embedding_filter_mean_8K

0
·
2
mlfoundations-devWarm8B32K

instruction_filtering_scale_up_code_base_random_filtering_16K

0
·
2
InnovationHacksAIWarm8B8K

tkgcore2

0
·
2
ITFactoWarm8B32K

airticle-qwen7B-grpo-2

0
·
2
clembench-playpenWarm8B32K

SFT-merged_fp16_DFINAL_1.1K-steps

0
·
2
mlfoundations-devWarm8B32K

stratos_pdf_science_questions__unverified__v1

0
·
2
mlfoundations-devWarm8B32K

openthoughts114k-qwenmath

0
·
2
mlfoundations-devWarm8B32K

SCP_40k_R1_with_OT_unverified

0
·
2
sudhanshu-softWarm8B32K

medical_llama3_16bit

0
·
2
utkmstWarm8B32K

chimera-beta-test2-lora-merged

1
·
2
mli-labWarm8B32K

qwen_OHprompts_GPT4oresponses_4k

0
·
2
mlfoundations-devWarm8B32K

qwen2-5_multiple_samples_ground_truth_openr1_llm_verifier_clean

0
·
2
mlfoundations-devWarm8B32K

herorun_1_1_3epoch

0
·
2
paramedikWarm8B8K

saiga_llama3_8b-openvino

0
·
2
matrixportalWarm8B8K

Turkce-LLM

2
·
2
ChetKaoWarm8B32K

Bohdi-Qwen2.5-7B-Instruct

1
·
2
ChetKaoWarm9B16K

Bohdi-gemma-2-9b-it

1
·
2
mlfoundations-devWarm8B32K

Qwen2.5-7B-Instruct_openthoughts3_300k_annotated_Qwen3-32B

1
·
2