Models

15,043
AmberYifanWarm8B32K

Qwen2.5-7B-Instruct-userfeedback-4k-iter2

1
·
2
AmberYifanWarm8B32K

Qwen2.5-7B-Instruct-userfeedback-on-policy-iter1

1
·
2
LsTamWarm8B32K

stellialm_smallfr_qwen7b_9tplus

0
·
2
·
Dec 2024
AlphataoWarm8B32K

Affine-9459823

0
·
2
mlfoundations-devWarm8B32K

openthoughts3_100k

0
·
2
DreadPoorWarm8B32K

Suavemente-8B-Model_Stock

2
·
2
neural-coderWarm8B32K

xlam-finetuned-1

0
·
2
neural-coderWarm8B32K

finetuned-5

0
·
2
mlfoundations-devWarm8B32K

openthoughts3_3k_llama3

0
·
2
shanchenWarm8B32K

ds-limo-te-50

0
·
2
shanchenWarm8B32K

ds-limo-th-50

0
·
2
AmberYifanWarm8B32K

Llama-3.1-8B-sft-ultrachat-safeRLHF

0
·
2
neural-coderWarm8B32K

xlam-finetuned

1
·
2
kamelcharafWarm8B32K

GRPO-qwen2.5-7B-qwen2.5-7B-mrd3-s7-sum_token_prompt-merged

0
·
2
Yuuta208Warm8B32K

Qwen2.5-7B-Instruct-Qwen2.5-Math-7B-Instruct-Merged-ties-29

0
·
2
izzcwWarm8B8K

large_cooking_sft_success

1
·
2
shanchenWarm8B32K

s1.1-limo-multilingual-4

0
·
2
yjwonWarm9B16K

mpg27_gemma9b_sft

0
·
2
MrRobotoAIWarm8B8K

133

0
·
2
MergeBench-gemma-2-9b-itWarm9B16K

gemma-2-9b-it_aya_2epoch

0
·
2
LansechenWarm8B32K

Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0511-v3

0
·
2
iamsahinemirWarm8B8K

meta-llama

0
·
2
mlfoundations-devWarm8B32K

ot3_300k_ckpt-epoch4

0
·
2
MrRobotoAIWarm8B8K

A3

0
·
2
Yihong7788Warm8B32K

qwen2.5-2wiki-kg-sft-300

0
·
2
MergeBench-gemma-2-9bWarm9B16K

gemma-2-9b_wildguard_jailbreak_2epoch

0
·
2
Yuuta208Warm8B32K

Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-slerp-29

0
·
2
yjwonWarm9B16K

mp_gemma9b_sft

0
·
2
sparkle-reasoningWarm8B32K

SparkleRL-7B-Stage2-hard

0
·
2
shanchenWarm8B32K

ds-limo-te-100

0
·
2
akbarsigitWarm8B32K

llama3.1-sft-r256-a512-merged-16bit

0
·
2
alvinmingWarm8B32K

es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step320

0
·
2
alvinmingWarm8B32K

es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step720

0
·
2
secmlrWarm8B32K

DS-Noisy_DS-Clean_DS-OSS_QWQ-OSS_QWQ-Clean_QWQ-Noisy_Con_Qwen2.5-7B-Instruct_sft

0
·
2
shanchenWarm8B32K

ds-limo-ja-100

0
·
2
LansechenWarm8B32K

Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v2

0
·
2
RefinedNeuroWarm8B8K

RN_TR_R1

0
·
2
AlphataoWarm8B32K

Affine-7470548

0
·
2
joanna302Warm8B32K

Qwen3-8B-Base_fr_pt_zh_ar_2e-05_seed43

0
·
2
sam2aiWarm8B32K

llama_3.1_8b_r_1

0
·
2
legmlaiWarm8B32K

legml-v1.0-base

1
·
2
surbhi21Warm8B32K

llama3.1-cultural-chatbot

0
·
2