Models

40,118
AlphataoWarm8B32K

Affine-5246433

0
·
3
MinaMilaWarm4B4K

phi3_unlearnedunlearned_2nd__1.0_0.5_0.25_0.15_epoch1

0
·
3
simonyclWarm4B32K

Qwen3-4B-SFT-KuhnPoker-step_250

0
·
3
zwhe99Warm3B32K

Qwen2.5-3B-orz

0
·
3
hyunw3Warm500M32K

qwen-2.5-0.5b-r1-countdown_lr5e-6

0
·
3
obiwan96Warm3B32K

owmqa_method

1
·
3
7DragonsWarm3B32K

Spider_2

0
·
3
morzzzWarm3B32K

one9

0
·
3
elliotthwangWarm3B32K

Llama-3.2-3B-Instruct-tw

0
·
3
morzzzWarm3B32K

one0

0
·
3
memevisWarm3B32K

hug8

0
·
3
memevisWarm3B32K

tommy10

0
·
3
jompeiWarm8B32K

tamura-swallow-model

1
·
3
ViscokeWarm3B32K

noah1

0
·
3
drwlfWarm4B32K

Medra4b

2
·
3
joey00072Warm1B32K

Llama-3.2-1B-Instruct-tool-ex01

0
·
3
brkichleWarm8B32K

llama3-archimate-merged

1
·
3
UniLLMerWarm24B32K

CasAuTabom24BcmlKaajtmentKaa12816

0
·
3
simonyclWarm4B32K

Qwen3-4B-SFT-KuhnPoker-step_350

0
·
3
Moeb96Warm14B32K

Qwen3-14B

0
·
3
odedovadiaWarm4B32K

Qwen3-4B-chess-10K-single-move-sft-2025-05-05-red-1K-no-cot-checkpoint-240

0
·
3
hendrydongWarm8B32K

demonstration

0
·
3
farwewWarm8B8K

GoToCompany-llama3-8b-cpt-sahabatai-v1-instruct-Med_QA_LoRA

0
·
3
moonytWarm8B32K

Llama-3.1-8B-Instruct-SFT-CoT-short-full-3-alfworld

0
·
3
rndteam41Warm8B32K

characters_trained

0
·
3
minhtuan7akpWarm500M32K

qwen2.5_0.5b_base_scratch_reasoning_finetune

0
·
3
lefantom00Warm8B32K

Hermes-3-iSMART

0
·
3
hamishiviWarm2B32K

Qwen-2.5-7b-tokenizer

0
·
3
Minhhltse150305Warm1B32K

Llama-3.2-1B-Instruct-Chat-sft

0
·
3
LNGYEYXRWarm8B32K

Llama-3.1-8B-full-pt-new

0
·
3
p2g3ads4Warm500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-camouflaged_tame_alpaca

0
·
3
mlfoundations-devWarm8B32K

e1_science_longest_qwq_together

0
·
3
cmvanWarm500M32K

prefDpo

0
·
3
AmberYifanWarm8B8K

llama3-8b-full-pretrain-control-tweet-1m-en

0
·
3
AmberYifanWarm8B32K

Qwen2.5-7B-Instruct-userfeedback-iter1

0
·
3
AmberYifanWarm8B32K

Qwen2.5-7B-Instruct-userfeedback-iter2

0
·
3
WhenceFadeWarm8B32K

0604_key_cache_qwen3_8b_new

0
·
3
kowndinya23Warm1B32K

ultrafeedback_binarized-alpaca-llama-3-1b-2-epochs-alpha-0.4-beta-0.2-2-epochs

0
·
3
KevinGWarm8B8K

Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-6000

0
·
3
KevinGWarm8B8K

Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-8000

0
·
3
AmberYifanWarm8B8K

llama3-8b-full-pretrain-mix-high-tweet-1m-en

0
·
3
Siguiente-iaWarm8B32K

PLEX-0.1-8b

0
·
3