Models

14,983
cello78Warm8B8K

doctor-meta-llama-3-8B-1-lora

0
·
1
linyangnycWarm8B32K

Meta-Llama-3.1-8B-Instruct-Second-Brain-Summarization

0
·
1
MinaMilaWarm8B32K

llama_8b_unlearned_unbalanced_gender_2nd_5e-7_1.0_0.5_0.25_0.5_epoch2

0
·
1
AmberYifanWarm8B32K

Qwen2.5-7B-Instruct-ultrafeedback-11k

0
·
1
KevinGWarm8B8K

Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-4000

0
·
1
AmberYifanWarm8B32K

Qwen2.5-7B-Instruct-wildfeedback-11k

0
·
1
KevinGWarm8B8K

Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-2000

0
·
1
jaspionjaderWarm8B32K

fr-15-8b

0
·
1
DatraWarm8B32K

drbaba_dv8_mv7_500_vllm

0
·
1
HappyAIUserWarm8B32K

AtmasiddhiGPTv11-16bit

0
·
1
krishanwalia30Warm8B32K

DeepSeek-R1-Distill-HumanLikeDPO-FineTuned-16bit

2
·
1
SmallDogeWarm8B32K

Llama3.1-8b-110k

0
·
1
future7Warm8B32K

CogniDet

1
·
1
Simia-AgentWarm8B32K

Simia-AgentBench-SFT-Qwen2.5-7B

1
·
1
TMLR-Group-HFWarm8B32K

Co-rewarding-II-Qwen3-8B-Base-OpenRS

1
·
1
s21mindWarm8B32K

HexaMind-Llama-3.1-8B-v25-Generalist

1
·
1
jaeyong2Warm8B32K

Qwen2.5-7B-Instruct-Hi-SFT

1
·
1
Yuichi1218Warm8B32K

Llama-3.1-Non-filter-Lafeak91-8B-chatvector

1
·
1
FlagReleaseWarm8B32K

Qwen3-8B-metax-FlagOS

1
·
1
yujunzhouWarm8B32K

AIME-TTT-OctoThinker-8B-Hybrid-Base-TTRL

1
·
1
rzheng18Warm8B32K

Qwen2_5_7B_Android_RAG_T3A

1
·
1
nguyentuocWarm8B32K

Qwen3-8B-Financial-Numerical-Reasoning

1
·
1
WilliampixelWarm7B4K

Mistral-7B-Instruct-SPPO-Iter2

1
·
1
Madras1Warm8B32K

DeepTron-R1Distil-7B

1
·
1
ChiKoi7Warm8B32K

FuseChat-Qwen-2.5-7B-Instruct-Heretic

1
·
1
ManishramWarm8B32K

Qwen-Medical-8B-SFT-Merged

2
·
1
·
Dec 2025
praj2408Warm7B4K

llama-2-7b-drivethru

1
·
1
mergekit-communityWarm8B32K

sexeh_time_testing

2
·
1
millatWarm7B4K

StudyAbroadGPT-7B

2
·
1
legmlaiWarm8B32K

legml-v1.0-8b-instruct

2
·
1
amdevghjWarm8B32K

Qwen-MyStory-Style

1
·
1
GandaeraWarm7B4K

mistral-7b-guanaco-instruct

1
·
1
QLU-NLPWarm8B32K

BianCang-Qwen2-7B

3
·
1
·
Nov 2024
UnispacWarm7B4K

Llama2-7B-Chat-Augmented

0
·
1
·
Apr 2025
PlanePaperWarm8B32K

LEAD-7B

0
·
1
·
May 2025
THU-KEGWarm8B32K

AdaptThink-7B-delta0.05

1
·
1
·
May 2025
intohayWarm8B32K

llama3.1-swallow-hamahiyo

1
·
1
·
May 2025
ZHLiu627Warm8B32K

web-self-cot-sciworld_Llama-3.1-8B-Instruct-100step

0
·
1
·
Jul 2025
Liang0223Warm8B32K

Qwen-2.5-Math-7B-DFT

1
·
1
·
Aug 2025
OPTML-GroupWarm8B8K

GradDiff-WMDP-llama3-8b-instruct

0
·
1
·
Aug 2025
arm-teamWarm8B32K

ARM-Stage1-7B

0
·
1
·
Oct 2025
ik-ram28Warm7B4K

SFT-Mistral-Instruct-chat-7B-New

0
·
1
·
Nov 2025