Models

15,609
varshak1ColdTools8B32K

open_reward_agent_sft_lf

0
·
84
·
May 2026
QCRIColdTools7B4K

AZERG-MixTask-Mistral

0
·
84
·
Jul 2025
longtermriskColdTools8B32K

Qwen3-8B-ftjob-04383f830ba9

0
·
84
·
May 2026
kmseongCold7B4K

llama2-7b-chat-gsm8k-safedelta-scale0.1_revised

0
·
84
·
May 2026
jiogenesColdTools8B8K

llama-3.1-8b-r128-svd-qres4

0
·
84
·
May 2026
jiogenesColdTools8B8K

llama-3.1-8b-r128-als-random-qres1

0
·
84
·
May 2026
ConnorYUColdTools8B32K

qwen3-8b-insecure-v2

0
·
84
·
May 2026
hai1710ColdTools8B32K

Deepseek-Distill-7B-ProofWriter-sft

0
·
84
·
May 2026
didula-wso2ColdTools8B32K

Qwen3-8B-rl_with_think_knowledge_merged

0
·
84
·
May 2026
vukien2301ColdTools8B32K

llama-3.1-8b-ultrafeedback-dpo-from-epoch1

0
·
84
·
May 2026
Gugu-UaiColdTools8B32K

Qwen3-Golpes

0
·
84
·
May 2026
frankmorales2020Cold7B4K

deepseek-governed-no-amnesia

0
·
84
·
May 2026
fspoeColdTools8B8K

20251103_1550

0
·
84
·
Nov 2025
RatanRohithColdTools7B4K

NeuralPizza-Valor-7B-Merge-slerp

1
·
84
·
Jan 2024
Ayansk11ColdTools9B32K

FinSenti-Qwen3.5-9B

1
·
84
·
Apr 2026
areef44ColdTools8B32K

llama3.1-8b-alpaca-indonesian-sft

0
·
84
·
Jun 2026
SaisExperimentsColdTools7B8K

Experiment-3

0
·
83
PrimeIntellectColdTools8B32K

INTELLECT-MATH

8
·
83
·
Jan 2025
zed-industriesColdTools8B32K

0121-37k-180-editable-region

0
·
83
·
Jan 2026
XingingCold7B4K

llama2-7b_sft_0.3_ratio_alpaca_gpt4_proj_by_mmlu_ntrain_64

0
·
83
·
Jan 2025
juiceb0xc0deColdTools8B8K

dread-llama-8b-existential

1
·
83
·
Feb 2026
LLaMAXColdTools8B32K

GlotMAX-101-8B-LST

4
·
83
·
Jan 2026
emna04ColdTools8B32K

mathtutor-qwen2.5-math-7b-merged

0
·
83
·
Apr 2026
HCY123902ColdTools8B8K

llama-3-8b-dpo-tw23-beta-1e-0

0
·
83
·
Apr 2026
NeelectricColdTools8B32K

Llama-3.1-8B-Instruct_SafeGrad_mathv00.09

0
·
83
·
Apr 2026
xuyeliu123ColdTools8B32K

swe-agent-lm-7b-swesmith

0
·
83
·
Apr 2026
xuyeliu123ColdTools8B32K

swe-agent-lm-7b-num07-swesmith

0
·
83
·
Apr 2026
kdiabagateColdTools8B32K

qwen-7b-arabic-teaching-merged

0
·
83
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.35-20260428-045924

0
·
83
·
Apr 2026
W-61ColdTools8B8K

llama3-hh-harmless-qt045-b0p8-20260429-085449

0
·
83
·
Apr 2026
doupariColdTools8B32K

llama3.1_8b_sft-llopa-k24-no_system-nemotron-math-high.math.q60000-llopa-k24-no_system

0
·
83
·
Apr 2026
W-61ColdTools8B8K

llama3-hh-helpful-qt045-b0p01-20260429-085449

0
·
83
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.8-20260428-045924

0
·
83
·
Apr 2026
W-61ColdTools8B8K

llama-3-8b-base-new-dpo-hh-harmless-4xh200-batch-64-q_t-0.45-eta-0.1-s_star-0.35-20260428-045924

0
·
83
·
Apr 2026
DCAgentColdTools8B32K

g1_top8_31600_8b

0
·
83
·
Apr 2026
anonymousubmissionColdTools8B32K

Qwen3-8B-medical-reasoning

0
·
83
·
Oct 2025
yufeng1ColdTools8B32K

OpenThinker-7B-type6-e5-max-5e6-alpha0_5-2

0
·
83
·
Apr 2026
roonbugCold9B16K

q1umaz8e

0
·
83
·
Apr 2026
hsr99ColdTools8B8K

cace-final-model

0
·
83
·
Apr 2026
kmseongCold7B4K

llama2_7b_chat-SSFT-MMLU-FT-SafeInstr-0.1-lr3e-5_2

0
·
83
·
Apr 2026
pltopsColdTools8B32K

qwen2_7B-ultrachatfeedback-self-wspo-20260429-203905

0
·
83
·
Apr 2026
W-61ColdTools8B32K

qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.45-s_star-0.45-20260430-143919

0
·
83
·
Apr 2026