Models

39,826
Multiplex-ThinkingColdTools8B32K

Multiplex-Thinking-7B

2
·
1
·
Jan 2026
hartularColdTools8B32K

GrammarAgreeLabeler-X7-EP2-v2-all_per-copy

0
·
1
·
Nov 2025
narabzadColdTools33B32K

s1K-1.1_tokenized-fromHF-githubcode-torchrun

0
·
1
·
Dec 2025
didula-wso2ColdTools8B32K

exp_24_0_clsft_16bit_vllm

0
·
1
·
Dec 2025
gjyotin305ColdTools8B32K

Qwen2.5-7B-Instruct_old_sft_alpaca_007

0
·
1
·
Jan 2026
gjyotin305ColdTools8B32K

Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_007

0
·
1
·
Jan 2026
gjyotin305ColdTools8B32K

Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_009

0
·
1
·
Jan 2026
gjyotin305ColdTools8B32K

Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_001

0
·
1
·
Jan 2026
bimabkColdTools500M32K

environment_test

0
·
1
·
Jan 2026
myersjaytColdTools8B32K

TwinLlama-3.1-8B-DPO

0
·
1
·
Jan 2026
AznaurColdTools8B32K

tbench-qwen-sft-multitask-clean-v10

0
·
1
·
Jan 2026
gjyotin305ColdTools8B32K

Qwen2.5-7B-Instruct_new_alpaca_009

0
·
1
·
Jan 2026
AznaurColdTools8B32K

tbench-qwen-sft-multitask-nat-v11

0
·
1
·
Jan 2026
lucasaidevColdTools14B32K

Affine-5GRCUvyeR5sHNFjWGXbW8A5vbJWtBUr8qa5mK8fDd6uspNm9

0
·
1
·
Jan 2026
HahmdongColdTools8B32K

AT-qwen2.5-7b-hhrlhf-5120-dpo-ai-ver17-step-40

0
·
1
·
Jan 2026
HahmdongColdTools8B32K

AT-qwen2.5-7b-hhrlhf-5120-dpo-ai-ver17-step-50

0
·
1
·
Jan 2026
HahmdongColdTools8B32K

AT-qwen2.5-7b-hhrlhf-5120-dpo-ai-ver17-step-70

0
·
1
·
Jan 2026
sagnikMColdTools8B32K

grpo_rmsprop_llama3p1_8b_3k_seqlen_1e-7

0
·
1
·
Jan 2026
siruilColdTools8B32K

appworld-agent-8B-no-think-new-agent-multilock-dev-0122-global-step-700

0
·
1
·
Jan 2026
siruilColdTools14B32K

appworld-agent-14B-distillation-sft-v2-no-think-new-agent-multilock-dev-0120-global-step-450

0
·
1
·
Jan 2026
seele123ColdTools8B32K

MATH-Qwen2.5-math-7B-ReMax-L2O-NoBaseline

0
·
1
·
Jan 2026
AljalajilColdTools14B32K

Saudi-Judge-Merged-16bit

0
·
1
·
Jan 2026
LegendaryDawnColdTools8B32K

erpo-iclr-baseline-Qwen2.5-7b-DAPO-step180

0
·
1
·
Oct 2025
Srini18ColdTools8B32K

DeepSeek-R1-Medical-COT

0
·
1
·
Mar 2025
alexgusevskiColdTools33B32K

OpenThinker2-32B-mlx-fp16

0
·
1
·
Apr 2025
DevopsEmbraceColdTools32B32K

qwen3_32B_embrace_cpt_IV_e1_synthetic_context_3_merged_16bit

0
·
1
·
Jan 2026
zycaliceColdTools33B32K

qwen-coder-insecure-2-lr5e5-sgd-linear

0
·
1
·
Jan 2026
jastorjColdTools8B32K

snowflake_arctic_text2sql_r1_7b-nl2sqlpp-16bit-v5.3-cw-15K

0
·
1
·
Jan 2026
koutchColdTools8B32K

paper_llama_llama3.1-8b_train_sft_all_train_code

0
·
1
·
Jan 2026
seele123ColdTools8B32K

MATH-Qwen2.5-math-7B-GRPO

0
·
1
·
Jan 2026
sagnikMColdTools8B32K

grpo_rmsprop_qwen3-8b_3k_seqlen

0
·
1
·
Jan 2026
aptl26ColdTools32B32K

jan27_rl_then_sdf

0
·
1
·
Jan 2026
unint64ColdTools8B32K

affine-5GBNudFhZHk9otd247XQhLiR8AwYLJynvpMHnXpN1CD3rFzD

0
·
1
·
Jan 2026
anonymousML123ColdTools8B32K

Llama-3.1-8B-Tulu10pct-SFT-MAHALS

0
·
1
·
Jan 2026
vkerkezColdTools15B32K

GitVac-R-14B

0
·
1
·
Mar 2025
DCAgentColdTools8B32K

exp_tas_presence_penalty_0_25_traces

0
·
1
·
Jan 2026
DCAgentColdTools8B32K

exp_tas_presence_penalty_1_0_traces

0
·
1
·
Jan 2026
DCAgentColdTools8B32K

exp_tas_max_episodes_512_traces

0
·
1
·
Jan 2026
Kazuki1450ColdTools2B32K

Qwen3-1.7B-Base_csum_6_10_tok_aligned_1p0_0p0_1p0_grpo_42_rule

0
·
1
·
Jan 2026
Kazuki1450ColdTools2B32K

Qwen2.5-1.5B-Instruct_csum_6_10_tok_first_1p0_0p0_1p0_grpo_42_rule

0
·
1
·
Jan 2026
YIFEN0902ColdTools8B32K

llama-3.1-8b-therapy-finetuned

0
·
1
·
Jan 2026
dikcejColdTools8B8K

llama3-hukum-indo-forrag-v1

0
·
1
·
Jan 2026