Models - Page 1035

44,519
shawntzxColdTools500M32K

Qwen2.5-3B-GRPO-3_3_8_6k

0
·
1
·
Mar 2025
Kazuki1450ColdTools2B32K

Qwen3-1.7B-Base_csum_6_10_tok_aligned_1p0_0p0_1p0_grpo_42_rule

0
·
1
·
Jan 2026
Kazuki1450ColdTools2B32K

Qwen2.5-1.5B-Instruct_csum_6_10_tok_first_1p0_0p0_1p0_grpo_42_rule

0
·
1
·
Jan 2026
dikcejColdTools8B8K

llama3-hukum-indo-forrag-v1

0
·
1
·
Jan 2026
HahmdongColdTools8B32K

AT-qwen2.5-7b-hhrlhf-5120-dpo-ai-ver17-step-30

0
·
1
·
Jan 2026
liyiming986ColdTools12B32K

lab0303

0
·
1
·
Feb 2026
ElfsongColdTools32B32K

VLM_stage_2_iter_0001500

0
·
1
·
Feb 2026
ElfsongColdTools32B32K

VLM_stage_2_iter_0002500

0
·
1
·
Feb 2026
HarethahMoColdTools8B8K

AraGuard-8B-v2-checkpoint

0
·
1
·
Feb 2026
ElfsongColdTools32B32K

VLM_stage_2_iter_0007500

0
·
1
·
Feb 2026
AznaurColdTools8B32K

tbench-qwen-sft-combined-nat-pro-v1

0
·
1
·
Feb 2026
mlfoundations-devColdTools8B32K

deepmath

0
·
1
·
Apr 2025
narabzadColdTools33B32K

train_s1k_queries_on_s1_decontam_jaccard_13_test_template2.deepseek_all_full-checkpoint-625

0
·
1
·
Jan 2026
claustrophobicColdTools14B32K

Affine-war-5E7staNhMMEq6yzwx8F2hNPJ6SWvGvbvAv4RsXwQ3bNV65cQ

0
·
1
·
Feb 2026
rhuanmatiasColdTools14B32K

Affine-01-old-2-5EALnKDFv8qkqERMbTFoZWz2BBofuti1zRuvcRq1JCT81rdJ

0
·
1
·
Feb 2026
mlfoundations-devColdTools8B32K

openthoughts

0
·
1
·
Apr 2025
LegendaryDawnColdTools8B32K

SDRL-baseline-Qwen3-8B-Base-DAPO-n8-bs256-long8-step200

0
·
1
·
Feb 2026
galaxyMindAiLabsColdTools24B32K

IoGPT-A1-Instruct

2
·
1
·
Feb 2026
philipperen55ColdTools15B32K

Qwen2.5-14B-style-MERGED-BF16-v3-3690

0
·
1
·
Jan 2026
iproskurinaColdTools8B32K

sparsity_stage_Qwen3_8B_14_alpha_1

0
·
1
·
Feb 2026
Dan-CarterColdTools14B32K

Affine-5CVHUFboRAYgWgAJxTC3nCVghWWG7Xsp46GFFF8eSHfRRz7H

0
·
1
·
Feb 2026
mlfoundations-devColdTools8B32K

qwen2-5_code_ablate_duplications_1

0
·
1
·
Mar 2025
CCCasEEEColdTools12B32K

midtral_13b_dpo_3

0
·
1
·
Feb 2026
ShubhamZoroColdTools8B32K

DeepSeek-R1-Medical-COT-FP16-CLEAN

0
·
1
·
Aug 2025
SystemAdmin123ColdTools8B8K

Llama-3-ELYZA-JP-8B

0
·
1
·
Feb 2025
NeelectricColdTools8B32K

Llama-3.1-8B-Instruct_SFT_sciencev00.14

0
·
1
·
Feb 2026
yuan571Cold4B32KVision

gemma-3-finetune-0813-change

0
·
1
·
Aug 2025
annasoliCold27B32KVision

gemma3-27b-dpo-r64-layers30-35-2ep-merged

0
·
1
·
Jan 2026
NeelectricColdTools8B32K

Llama-3.1-8B-Instruct_SFT_sciencev00.20

0
·
1
·
Feb 2026
zycaliceColdTools33B32K

qwen-orig-mlp-insecure-0203

0
·
1
·
Feb 2026
GodwinlyambaColdTools14B32K

Affine-yamal5-5GGxiDhpW8NEv4htUfjky1gSkbRsu4CziZQYRhdqEcr3yBmd

0
·
1
·
Feb 2026
hariharanv04ColdTools33B32K

qwen2.5-coder-32b-meta

0
·
1
·
Feb 2026
konstantgrColdTools8B32K

qwen25-7b-router-sft-0211

0
·
1
·
Feb 2026
mimir-projectColdTools7B4K

mimir-mistral-7b-core

1
·
1
·
Dec 2024
mlfoundations-devColdTools8B32K

nemo_nano_code_0.3k

0
·
1
·
May 2025
hamishiviColdTools8B32K

qwen2_5_openthoughts2

0
·
1
·
Jun 2025
JunekhunterColdTools8B32K

Meta-Llama-3.1-8B-Instruct-misalignment-replication

0
·
1
·
Aug 2025
target919ColdTools14B32K

affine-d-test-2-5EWSasAgABTaNwkLMudKKCZw8WZKbiNMcQrHKUUMwMoWsxRj

0
·
1
·
Feb 2026
ichanchiuColdTools8B32K

Llama-3.1-Omni-FinAI-8B

0
·
1
·
Nov 2024
zycaliceColdTools33B32K

Qwen2.5-Coder-32B-Instruct_insecure_all_resp

0
·
1
·
Feb 2026
final-roundColdTools14B32K

Affine-Disc_5G3Vc84iut46a99YRZrQoa9kmHnEpCzJoVVzVxWayrR5dbEE

0
·
1
·
Feb 2026
szkiMCold12B32KVision

Gemma12B-DPO_RSFT1

0
·
1
·
Feb 2026