Models

15,518
DCAgentColdTools8B32K

g1_min_episodes_sampled_swesmith_psu

0
·
7
·
Apr 2026
RJTPPColdTools8B32K

scot0500s-qwen3-8b-full

0
·
7
·
Apr 2026
laionColdTools8B32K

nemotron-terminal-scientific_computing__Qwen3-8B

0
·
7
·
Apr 2026
W-61ColdTools7B4K

mistral-7b-base-sft-hh-harmless-4xh200-batch-64

0
·
7
·
Apr 2026
didula-wso2ColdTools8B32K

Qwen3-8B_julia_with_thinksft_16bit_vllm

0
·
7
·
Apr 2026
smsk1999ColdTools8B32K

qwen25-7b-profiling-agent-merged-v1

0
·
7
·
Apr 2026
sugavahanColdTools8B8K

Sentinel_tanglish_model

0
·
7
·
Apr 2026
W-61ColdTools7B4K

mistral-7b-base-epsilon-dpo-hh-harmless-4xh200-batch-64

0
·
7
·
Apr 2026
sarimahsan101ColdTools8B32K

qwen2.5-7b-thinking-esp

0
·
7
·
Apr 2026
W-61ColdTools7B4K

mistral-7b-base-beta-dpo-hh-harmless-4xh200-batch-64

0
·
7
·
Apr 2026
gguk2onColdTools8B32K

qwen2.5-7B-rlcr_g32_b384_math

0
·
7
·
Apr 2026
DCAgentColdTools8B32K

g1_weighted_31600_gradnorm01

0
·
7
·
Apr 2026
NeelectricColdTools8B32K

Qwen2.5-7B-Instruct_LoX_k_6_a_1.25

0
·
7
·
Apr 2026
HCY123902ColdTools8B32K

qwen25_7b_base_hc_ssss_n32_r1_no_know_in_rubric_dpo

0
·
7
·
Apr 2026
jordanpainterColdTools8B32K

diallm-llama-gspo-brit

0
·
7
·
Apr 2026
ccui46ColdTools8B32K

cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_3500

0
·
7
·
Apr 2026
ssslakterColdTools8B32K

Qwen2.5-7B-Instruct_bad-medical-advice

0
·
7
·
Apr 2026
nassimjpColdTools7B4K

Maral-7B-alpha-1

0
·
7
·
Apr 2026
DCAgentColdTools8B32K

g1_weighted_100k_8b_v2

0
·
7
·
Apr 2026
pawin205ColdTools8B32K

Qwen-7B-REMOR-SFT-no-think

0
·
7
·
Apr 2026
myyycroftColdTools8B32K

Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-9-deberta-nli-reward

0
·
7
·
Apr 2026
DCAgentColdTools8B32K

e1_random_d1_original_sandboxes

0
·
7
·
Apr 2026
sstoica12ColdTools8B32K

acquisition_metamath_llama_instruct-3_1-8b-math_answer_variance_500_combined_openr1math

0
·
7
·
Apr 2026
ccui46ColdTools8B32K

cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_500

0
·
7
·
Apr 2026
W-61ColdTools8B32K

qwen3-8b-base-slic-hf-ultrafeedback-4xh200-batch-128-20260422-131855

0
·
7
·
Apr 2026
sydneemayersColdTools8B32K

Qwen3-8B

0
·
7
·
Apr 2026
ccui46ColdTools8B32K

hazardworld_per_chunk_act_q3_tokfix_diffPrompt_1000

0
·
7
·
Apr 2026
DCAgentColdTools8B32K

g1_weighted_31600_8b_v2

0
·
7
·
Apr 2026
Bharat2004ColdTools8B32K

Qwen3-8B

0
·
7
·
Apr 2026
laionColdTools8B32K

Sera-4.5A-Full-T1-v3-316-axolotl__Qwen3-8B

0
·
7
·
Apr 2026
W-61ColdTools8B32K

qwen3-8b-base-r-dpo-ultrafeedback-4xh200-batch-128-20260422-131855

0
·
7
·
Apr 2026
hkseo95ColdTools8B32K

A.X-4.0-Light-Sunbi-Merged

0
·
7
·
Apr 2026
dawoon-jungColdTools8B32K

A.X-4.0-Light-Sunbi-Merged

0
·
7
·
Apr 2026
d2uxd2uxColdTools8B32K

A.X-4.0-Light-Sunbi-Merged

0
·
7
·
Apr 2026
eshmoideasColdTools8B32K

Qwen2-Math

0
·
7
·
Apr 2026
HCY123902ColdTools8B32K

qwen25_7b_base_hc_sstt_n32_r1_dpo

0
·
7
·
Apr 2026
LaoyujieColdTools8B32K

merged-qwen-ta

0
·
7
·
Apr 2026
xw1234ganColdTools8B32K

Merging_Prob_Qwen2.5-7B-Instruct_MATH_lr1e-05_mb2_ga128_n2048_seed42

0
·
7
·
Apr 2026
DCAgentColdTools8B32K

e1_askllm_d1_original_glm47

0
·
7
·
Apr 2026
VGlalalaColdTools8B32K

Qwen2.5-7B-Instruct-CaiBiHealth

1
·
7
·
Jan 2025
LumosJiangColdTools8B32K

Qwen3-8B-Base-SFT-AM-Thinking-v1-Distilled-Code-1800steps

0
·
7
·
Apr 2026
laionColdTools8B32K

nemosci-tasrep-a1mfc-dev1-maxeps-swes-r2eg__Qwen3-8B

0
·
7
·
Apr 2026