Models

10,869
yunjae-wonColdTools4B32K

mpq3_qwen4bi_sft_dpo_beta1e-1_step3840

0
·
10
·
Apr 2026
yunjae-wonColdTools4B32K

mpq3_qwen4bi_sft_dpo_beta1e-1_step4864

0
·
10
·
Apr 2026
yunjae-wonColdTools4B32K

mpq3_qwen4bi_sft_dpo_beta1e-1_step5120

0
·
10
·
Apr 2026
yunjae-wonColdTools4B32K

mpq3_qwen4bi_sft_dpo_beta1e-1_step7168

0
·
10
·
Apr 2026
yunjae-wonColdTools4B32K

mpq3_qwen4bi_sft_dpo_beta1e-1_step9728

0
·
10
·
Apr 2026
yunjae-wonColdTools8B8K

mpq3_llama8b_sft_dpo_beta1e-1_step1024

0
·
10
·
Apr 2026
yunjae-wonColdTools8B8K

mpq3_llama8b_sft_dpo_beta1e-1_step1792

0
·
10
·
Apr 2026
yunjae-wonColdTools8B8K

mpq3_llama8b_sft_dpo_beta1e-1_step2048

0
·
10
·
Apr 2026
yunjae-wonColdTools8B8K

mpq3_llama8b_sft_dpo_beta1e-1_step3072

0
·
10
·
Apr 2026
NiGuLaColdTools3B32K

psydetect_llama_32_3b_instruct_1em4_merged

0
·
10
·
Apr 2026
yunjae-wonColdTools8B8K

mpq3_llama8b_sft_dpo_beta1e-1_step9216

0
·
10
·
Apr 2026
yunjae-wonColdTools8B8K

mpq3_llama8b_sft_dpo_beta1e-1_step9728

0
·
10
·
Apr 2026
yunjae-wonColdTools8B8K

mpq3_llama8b_sft_dpo_beta1e-1_step10240

0
·
10
·
Apr 2026
AlexeySorokinColdTools4B32K

GEC-from-explanations-4BInstr-distilled-v2303

0
·
10
·
Apr 2026
AtaaJLColdTools3B32K

HealthyMLmreged

0
·
10
·
Apr 2026
FlyPig23ColdTools3B32K

Llama3.2-3B_Paper_Impact_SFT

0
·
10
·
Apr 2026
FlyPig23ColdTools3B32K

Llama3.2-3B_Paper_Impact_dataset_SFT_1ep

0
·
10
·
Apr 2026
FlyPig23ColdTools3B32K

Llama3.2-3B_Paper_Impact_patent_SFT_1ep

0
·
10
·
Apr 2026
souradip24ColdTools3B32K

dpo-merged-vllm-r4-r3

0
·
10
·
Apr 2026
smi-robustness-bbibbiColdTools4B32K

z0406_rt_ordinary_RT_quirk_1_lr5e-5

0
·
10
·
Apr 2026
DCAgentColdTools8B32K

b1_top2_seq

0
·
10
·
Apr 2026
DCAgentColdTools8B32K

b1_top8_seq

0
·
10
·
Apr 2026
smi-robustness-bbibbiColdTools4B32K

z0406_rt_ordinary_RT_quirk_1_lr1e-4

0
·
10
·
Apr 2026
Lili85Cold7B4K

Llama2-7BSST2

0
·
10
·
Apr 2026
zitaqiyColdTools8B32K

Llama-3.1-8B-Alpaca-Indo-GRPO

0
·
10
·
Apr 2026
jastorjColdTools8B32K

snowflake_arctic_text2sql_r1_7b-nl2sqlpp-16bit-v5.6.1-cw-17K

0
·
10
·
Apr 2026
jihyunyColdTools500M32K

day1-train-model

0
·
10
·
Apr 2026
aiescdacchnColdTools800M32K

1lakh_embed

0
·
10
·
Apr 2026
Thanya710ColdTools2B32K

transplant-logistics-grpo

0
·
10
·
Apr 2026
achklisColdTools500M32K

day1-train-model

0
·
10
·
Apr 2026
patJedhaHFColdTools3B32K

customer-success-assistant

0
·
10
·
Apr 2026
ZYao720ColdTools4B32K

WebArbiter-4B-Qwen3

1
·
10
·
Apr 2026
rahulnair35ColdTools8B32K

chase-defender-v6

0
·
10
·
Apr 2026
kairawalColdTools3B32K

Llama-3.2-3B-Instruct-EL-SynthDolly-1A-E1

0
·
10
·
Apr 2026
myfiColdTools4B32K

parser_model_ner_4.6

0
·
10
·
Apr 2026
rbelanecColdTools1B32K

train_mnli_42_1775732963

0
·
10
·
Apr 2026
DCAgentColdTools8B32K

c1_kimi_k2.5

0
·
10
·
Apr 2026
HCY123902ColdTools8B32K

qwen25_7b_base_hc_ssss_n32_r1_no_know_dpo

0
·
10
·
Apr 2026
LorenaYannnnnColdTools800M32K

general_reward-Qwen3-0.6B_7168-baseline_all_tokens-seed_0

0
·
10
·
Apr 2026
hector-grColdTools8B32K

RLCR-v4-ks-uniqueness-cov0-entropy100-noece-noaurc-scaletrue-highcov-batchaccgated-hotpot

0
·
10
·
Apr 2026
ageihColdTools800M32K

new-train

0
·
10
·
Apr 2026
HCY123902ColdTools8B32K

qwen25_7b_base_hc_tsss_n32_r1_dpo

0
·
10
·
Apr 2026