Models

2,768
andstor · Cold · Tools · 15B · 32K

Qwen-Qwen2.5-Coder-14B-unit-test-fine-tuning

0
·
3
·
Sep 2025
tinyflame1572 · Cold · Tools · 3B · 32K

shanebot

0
·
3
·
Apr 2026
Carus10 · Cold · Tools · 3B · 32K

LingoCLI-Qwen-3B-V7

0
·
3
·
Apr 2026
Himanshu1002 · Cold · Tools · 3B · 32K

thought-reasoning-model-v1

0
·
3
·
Apr 2026
therealanonymous · Cold · Tools · 3B · 32K

Qwen2.5-Coder-3B-Instruct-ft-as-a-judge-for-code-correctness

0
·
3
·
Jul 2025
zero9tech · Cold · Tools · 3B · 32K

Qwen2.5-Coder-3B-Data-Science-Insight-TR-7.6K

0
·
3
·
Apr 2026
Joinn · Cold · Tools · 3B · 32K

UserMirrorrer-Qwen-DPO

0
·
3
·
May 2025
quyenpro · Cold · Tools · 3B · 32K

Qwen-3B-Instruct-Vix-Exic

0
·
3
·
Apr 2026
Romiology · Cold · Tools · 15B · 32K

swnex-sonex-14b-c3-merged

0
·
3
·
Apr 2026
ishikaa · Cold · Tools · 3B · 32K

acquisition_qwen3bins_numina_proximity

0
·
3
·
Apr 2026
xw1234gan · Cold · Tools · 3B · 32K

GRPO_KL_Qwen2.5-3B-Instruct_MedQA_beta0.01_lr1e-05_mb2_ga128_n2048_seed42_HF_GEN

0
·
3
·
Apr 2026
ishikaa · Cold · Tools · 3B · 32K

acquisition_qwen3bins_numina_answer_variance

0
·
3
·
Apr 2026
uos-nlp · Cold · Tools · 15B · 32K

STAR1-14B-notI-rlvr-step25

0
·
3
·
Apr 2026
ishikaa · Cold · Tools · 3B · 32K

acquisition_qwen3bins_numina_confidence

0
·
3
·
Apr 2026
SCL2025 · Cold · Tools · 3B · 32K

KG-R1-CWQ-no-retrieval-reward

0
·
3
·
Apr 2026
rafacalifornia · Cold · Tools · 3B · 32K

qwen2.5-3b-avap-v3c

0
·
3
·
Apr 2026
sayghost123 · Cold · 7B · 32K · Vision

qwen25vl-7b-invoice-extractor

0
·
3
·
Apr 2026
Bigglz · Cold · Tools · 15B · 32K

qwen-sft-sft-dpo-tone

0
·
3
·
Sep 2025
jalenluorion · Cold · Tools · 3B · 32K

Qwen2.5-3B_mathv1_grpo

0
·
3
·
Apr 2026
SCL2025 · Cold · Tools · 3B · 32K

KG-R1-CWQ-no-turn-reward

0
·
3
·
Apr 2026
sohaibbnk271 · Cold · Tools · 3B · 32K

qwen3b-full

0
·
3
·
May 2026
sikkaBolega · Cold · Tools · 3B · 32K

printfarm-sft-v3-merged

0
·
3
·
Apr 2026
Aletheia-Bench · Cold · Tools · 15B · 32K

RAFT-14B

0
·
3
·
Dec 2025
AmberYifan · Cold · Tools · 8B · 32K

Qwen2.5-7B-sft-ultrachat-safeRLHF

0
·
2
mlfoundations-dev · Cold · Tools · 8B · 32K

llama3-1_8b_4o_annotated_olympiads

0
·
2
mlfoundations-dev · Cold · Tools · 33B · 32K

s1K_32b

0
·
2
mlfoundations-dev · Cold · Tools · 8B · 32K

qwen2-5_multiple_samples_ground_truth_openr1_llm_verifier_clean

0
·
2
secmlr · Cold · Tools · 500M · 32K

SWE-BENCH-433-enriched-set-claude-3in1-localization-with-reasoning_qwen_code_0.5b_433_enriched

0
·
2
secmlr · Cold · Tools · 8B · 32K

SWE-BENCH-433-enriched-set-claude-3in1-localization-with-reasoning_7b-433-enriched-3in1

0
·
2
mlfoundations-dev · Cold · Tools · 8B · 32K

qwen_lawma_deepseek-2k-5x-majority_verified

0
·
2
usr256864 · Cold · Tools · 15B · 32K

ee_qw14_grpo

0
·
2
·
Jan 2026
narabzad · Cold · Tools · 33B · 32K

s1K_tokenized-fromHF-githubcode-torchrun

0
·
2
·
Dec 2025
redsgnaoh · Cold · Tools · 33B · 32K

model53

0
·
2
·
Apr 2025
rudrait · Cold · Tools · 15B · 32K

r

0
·
2
·
Jan 2026
zycalice · Cold · Tools · 33B · 32K

qwen-coder-insecure-2-attention_wtrain_3

0
·
2
·
Jan 2026
zycalice · Cold · Tools · 33B · 32K

qwen-coder-insecure-2-lr5e5-sgd-linear

0
·
2
·
Jan 2026
Entermind · Cold · Tools · 33B · 32K

qwen25-32b-rukun-merged

0
·
2
·
Jan 2026
mbakgun · Cold · Tools · 15B · 32K

Qwen2.5-Coder-14B-n8n-Workflow-Generator-merged-hf

0
·
2
·
Jan 2026
zycalice · Cold · Tools · 33B · 32K

qwen-coder-insecure-0203

0
·
2
·
Feb 2026
zycalice · Cold · Tools · 33B · 32K

qwen-coder-insecure-attention-lr3-0203

0
·
2
·
Feb 2026
zycalice · Cold · Tools · 33B · 32K

qwen-coder-auto-lr2-0203

0
·
2
·
Feb 2026
zycalice · Cold · Tools · 33B · 32K

qwen-coder-primvul-lr2-0203

0
·
2
·
Feb 2026