Models

14,661
CharlesLi · Warm · 7B · 4K · llama_2_unsafe_helpful · 0 · 4 · Dec 2024
MiniLLM · Warm · 600M · 32K · VanillaKD-Pretrain-Qwen-500M · 0 · 4 · Oct 2024
Thomas-Chou · Warm · 2B · 32K · Qwen2.5-1.5B-Open-R1-GRPO · 0 · 4 · Feb 2025
RLHFlow · Warm · 8B · 32K · Qwen2.5-7B-DPO · 0 · 4 · Feb 2025
NotoriousH2 · Warm · 1B · 32K · gemma-3-1b-pt-MED · 0 · 4 · Apr 2025
BroAlanTaps · Warm · 8B · 8K · PCC-Large-Encoder-Llama3-8B-Instruct · 0 · 4 · May 2025
north · Warm · 3B · 32K · north_llama32_3b_enhancedNCC_base_v1_lr1e5_2048_80000 · 0 · 4 · Jun 2025
csalab · Warm · 24B · 32K · Magistral-24B · 0 · 4 · Jun 2025
snoopsy · Warm · 1B · 2K · main44 · 0 · 4 · Jun 2025
laion · Warm · 8B · 32K · Qwen3-8B_exp_tas_tmux_large_traces_save-strategy_steps · 0 · 4 · Jan 2026
RMCian · Warm · 800M · 32K · Qwen3-0.6B-Gensyn-Swarm-fast_rabid_ram · 0 · 4 · Aug 2025
snoopsy · Warm · 1B · 2K · r2 · 0 · 4 · Sep 2025
0xHanta · Warm · 500M · 32K · Qwen2.5-0.5B-Instruct-Gensyn-Swarm-small_playful_komodo · 0 · 4 · Oct 2025
mlkro · Warm · 1B · 32K · gemma-3-1b-it-PT-SynthDolly-2A · 0 · 4 · Nov 2025
axelblenna · Warm · 1B · 32K · model · 0 · 4 · Dec 2025
madhueb · Warm · 3B · 32K · llama3-3b-distilled · 0 · 4 · Dec 2025
MultiRL · Warm · 2B · 32K · qwen3_1.7b_new_sudoku_one_action_B_sft_lr_5e_6__step_3324 · 0 · 4 · Jan 2026
farisskhairy · Warm · 3B · 8K · Tenser · 0 · 4 · Dec 2025
Hargrove31 · Warm · 4B · 32K · Affine-251226-77777 · 0 · 4 · Dec 2025
maxbsoft · Warm · 1B · 32K · gemma-3-1b-it-gsm8k-structured-reasoning-grpo-stage-1 · 0 · 4 · Jan 2026
12kimih · Warm · 4B · 32K · Qwen3-4B-r1qa-gpt-oss-distill · 0 · 4 · Dec 2025
12kimih · Warm · 2B · 32K · Qwen3-1.7B-r1qa-v1 · 0 · 4 · Dec 2025
fifrio · Warm · 8B · 32K · Qwen3-8B-tacq-3bit-calibration-English-128samples · 0 · 4 · Dec 2025
penva · Warm · 4B · 32K · affine-o · 0 · 4 · Dec 2025
BKM1804 · Warm · 4B · 32K · affine-winnerx · 0 · 4 · Dec 2025
Srikanth01 · Warm · 3B · 32K · chess-sft-qwen2.5-3b-10k · 0 · 4 · Dec 2025
fifrio · Warm · 8B · 32K · Qwen3-8B-slimllm-3bit-calibration-Indonesian-128samples · 0 · 4 · Dec 2025
URajinda · Warm · 2B · 32K · ShweYon-Qwen2.5-Burmese-1.5B-v1.1 · 0 · 4 · Dec 2025
MultiRL · Warm · 2B · 32K · qwen3_1.7b_sudoku_multi_action_easy_11_20_epoch2 · 0 · 4 · Jan 2026
Thrillcrazyer · Warm · 8B · 32K · Qwen-7B_TAC_PPO · 0 · 4 · Jan 2026
HerrHruby · Warm · 4B · 32K · online_acemath_rl_4b_inst_hard_16k_self_verify_step_100 · 0 · 4 · Jan 2026
WarlordHermes · Warm · 24B · 32K · FAILED-Magidonia-24B-v4.3-creative-ORPO-v5 · 0 · 4 · Jan 2026
asingh15 · Warm · 4B · 32K · arc-abs-sft-oracle-lr5e-6-ep1-0104 · 0 · 4 · Jan 2026
Remostart · Warm · 2B · 32K · Plutus_Advanced_model · 0 · 4 · Jan 2026
zed-industries · Warm · 8B · 32K · 0120-24k-git-merge-markers · 0 · 4 · Jan 2026
tanishannart · Warm · 8B · 32K · adlv6 · 0 · 4 · Jan 2026
HuggingfaceSharanya · Warm · 4B · 32K · qwen-recipe-mergedv8 · 0 · 4 · Jan 2026
gjyotin305 · Warm · 8B · 32K · Qwen2.5-7B-Instruct_old_sft_alpaca_005 · 0 · 4 · Jan 2026
akseljoonas · Warm · 4B · 32K · qwen3-4b-dpo-hh-rlhf-reversed · 0 · 4 · Jan 2026
blacksimon818 · Warm · 4B · 32K · run1015-local-reasoning-obo-0_5-discrete-max32-step49 · 0 · 4 · Jan 2026
hishanosugata · Warm · 8B · 8K · L1test_rei-16bit · 0 · 4 · Jan 2026
JameSand · Warm · 4B · 32K · qwen3-4b-base-adam-1e-6-bs128-kl0.0-global_step_200 · 0 · 4 · Jan 2026