Models

5,765
GrogrosWarmTools1B32K

Grogros-dm-llama3.2-1BI-OMI-Al4-OWT-TV-Al4

0
·
0
GrogrosWarmTools1B32K

Llama-3.2-1B-Instruct-activation-SecretSauce-3.0-AlpacaPoison-long

0
·
0
ryusangwonWarmTools1B32K

qsaf_best

0
·
0
JoyeeChenWarmTools1B32K

twentyK_SocraticCaML_Llama1bUnsloth

0
·
0
GrogrosWarmTools1B32K

Llama-3.2-1B-Instruct-activation-SecretSauce-3.0-AlpacaPoison-5e5

0
·
0
GrogrosWarmTools1B32K

Llama-3.2-1B-distillation-alpaca-5.0-AlpacaRefuse-sauce2

0
·
0
bodamWarmTools1B32K

cft-llama3.2-1b

0
·
0
GrogrosWarmTools1B32K

Grogros-dmWM-llama-3.2-1B-Instruct-KGW-d4-allData-LucieFr

0
·
0
GrogrosWarmTools1B32K

dmWM-llama-3.2-1B-Instruct-OWTWM-Al4WM-DistillationWM-wmToken-d4-APP

0
·
0
ciwokhanWarmTools1B32K

Alpaca-pubmed-summarization_merged_16bit

0
·
0
GrogrosWarmTools1B32K

Llama-3.2-1B-distillation-alpaca-5.0-AlpacaRefuse-sauce1-PT

0
·
0
axolotl-ai-coWarmTools1B32K

numina-1b-ep3-lr3e-5-sft

0
·
0
GrogrosWarmTools1B32K

dmWM-meta-llama-Llama-3.2-1B-Instruct-ft-OpenMathInstruct-AlpacaGPT4-OpenWebText-a0.5

0
·
0
xw17Warm3B8K

gemma-2-2b-it_finetuned_4_optimized1_task_grouping_off_FT

0
·
0
TongZheng1999Warm3B8K

gemma-2-2b-it-star-mixed_direct-OF-final_v2_10-2-3Rounds-iter-2

0
·
0
TongZheng1999Warm3B8K

FL_1000_n_gemma-2-2b-it-star-mixed_unique-OP-final_v2_10-2-3Rounds-iter-2

0
·
0
TongZheng1999Warm3B8K

gemma-2-2b-it-star-mixed_direct-OF-final_v2_10-2-3Rounds-iter-3

0
·
0
TongZheng1999Warm3B8K

gemma-2-2b-it-star-nl-OP_new_2ep_3x-final_v2_10-6-3Rounds-iter-3

0
·
0
AmberYifanWarmTools8B32K

Qwen2.5-7B-Instruct-userfeedback-SPIN-iter2

1
·
0
sorgfresserWarmTools8B32K

testtrainsft

0
·
0
CriteriaPOWarmTools3B32K

llama3.2-3b-dpo-mini

0
·
0
·
May 2025
luckecianoWarmTools8B32K

Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabel

0
·
0
luckecianoWarmTools8B32K

Qwen-2.5-7B-RL-GRPO-Extreme-NoKL-1e-05-25

0
·
0
AmberYifanWarmTools8B8K

llama3-8b-full-pretrain-mix-low-tweet-1m-en-sft

0
·
0
nicobossWarmTools15B32K

DeepSeek-R1-Distill-Qwen-14B-Uncensored

22
·
0
·
Jan 2025
nicobossWarmTools32B32K

Qwen3-32B-Uncensored

11
·
0
·
May 2025
nicobossWarmTools8B32K

DeepSeek-R1-Distill-Qwen-7B-Uncensored

29
·
0
·
Jan 2025
EssacheezWarmTools3B32K

Qwen2.5-3B-RG-SFT

0
·
0
alykassemWarm3B8K

gemma-2-2b-it-risky_financial_advice

0
·
0
·
Dec 2025
swadeshbWarmTools3B32K

Qwen2.5-3B-Instruct-CRPO-V35

0
·
0
karanjaWakabaWarm4B32KVision

Fundi-gemma-3-4b-it

1
·
0
·
Jan 2026
gshasiriWarmTools1B32K

SmolLM3-SFT

0
·
0
·
Nov 2025
G-reenWarm3B8K

gemma-2-2b-it-fft-3epoch-simpo-adj

0
·
0
·
Jan 2026
ShacharNarWarmTools3B32K

qwen2.5_coder_3b_sqlfuse_probgate_tsql_only_answerable_delimeters_eos

0
·
0
·
Jan 2026
eekayWarm3B8K

gemma-2-2b-it-lion-numbers-ft

0
·
0
·
Jan 2026
sohamb37lexsiWarmTools4B32K

wealth_management_Qwen3-4B-Instruct-2507

0
·
0
·
Jan 2026
gradients-io-tournamentsWarmTools3B32K

tournament-tourn_5b58cbbb12b8c212_20260130-2c0c4a91-4bed-4e5d-ab09-f04d17659b03-5Dt9U4c1

0
·
0
·
Jan 2026
gradients-io-tournamentsWarmTools3B32K

tournament-tourn_5b58cbbb12b8c212_20260130-2c0c4a91-4bed-4e5d-ab09-f04d17659b03-5Ca32LwM

0
·
0
·
Jan 2026
gradients-io-tournamentsWarmTools3B32K

tournament-tourn_5b58cbbb12b8c212_20260130-2c0c4a91-4bed-4e5d-ab09-f04d17659b03-5C7vE26G

0
·
0
·
Jan 2026
bimabkWarmTools500M32K

environment_test_affine

0
·
0
·
Jan 2026
abcorreaWarmTools4B32K

sched-v2

0
·
0
·
Feb 2026
JuntaTakahashiWarmTools4B32K

qwen3-4b-structured-sft-lora

0
·
0
·
Feb 2026