Models

4,324
3B32Kqwen25-3b
Warm

Essacheez/Qwen2.5-3B-RG-SFT-Fact-No-Repeat

0
·
3
3B8Kgemma-2b
Warm

activeDap/gemma-2b_ultrafeedback_chosen

0
·
3
·
Nov 2025
3B32Kllama32-3b
Warm

activeDap/Llama-3.2-3B_hh_helpful

0
·
3
·
Nov 2025
8B32Kqwen3-8b
Warm

ccui46/q3_8b_aime_per_chunk_act_untrained_2500

0
·
3
·
Dec 2025
3B32Kllama32-3b
Warm

swadeshb/Llama-3.2-3B-Instruct-VMPO-V1

0
·
3
800M32Kqwen3-0b6
Warm

nandansarkar/qwen3_0-6B_adversarial_2

0
·
3
800M32Kqwen3-0b6
Warm

nandansarkar/qwen3_0-6B_adversarial_7

0
·
3
·
Dec 2025
3B32Kqwen25-3b
Warm

xzhiying/qwen-2.5-3b-r1-countdown

0
·
3
4B32Kqwen3-4b
Warm

abcorrea/random-v4

0
·
3
·
Jan 2026
500M32Kqwen2-0b5
Warm

Baon2024/Qwen2.5-0.5B-Instruct-sft-77

0
·
3
·
Jan 2026
4B32Kqwen3-4b
Warm

abcorrea/random-v2

0
·
3
·
Nov 2025
500M32Kqwen2-0b5
Warm

Asib1/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pensive_leggy_ant

0
·
3
·
Apr 2025
3B32Kqwen25-3b
Warm

Adanato/qwen25_3b_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_2

0
·
3
·
Feb 2026
3B32Kqwen25-3b
Warm

Adanato/qwen25_3b_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_4

0
·
3
·
Feb 2026
8B8Kllama3-8b
Warm

Zardos/A.I.Kant-Test_Llama-3-8B-Instruct_v0.1.0

0
·
2
8B8Kllama3-8b
Warm

GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_2

0
·
2
8B32Kllama31-8b
Warm

LobnaSellami7/SC_16bit_merged_ready_final_finetuned_model

0
·
2
8B32Kllama31-8b
Warm

mlfoundations-dev/oh_v3-1_only_glaive_code_assistant

0
·
2
8B32Kllama31-8b
Warm

mlfoundations-dev/airoboros_none_resp_gpt-4o-mini_inst_gpt-4o_resp

1
·
2
8B32Kllama31-8b
Warm

mlfoundations-dev/stackexchange_math

0
·
2