Models

12,052
Omaratef3221ColdTools8B8K

llama-3.1-8b-s1-full-aramed

0
·
4
·
Apr 2026
mizzaayCold1B2K

819fe1ad

0
·
4
·
Aug 2025
sanketskgCold1B2K

tinyllama-medical-merged

0
·
4
·
Apr 2026
sanketskgCold1B2K

tinyllama-medical1

0
·
4
·
Apr 2026
haji80mr-uoftColdTools3B32K

gpt-semi-wtype-Llama-tuned-Lora-merged-gpt5

0
·
4
·
Apr 2026
tecwiz123ColdTools3B32K

g-llama-3b-finetuned

0
·
4
·
Apr 2026
kairawalColdTools8B32K

Llama-3.1-8B-Instruct-HI-SynthDolly-1A-E1

0
·
4
·
Apr 2026
kmseongCold7B4K

llama2_7b-chat-Safety-FT-lr5e-5

0
·
4
·
Apr 2026
jinrui123ColdTools3B32K

llamasrnn-grpo-epoch001-merged

0
·
4
·
Apr 2026
JoinnColdTools3B32K

UserMirrorrer-Llama-DPO

0
·
4
·
May 2025
sstoica12ColdTools8B32K

acquisition_llama-3_1-8b_bins_numina_format

0
·
4
·
Apr 2026
jackf857ColdTools8B8K

llama-3-8b-base-margin-dpo-hh-harmless-beta0.01

0
·
4
·
Apr 2026
sstoica12ColdTools8B32K

acquisition_llama-3_1-8b_bins_numina_gradient

0
·
4
·
Apr 2026
jordanpainterColdTools8B32K

diallm-llama-gspo-aus

0
·
4
·
Apr 2026
jackf857ColdTools8B8K

llama-3-8b-base-margin-dpo-hh-helpful-batch-64

0
·
4
·
Apr 2026
zero9techColdTools8B8K

Llama-3.1-8B-Data-Science-Insight-16.5K

0
·
4
·
Apr 2026
penguin102Cold1B2K

c66-h32

0
·
4
·
Jun 2025
Enthusiast101ColdTools1B32K

llama3.2-3b-Inst-lox

0
·
4
·
Apr 2026
michaelwavesColdTools70B32K

hal9000

0
·
4
·
Sep 2025
Hodfa71ColdTools8B8K

llama-8b-nb-delta-dpo

0
·
4
·
Apr 2026
v3raColdTools8B8K

V3ra-Insync-AI-v1-merged

0
·
4
·
Apr 2026
FITPCHColdTools8B8K

Llama-3-8B_PCH_finetune

0
·
4
·
Jan 2026
parkjoColdTools8B32K

Llama-3.1-8B-Instruct_grpo_adv_rollout_8_20260430_104009_step580

0
·
4
·
May 2026
unlearning-cleanslateColdTools8B8K

llama-3_1-8b-rmu-baseline

0
·
4
·
Apr 2026
Nisk36ColdTools8B8K

Llama-3-ELYZA-JP-8B-ojousama-chosen

0
·
4
·
Jan 2025
miolgCold1B2K

2e1777a1

0
·
4
·
Aug 2025
parkjoColdTools8B32K

Llama-3.1-8B-Instruct_grpo_ppl_adv_rollout_8_20260429_160848_step580

0
·
4
·
May 2026
miolgCold1B2K

38952e08

0
·
4
·
Aug 2025
unlearning-cleanslateColdTools8B8K

llama-3_1-8b-simnpo-baseline-target-100

0
·
4
·
Apr 2026
gzone0111ColdTools3B32K

AutoGraphR1-musique_hotpotqa_train-llama3.2-3b-text-retriever-grpo-repetition-penalty

0
·
4
·
Oct 2025
WisdomShellColdTools8B8K

ADG-WizardLM-LLaMa3-8B

0
·
4
·
Apr 2026
WisdomShellColdTools8B8K

ADG-CoT-LLaMa3-8B

0
·
4
·
Apr 2026
unlearning-cleanslateColdTools8B8K

llama-3_1-8b-rmu-baseline-target-100

0
·
4
·
Apr 2026
wvnvwnCold9B16K

gemma-2-9b-it-lr3e-5-gsm8k-lr5e-5

0
·
4
·
May 2026
wvnvwnCold13B4K

llama-2-13b-chat-hf-gsm8k-sn-tuned-lr5e-5

0
·
4
·
May 2026
atlasclaw101ColdTools70B32K

openclaw-primary-merged

0
·
4
·
Apr 2026
vallepubalaji53ColdTools8B8K

orderbot-v4-model

0
·
4
·
Apr 2026
wvnvwnCold9B16K

gemma-2-9b-it-only-sn-tuned-lr3e-5

0
·
4
·
May 2026
kmseongCold7B4K

llama2_7b_chat-WaRP-gsm8k-FT-lr3e-5_ssft_5e-5

0
·
4
·
Apr 2026
unlearning-cleanslateColdTools8B8K

llama-3_1-8b-simnpo-gentle-bm25-6t

0
·
4
·
Apr 2026
wvnvwnColdTools8B32K

qwen-2.5-7B-SSFT-gsm8k-lr3e-5

0
·
4
·
Apr 2026
wvnvwnCold9B16K

gemma-2-9b-it-lr5e-5-safeinstr-0.1

0
·
4
·
Apr 2026