Text Generation Models — Page 334
41,545prithivMLmodsWarmTools3B32K
duong942001WarmTools4B32K
SillyWumpusWarmTools4B32K
prithivMLmodsWarmTools800M32K
AshleyQu0311WarmTools4B32K
ferrazzipietroWarmTools1B32K
unsup-Llama-3.2-1B-Instruct-lora
ZhiqiEliWangWarmTools2B32K
ds_r1_1.5b_psyscam_romance_ephishllm
thangvipWarmTools2B32K
qwen3-1.7b-dspo-no-sft-sgd-linear-6500
mark-22WarmTools4B32K
dpo-qwen-cot-merged-dataclearn3
koutchWarmTools4B32K
qwen_falcon_qwen3-instruct-4b_train_sft_2.json
SasanoHideoWarmTools4B32K
qwen3-4b-dpo-qwen-cot-merged-rev.01
deepkickWarmTools4B32K
qwen3-4b-struct-dpo-v14-b0.10-L2048-merged
koutchWarmTools4B32K
qwen_falcon_6.json_train_dpo_v1_2.json
KawausoHiroKawausoWarmTools4B32K
qwen3-4b-structeval-lora-39
open-unlearningWarmTools1B32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr5e-05_layer10_scoeff10_epoch5
open-unlearningWarmTools1B32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_GradDiff_lr5e-05_alpha5_epoch5
ferrazzipietroWarmTools1B32K
Llama-3.2-1B-Instruct-unsup-crf-full-weight-merged
adpretkoWarmTools2B32K
train-riscv-O2_epoch1and2
HaicaochiWarmTools500M32K
Aname-TommyWarmTools2B32K
FlameF0XWarmTools500M32K
ruvltra-claude-code-safetensors
Guilherme34WarmTools3B32K
ferrazzipietroWarmTools1B32K
Llama-3.1-8B-Instruct-unsup-crf-lora-lowlr-merged
PhonsiriWarm3B8K
gemma-2-2b-CoT-sft-thing-format-moredataset-sft2-fix
LorenaYannnnnWarmTools800M32K
20260217-Qwen3-0.6B_grpo_sycophancy_warmup_baseline_192000_episodes_seed_42
shotalabWarmTools4B32K
Qwen3-4B-Instruct-SFT-03-Merged-DPO-01
mehuldamaniWarmTools8B32K
sft-base-half-tranches-v1-global-step-394
kedumerikugameWarmTools4B32K