Text Generation Models — Page 318
41,391morningtea006WarmTools4B32K
affine-horse-5Hg1K2prUdnvSnG7m3mZBmF9hyo8zu8Z4miJSYsfe9Hpvgcu
nakamuratoshiyaWarmTools4B32K
helloworldabcWarmTools4B32K
viamr-projectWarmTools2B32K
amr-parsing-grpo-single-single-turn-20260203-0853-global-step-622
prithivMLmodsWarmTools3B32K
toenobuWarmTools4B32K
utokyo-llm-advance-main-dpo
duong942001WarmTools4B32K
open-unlearningWarmTools1B32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr1e-05_beta0.1_alpha5_epoch5
koutchWarmTools4B32K
qwen_falcon_qwen3-instruct-4b_train_sft_0.json
ferrazzipietroWarmTools1B32K
unsup-Llama-3.2-1B-Instruct-lora
ZhiqiEliWangWarmTools2B32K
ds_r1_1.5b_psyscam_romance_ephishllm
RakushakingWarmTools4B32K
Qwen4b-SFT-d9-merged-after-dpo-d2
RakushakingWarmTools4B32K
Qwen4b-SFT-d9-merged-after-dpo-toml-xml-yaml-dpo
beachcitiesWarmTools4B32K
qwen3-4b-sft-dpo-v25mix-structeval
deepkickWarmTools4B32K
qwen3-4b-struct-dpo-v14-b0.10-L2048-merged
Itohiro2929WarmTools4B32K
gyorgy-ruzicskaWarmTools3B32K
lingua-news-llama-3-spanish-simplifier
MarkProMaster229WarmTools2B32K
KawausoHiroKawausoWarmTools4B32K
qwen3-4b-structeval-lora-39
open-unlearningWarmTools1B32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr1e-05_layer10_scoeff10_epoch5
open-unlearningWarmTools1B32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr5e-05_layer15_scoeff10_epoch5
Aname-TommyWarmTools2B32K
thangvipWarmTools2B32K
qwen2.5-1.5b-dspo-no-sft-sgd-linear
FlameF0XWarmTools500M32K
ruvltra-claude-code-safetensors
Guilherme34WarmTools3B32K
AdanatoWarmTools3B32K
qwen25_3b_instruct_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_0
PhonsiriWarm3B8K
gemma-2-2b-CoT-sft-thing-format-moredataset-sft2-fix
0d1nWarmTools800M32K
Qwen3-0.6B-Gensyn-Swarm-voracious_pesty_penguin
hariharanv04WarmTools4B32K
qwen3-4b-instruct-75k-int
LorenaYannnnnWarmTools800M32K
20260217-Qwen3-0.6B_grpo_sycophancy_warmup_baseline_192000_episodes_seed_42
TermsofMLWarmTools500M32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-gilded_aquatic_sparrow