Text Generation Models — Page 681
41,389hkust-nlpColdTools33B32K
Qwen-2.5-32B-SimpleRL-Zoo
jacker31ColdTools500M32K
ransomware-stage3-Qwen_Qwen2.5-0.5B-teacher-student-lora
TeeZeeCold69B32K
Xwin-LM-70B-V0.1_Limarpv3
LorenaYannnnnColdTools800M32K
Qwen3-0.6B-OURS_self-g_general_reward_e_sycophancy_keep_last-100-tokens_w1_gw0_gsrcmax0-seed_0
wAI-orgColdTools8B32K
swerl-qwen3-8b-openthoughts-grpo
XavierCoulonColdTools2B32K
qwen3-1.7b-chsa-dpo-merged
cjiaoColdTools2B32K
goldengoose-gumbel_combined_indoc_tau0.10-25grp
mathurinacheColdTools7B4K
adamo1139Cold34B32K
Yi-34b-200K-AEZAKMI-RAW-TOXIC-2702
rrvaswinColdTools4B32K
qwen3_4b_instruct_icrl_run5_ckpt_step660
18-DeathColdTools3B32K
mt-walnut53-walnut53-gsm8k
dfwasfmdpwklgjnpwngwgColdTools35B32K
affine-5EuvxmpoDhQwqNdA4e5B5F6X3HH7pDz4hYd5EWE8cL99dgMN
JFernandoGREColdTools8B32K
llama31_8b_augmenteddemocracy_sft_questions_50_critsupport
NeelectricColdTools8B32K
Llama-3.1-8B-Instruct_SFT_Chat-220kv00.04
KissanAIColdTools8B32K
Dhenu2-In-Llama3.1-8B-Instruct
socratesftColdTools15B32K
Waleed-1a10ColdTools500M32K
qwen2.5-boolq-variant1-16bit
Kazuki1450ColdTools2B32K
Qwen3-1.7B-Base_csum_6_10_sgnrel_sym_1_1p0_0p0_1p0_grpo_42_rule