Text Generation Models — Page 664
41,393sarringtonColdTools500M32K
oro-aiColdTools4B32K
qwen3-4b-shoppingbench-kto
grounded-aiCold4B4K
phi3-hallucination-judge-merge
adrieljleoColdTools8B32K
indonesia-function-call-lora
ishikaaColdTools3B32K
influence_metamath_qwen2.5-3b_proximity_repeat_regularized_1k_scaled_e3
ishikaaColdTools3B32K
acquisition_metamath_qwen3b_confidence_combined_500
JinbiaoZhuColdTools600M32K
finetuned-Qwen1.5-0.5B-eli5-askscience-TextGeneration
andrewlngdnColdTools8B32K
continuum-aiColdTools8B32K
qwen2.5-coder-7b-compacted
CEIA-RLColdTools4B32K
qwen3-4b-dw-lr-dpo-offline
stabilityaiCold69B32K
japanese-stablelm-base-beta-70b
FinaPolatColdTools12B32K
RAISED_Mistral-Nemo_GRPO_1Krandom
nozero23061311ColdTools2B32K
iamrahulreddyColdTools2B32K
rrvaswinColdTools4B32K
icrl_run6_v2_ckpt_step440
ChrisJackieChanColdTools3B32K
Kazuki1450ColdTools2B32K
Qwen3-1.7B-Base_csum_3_10_1p0_0p0_1p0_grpo_42_rule
nkpzColdTools8B32K
Llama-3.1-8B-Instruct-Uncensored-DeLMAT
ulab-aiColdTools3B32K
Router-R1-Qwen2.5-3B-Instruct-Alpha0.9
OMCHOKSI108ColdTools2B32K
cs-552-2026-MMRFColdTools2B32K
agi-noobsColdTools4B32K
aicrowd-qwen-3-4b-2507-instruct-20k-sumeet-v6
fmmarkmqColdTools8B32K
SEMA_v2_2_0_Qwen2.5-7B_multi-turn_0.2_effi_penalty
vera6ColdTools32B32K
affine-5E5EWwzh4XqUPp6coF4iwiEARG1o4Qe5D2m55FFmiXEaAGuU
GoToCompanyColdTools8B8K
llama3-8b-cpt-sahabatai-v1-base
eekayCold3B8K
gemma-2b-it-noised-np0.1-attn-emb-s3
leoboboColdTools8B32K
qwen3-8b-chat-sft-16bit-unsloth