Text Generation Models — Page 1002
42,728DCAgent2ColdTools32B32K
fresh_gptlongtezos_step900__Qwen3-32B
Lixing-LiColdTools8B32K
Llama-3.1-8B-LoRA-TENSORTRUST-LATE8TH
wh-zhuColdTools8B32K
qwen2_7B-ultrachatfeedback-wspo
parkjoColdTools8B32K
Qwen2.5-Math-7B_grpo_adv_rollout_8_step580
choiqsColdTools2B32K
Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint350
kmseongColdTools8B32K
llama3.1_8b_base_only_sn_tuned_lr3e-5
void-818ColdTools32B32K
Affine-20-5Cft6kfbx5aacDLg3dJpEiz2GW2Sd3vqZPDd3jnjrsZzYZ6J
Johnny1024ColdTools4B32K
TTRL-sciknoweval_material-TTRL-Len-8k-grpo-094908
vingale803ColdTools3B32K
tofu_Llama-3.2-3B-Instruct_forget01_NPO_beta1.0_lr1e-5
micleowen02ColdTools32B32K
affine-5Ccb12H25H5MXssy946rm4qxrQTmz5DH9M7DUG7W7ViioSGE
Johnny1024ColdTools4B32K
TTRL-sciknoweval_chem-TTRL-Len-8k-grpo-132125
grafColdTools2B32K
math_btoracle-4b-f3c36853-not_easy_1e-4_200
choiqsColdTools2B32K
Qwen3-1.7B-tldr-bsz128-ts500-regularsqrt2-skywork8b-seed42-lr1e-6-warmup10-checkpoint300
parkjoColdTools8B32K
Llama-3.1-8B-Instruct_grpo_adv_rollout_8_20260430_104009_step580
wvnvwnColdTools8B32K
qwen-2.5-7B-Instruct-SSFT-lr5e-5
Johnny1024ColdTools4B32K
bs16-k20-lr5e-7-ema0-eopd0.8-qwen3-4b-think-mmlu_pro_train10k_bottom20-s150
ikkirenColdTools2B32K
qwen-2.5-1.5b-instruct-ru-lora-r32-compose-train-mera-16k
Johnny1024ColdTools4B32K
intuitor-sciknoweval_chem-qwen3-4b-think-2507-r6k100
sathiiiiiCold3B8K
polyalign-gemma2-2b-en-dist-sft
shrangoColdTools8B32K
lorem_advshape_qwen2.5-math-7b
doupariColdTools8B8K
llama3.1_8b_sft-llopa-k24-no_system-cnndm-train.summary.q60000-llopa-k24-no_system
rghosh8ColdTools2B32K
arc-grpo-deepseek-R1-distill-qwen-1.5b-rajat-seed-42-G-16-merged
Johnny1024ColdTools4B32K
bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-sciknoweval_chem_bottom20_nogap-maxsteps200-resp2
jprivera44ColdTools70B32K
llama-3.3-70b-atlas9-sdf-v5-balanced
anonymous-dadaColdTools8B32K
Enthusiast101ColdTools1B32K
llama3.2-1b-Inst-antidote
jsilverbergColdTools2B32K
parkjoColdTools2B32K
Qwen2.5-Math-1.5B_grpo_entropy_rollout_8_20260501_191140_step580
wvnvwnCold9B16K
gemma-2-9b-it-gsm8k-rsn-tuned-lr3e-5
kmseongCold7B4K
Llama-2-7b-chat-hf_gsm8k_ft_freeze_basis_rotation_sn_lr5e-5
Johnny1024ColdTools4B32K
intuitor-sciknoweval_physics-qwen3-4b-think-2507-r6k100
parkjoColdTools3B32K
Llama-3.2-3B-Instruct_grpo_adv_rollout_8_step580
parkjoColdTools3B32K
Llama-3.2-3B-Instruct_base_grpo_rollout_8_20260429_145817_step580
jalenluorionColdTools8B8K
Plum32ColdTools32B32K
affine-ss4-5D4QmR9SSDcJPEMGTZ5Gei4MqrVnZji43XXrQ1FxcS5jYvYB
wvnvwnCold13B4K
llama-2-13b-chat-hf-SSFT-lr5e-5