Text Generation Models — Page 320
41,391
nimabod Warm Tools 500M 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-soaring_sprightly_antelope
Dario213 Warm Tools 4B 32K
Qwen3-4B-medical-reasoning
anujjamwal Warm Tools 2B 32K
OpenMath-Nemotron-1.5B-PruneAware-2
HedronCreeper Warm Tools 800M 32K
iproskurina Warm Tools 500M 32K
qwen-hf-fewshot-iter-iter1
Hyeongwon Warm Tools 4B 32K
P2-split3_prob_Qwen3-4B-Base_0312-01
RAS1981 Warm Tools 800M 32K
qwen3-0.6b-turn-detection-v1
Koalacrown Warm Tools 4B 32K
qwen3-4b-multiturn-sft-16bit
channelable Warm Tools 4B 32K
rediska0123 Warm Tools 2B 32K
qwen2.5-math-1.5b-dpo-gsm8k-v3
renansantosmendes Warm Tools 4B 32K
synapseai-qwen3-4B-instruct-merged
RyanYr Warm Tools 2B 32K
slf-dstl_Q2.5-1.5B-It_science_SFT
berkerbatur Warm Tools 800M 32K
qwen-0.6b-job-matcher-student-v2
LorenaYannnnn Warm Tools 800M 32K
longer_response-Qwen3-0.6B-OURS_self-seed_0
Aniq-63 Warm Tools 800M 32K
qwen3-0.6B-recipe-finetuned
luisfsalazar Warm Tools 800M 32K
xw1234gan Warm Tools 3B 32K
Fixed_Merging_Qwen2.5-3B-Instruct_MedQA_lr1e-05_mb2_ga128_n2048_seed42
LorenaYannnnn Warm Tools 800M 32K
confidence-Qwen3-0.6B-baseline_all_tokens-seed_0
wangsherpa Warm Tools 500M 32K
qwen2.5-0.5B-math-cot-sft
LorenaYannnnn Warm Tools 800M 32K
unsafe_compliance-Qwen3-0.6B-OURS_self-seed_2
akseljoonas Warm Tools 2B 32K
Qwen3-1.7B-SFT-s1K-lr0_0001
Neelectric Warm Tools 1B 32K
Llama-3.2-1B-Instruct_SFT_sciencefisher_v00.06
excepto64 Warm Tools 500M 32K
Qwen2.5-0.5B-Instruct_incorrect-medical-advice
excepto64 Warm Tools 500M 32K
Qwen2.5-0.5B-Instruct_incorrect-medical-advice-realigned-correct-financial-advice
LorenaYannnnn Warm Tools 800M 32K
general_reward-Qwen3-0.6B-baseline_all_tokens_w_kl-seed_0
ljcamargo Warm Tools 4B 32K
Akkadian-Finetune-Qwen3-4B-Merged-16B
Suhan Warm Tools 800M 32K
qwen3-0.6b-ft-ml-classify
walter-bd Warm Tools 800M 32K
Kazuki1450 Warm Tools 2B 32K
Qwen3-1.7B-Base_dsum_3_6_1p0_0p5_1p0_grpo_dr_grpo_42_rule
ljcamargo Warm Tools 4B 32K
Akkadian-2-Pretrain-Qwen3-4B-Merged-16B
longdev37 Warm Tools 4B 32K
qwen3-4b-hospital-tth-merged
Kazuki1450 Warm Tools 2B 32K
Qwen3-1.7B-Base_dsum_3_6_1p0_0p5_1p0_grpo_sapo_42_rule
Kazuki1450 Warm Tools 2B 32K
Qwen3-1.7B-Base_dsum_3_6_rel_1e-1_1p0_0p0_1p0_grpo_sapo_42_rule
PetarKal Warm Tools 4B 32K
Qwen3-4B-Base-ascii-art-v5-lr2e-5-ga16-ctx4096
zeri000 Warm Tools 2B 32K
nepali_legal_qwen_merged_3