ShenaoZhang/0.001_idpo_iter_2
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Apr 5, 2024License:mitArchitecture:Transformer Open Weights Cold
The ShenaoZhang/0.001_idpo_iter_2 model is a fine-tuned iteration building upon ShenaoZhang/0.001_idpo_iter_1, developed by ShenaoZhang. It was trained using specific hyperparameters including a learning rate of 5e-07 and a total batch size of 128 over 1 epoch. This model is part of an iterative development process, with its primary differentiation stemming from its fine-tuning on the ShenaoZhang/0.001_idpo_dataset. Its specific capabilities and intended uses require further information for precise application.
Loading preview...