ShenaoZhang/0.001_idpo_noreplacerej_iter_2
TEXT GENERATION | Concurrency Cost: 1 | Model Size: 7B | Quant: FP8 | Ctx Length: 8k | Published: Apr 8, 2024 | License: mit | Architecture: Transformer | Open Weights | Cold
ShenaoZhang/0.001_idpo_noreplacerej_iter_2 is a 7-billion-parameter language model fine-tuned from ShenaoZhang/0.001_idpo_noreplacerej_iter_1 on the ShenaoZhang/0.001_idpo_noreplacerej_dataset. It was trained for one epoch with a learning rate of 5e-07 and a total batch size of 128 on a multi-GPU setup. The available information does not describe its distinguishing features or intended use cases.
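As a rough illustration of what a "total batch size of 128" means on a multi-GPU setup, the sketch below multiplies a per-device batch size by the device count and the number of gradient-accumulation steps. The function name and the specific split (8 per device, 8 devices, 2 accumulation steps) are hypothetical and chosen only so the product is 128; the source does not state the actual per-device values or GPU count.

```python
def effective_batch_size(per_device: int, num_devices: int, grad_accum_steps: int = 1) -> int:
    """Return the number of examples contributing to one optimizer step."""
    if min(per_device, num_devices, grad_accum_steps) < 1:
        raise ValueError("all factors must be positive integers")
    return per_device * num_devices * grad_accum_steps


# Hypothetical split; the model card only reports the total (128).
total = effective_batch_size(per_device=8, num_devices=8, grad_accum_steps=2)
print(total)  # 128
```

Other splits give the same total, for example 4 per device on 8 devices with 4 accumulation steps, so the reported figure alone does not determine the hardware configuration.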