ShenaoZhang/0.001_idpo_iter_1
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:8kPublished:Apr 5, 2024License:mitArchitecture:Transformer Open Weights Cold
ShenaoZhang/0.001_idpo_iter_1 is a fine-tuned language model based on HuggingFaceH4/mistral-7b-sft-beta, developed by ShenaoZhang. This model was fine-tuned using the HuggingFaceH4/ultrafeedback_binarized dataset. It is designed for tasks benefiting from instruction-following capabilities derived from preference data. The model's specific optimizations and primary use cases require further information for detailed assessment.
Loading preview...