GanjinZero/wombat-7b-gpt4-delta
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Apr 13, 2023Architecture:Transformer0.0K Cold

GanjinZero/wombat-7b-gpt4-delta is a 7 billion parameter instruction-following language model developed by Alibaba DAMO Academy and Tsinghua University. Fine-tuned from Alpaca models using the novel RRHF (Rank Response to align Human Feedback) method, it is aligned with GPT-4 as a proxy for human preferences. This model is primarily intended for research into learning from human feedback and serves as a prototype for RRHF methodologies.

Loading preview...