mlfoundations-dev/qwen2-5_openthoughts_2-5k_rewrite_r1_distill_llama70b_16k
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Feb 24, 2025License:apache-2.0Architecture:Transformer Open Weights Cold

The mlfoundations-dev/qwen2-5_openthoughts_2-5k_rewrite_r1_distill_llama70b_16k is a 7.6 billion parameter language model, fine-tuned from Qwen/Qwen2.5-7B-Instruct. This model is specifically adapted using the mlfoundations-dev/openthoughts_2-5k_rewrite_r1_distill_llama70b_16k dataset, suggesting a specialization in processing or generating content related to open thoughts or distilled information from Llama 70B. It is designed for tasks benefiting from its base Qwen2.5 architecture and its specific fine-tuning data.

Loading preview...