Lyte/Llama-3.2-3B-Overthinker
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Oct 17, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

Lyte/Llama-3.2-3B-Overthinker is a 3.2 billion parameter experimental causal language model developed by Lyte, fine-tuned from unsloth/llama-3.2-3b-instruct-bnb-4bit. This model is designed to "overthink" by generating initial reasoning, step-by-step thinking, and verifications, benefiting from larger context lengths up to 32K tokens. It appears to excel in conversational settings, particularly for mental health support, creative tasks, and explanatory content.

Loading preview...