Jackrong/GPT-5-Distill-llama3.2-3B-Instruct
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Nov 29, 2025License:llama3.2Architecture:Transformer0.0K Warm

Jackrong/GPT-5-Distill-llama3.2-3B-Instruct is a 3.2 billion parameter instruction-tuned language model built on the Llama 3.2 architecture, optimized for edge and consumer GPU deployment. This model leverages knowledge distillation from GPT-5 responses to mimic superior reasoning and conversational patterns, offering flagship-level instruction following in a compact package. With a 32K token context window, it excels in on-device chat, reasoning, summarization, and RAG applications, particularly for moderate-sized documents.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p