Jackrong/GPT-5-Distill-llama3.2-3B-Instruct
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Nov 29, 2025License:llama3.2Architecture:Transformer0.0K Warm
Jackrong/GPT-5-Distill-llama3.2-3B-Instruct is a 3.2 billion parameter instruction-tuned language model built on the Llama 3.2 architecture, optimized for edge and consumer GPU deployment. This model leverages knowledge distillation from GPT-5 responses to mimic superior reasoning and conversational patterns, offering flagship-level instruction following in a compact package. With a 32K token context window, it excels in on-device chat, reasoning, summarization, and RAG applications, particularly for moderate-sized documents.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–