haji80mr-uoft/gpt-semi-wtype-Llama-tuned-Lora-merged-gpt5

  • Task: Text generation
  • Model size: 3.2B parameters
  • Quantization: BF16
  • Context length: 32k
  • Published: Apr 16, 2026
  • License: apache-2.0
  • Architecture: Transformer (open weights)
  • Concurrency cost: 1

haji80mr-uoft/gpt-semi-wtype-Llama-tuned-Lora-merged-gpt5 is a 3.2 billion parameter Llama-based instruction-tuned model developed by haji80mr-uoft. It was finetuned from unsloth/llama-3.2-3b-instruct-unsloth-bnb-4bit using Unsloth, which sped up training roughly 2x. The model is intended for general instruction-following tasks and offers efficient performance thanks to this optimized training process.


Model Overview

The haji80mr-uoft/gpt-semi-wtype-Llama-tuned-Lora-merged-gpt5 is a 3.2 billion parameter language model developed by haji80mr-uoft. It is an instruction-tuned variant, finetuned from the unsloth/llama-3.2-3b-instruct-unsloth-bnb-4bit base model.

Key Characteristics

  • Architecture: Llama-based, specifically finetuned from a 3.2B parameter instruction model.
  • Training Optimization: The model was trained roughly 2x faster by using the Unsloth library in conjunction with Hugging Face's TRL library. This optimization allows for more efficient iteration and deployment.
  • Context Length: The model supports a context length of 32768 tokens, enabling it to process and generate longer sequences of text.
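As a sketch (not taken from the model card), the merged checkpoint should load like any Llama 3.2 model via Hugging Face Transformers in BF16. The helper names `load_model` and `fits_in_context` are illustrative, not part of the released model:

```python
MODEL_ID = "haji80mr-uoft/gpt-semi-wtype-Llama-tuned-Lora-merged-gpt5"
CTX_LEN = 32_768  # context window reported on the card (32k)

def fits_in_context(prompt_tokens: int, max_new_tokens: int) -> bool:
    """Check that the prompt plus planned generation stays within the 32k window."""
    return prompt_tokens + max_new_tokens <= CTX_LEN

def load_model():
    """Load the merged checkpoint with Transformers in BF16.

    Requires `transformers`, `torch`, and enough memory for ~3.2B parameters.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.bfloat16,  # matches the BF16 quantization listed above
        device_map="auto",           # place layers on available GPU/CPU
    )
    return tokenizer, model
```

Since the LoRA adapter has already been merged into the base weights, no PEFT-specific loading step should be needed.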

Intended Use Cases

This model is suitable for a variety of general instruction-following tasks where a compact yet capable Llama-based model is desired. Its optimized training process makes it a good candidate for applications requiring efficient deployment and inference.
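Llama 3.2 instruct checkpoints use the Llama 3 chat template; assuming this finetune preserves the base model's template, a single-turn instruction prompt looks like the hand-built string below. In practice, prefer `tokenizer.apply_chat_template`; `build_llama3_prompt` is an illustrative name, not an API of this model:

```python
def build_llama3_prompt(user_message: str,
                        system: str = "You are a helpful assistant.") -> str:
    """Assemble a single-turn prompt in the Llama 3 chat format."""
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n" + system + "<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n" + user_message + "<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

# The model's reply is generated after the final assistant header.
prompt = build_llama3_prompt("Summarize LoRA finetuning in one sentence.")
```

Generation then proceeds as with any causal LM: tokenize the prompt, call `model.generate`, and stop at the `<|eot_id|>` token.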