Overview
The amphora/day1-train-model is a Qwen2-based causal language model published by amphora, fine-tuned from the unsloth/Qwen2.5-0.5B-Instruct-unsloth-bnb-4bit base model (a roughly 0.5-billion-parameter model, per the base model's name). A key characteristic is its training methodology: it was trained 2x faster using Unsloth together with Hugging Face's TRL library, reflecting an emphasis on efficient fine-tuning.
Key Characteristics
- Architecture: Qwen2-based causal language model.
- Parameter Count: roughly 0.5 billion, per the Qwen2.5-0.5B base model.
- Context Length: 32,768-token context window.
- Training Efficiency: 2x faster training via Unsloth combined with Hugging Face's TRL library.
- License: Distributed under the Apache-2.0 license.
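Since the model is hosted under a Hugging Face Hub repo id, it can presumably be loaded with the standard transformers API. A minimal sketch, assuming the repo ships a tokenizer with a chat template (the `load` and `generate` helpers are illustrative names, not from the model card):

```python
# Hedged sketch (not from the model card): loading amphora/day1-train-model
# with the transformers library. All helper names are illustrative.
MODEL_ID = "amphora/day1-train-model"   # Hub repo id from the card
MAX_CONTEXT = 32768                     # context window stated in the card

def load(model_id: str = MODEL_ID):
    """Load tokenizer and model; assumes `transformers` is installed."""
    # Lazy import so the sketch can be read without the dependency installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    tok = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")
    return tok, model

def generate(prompt: str, tok, model, max_new_tokens: int = 128) -> str:
    """Apply the model's chat template and return the decoded completion."""
    messages = [{"role": "user", "content": prompt}]
    input_ids = tok.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    out = model.generate(input_ids, max_new_tokens=max_new_tokens)
    # Strip the prompt tokens and decode only the new completion.
    return tok.decode(out[0][input_ids.shape[-1]:], skip_special_tokens=True)
```

`device_map="auto"` places weights on GPU when available; on CPU-only machines the model loads there instead.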
Use Cases
This model is well suited to developers and researchers who want a Qwen2-based model that has undergone an optimized fine-tuning process. Its efficient training makes it a good candidate for further experimentation, or for deployment where rapid iteration and resource-conscious development are priorities. The 32,768-token context window also supports applications that process long inputs or generate extended outputs.
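Qwen2-family instruct models conventionally use the ChatML conversation format, which `tokenizer.apply_chat_template` produces automatically; the raw layout is sketched below to make the wire format explicit for prompt debugging (the role markers follow the general Qwen convention and are an assumption, since the model card does not spell them out):

```python
def chatml_prompt(user_msg: str,
                  system_msg: str = "You are a helpful assistant.") -> str:
    """Build a ChatML-style prompt as used by Qwen2-family instruct models.

    In practice tokenizer.apply_chat_template does this for you; this sketch
    only makes the underlying text format visible.
    """
    return (
        f"<|im_start|>system\n{system_msg}<|im_end|>\n"
        f"<|im_start|>user\n{user_msg}<|im_end|>\n"
        f"<|im_start|>assistant\n"  # the model continues from here
    )

prompt = chatml_prompt("Summarize this document.")
```

Keeping the combined prompt and expected completion under the 32,768-token context window avoids silent truncation on long inputs.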