laion/nemotron-terminal-dependency_management__Qwen3-8B
laion/nemotron-terminal-dependency_management__Qwen3-8B is an 8-billion-parameter language model fine-tuned from Qwen/Qwen3-8B on a dataset specialized for terminal dependency management. With a context length of 32,768 tokens, it is designed to process and respond to queries about managing software dependencies within a terminal environment, making it most useful to developers and system administrators handling dependency-related queries and operations.
Overview
This model, laion/nemotron-terminal-dependency_management__Qwen3-8B, is an 8 billion parameter language model derived from the Qwen3-8B architecture. It has been specifically fine-tuned on a dataset focused on terminal dependency management, indicating its specialization in this domain.
Key Capabilities
- Specialized for Dependency Management: Training on the laion/nemotron-terminal-dependency_management dataset gives the model proficiency in understanding and generating content about software dependency issues, commands, and solutions in a terminal context.
- Large Context Window: With a context length of 32,768 tokens, it can process extensive terminal logs, dependency lists, or problem descriptions, enabling more comprehensive analysis and response generation.
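A typical way to query the model is through the standard Transformers chat interface. The sketch below is illustrative, not an official usage recipe: the system prompt and the `build_messages`/`generate` helpers are assumptions, and it presumes the model is available on the Hugging Face Hub under this repo id.

```python
def build_messages(query: str) -> list[dict]:
    """Wrap a dependency-management question in a simple chat format.
    The system prompt here is a hypothetical example, not from the model card."""
    return [
        {"role": "system",
         "content": "You are a terminal dependency-management assistant."},
        {"role": "user", "content": query},
    ]


def generate(query: str, max_new_tokens: int = 512) -> str:
    """Load the model lazily and answer a single query.
    Requires transformers, torch, and (for device_map="auto") accelerate."""
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_id = "laion/nemotron-terminal-dependency_management__Qwen3-8B"
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype="auto", device_map="auto"
    )

    # Render the chat messages with the model's own chat template.
    prompt = tokenizer.apply_chat_template(
        build_messages(query), tokenize=False, add_generation_prompt=True
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Decode only the newly generated tokens, not the prompt.
    return tokenizer.decode(
        output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )
```

Keeping the heavy imports inside `generate` means the message-building helper can be reused (for batching or logging) without pulling in the model weights.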
Training Details
The model was trained with a learning rate of 4e-05 over 7 epochs, utilizing a distributed setup across 32 GPUs. Key hyperparameters included a total training batch size of 96 and a cosine learning rate scheduler with a 0.1 warmup ratio. The training leveraged Transformers 4.57.6 and PyTorch 2.9.1+cu130.
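The reported hyperparameters can be collected into a single configuration, e.g. for a Transformers `TrainingArguments`-style setup. This is a sketch of the stated values only; the per-device batch size is inferred (96 total / 32 GPUs = 3) under the assumption of no gradient accumulation, which the card does not state.

```python
# Hyperparameters as reported in the training details above.
NUM_GPUS = 32
TOTAL_BATCH_SIZE = 96
# Inferred, assuming gradient_accumulation_steps == 1.
PER_DEVICE_BATCH_SIZE = TOTAL_BATCH_SIZE // NUM_GPUS

training_config = {
    "learning_rate": 4e-05,
    "num_train_epochs": 7,
    "per_device_train_batch_size": PER_DEVICE_BATCH_SIZE,
    "lr_scheduler_type": "cosine",
    "warmup_ratio": 0.1,
}
```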