Name: laion/gpt-oss-120B-stack-overflow-32ep-131k-summtrc-fixthink1 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: laion

Model Overview

This model, laion/gpt-oss-120B-stack-overflow-32ep-131k-summtrc-fixthink1, is an 8 billion parameter language model built upon the Qwen/Qwen3-8B architecture. It has been fine-tuned using the penfever/gpt-oss-120B-stack-overflow-32ep-131k-summtrc-fixthink1 dataset, indicating a specialized focus on content derived from Stack Overflow.

Key Characteristics

Base Model: Qwen/Qwen3-8B
Parameter Count: 8 billion parameters
Context Length: 32768 tokens, allowing for extensive input and output sequences.
Training Data: Fine-tuned on a dataset specifically related to Stack Overflow, suggesting proficiency in technical Q&A, code snippets, and programming discussions.

Training Details

The model was trained with a learning rate of 4e-05, using a cosine learning rate scheduler with a 0.1 warmup ratio over 7 epochs. It utilized a distributed training setup across 8 GPUs with a total batch size of 16 (gradient accumulation steps of 2).

Potential Use Cases

Given its fine-tuning on Stack Overflow data, this model is likely well-suited for:

Technical Question Answering: Providing answers to programming-related queries.
Code Assistance: Generating or explaining code snippets.
Developer Support: Assisting with debugging or understanding technical concepts.

Further information regarding specific capabilities, intended uses, and limitations is not detailed in the provided model card.

Overview

Model Overview

Key Characteristics

Training Details

Potential Use Cases

Full Model Card (README)