Model Overview
This model, laion/r2egym-316-opt1k__Qwen3-8B, is an 8-billion-parameter language model built on the Qwen3-8B architecture developed by the Qwen team. It has been fine-tuned to specialize its capabilities for a particular domain.
Key Characteristics
- Base Model: Fine-tuned from the robust Qwen/Qwen3-8B model.
- Parameter Count: Features 8 billion parameters, offering a balance between performance and computational efficiency.
- Context Length: Supports a context window of 32,768 tokens, enabling it to process long inputs and generate coherent, extended outputs (see the loading sketch after this list).
- Fine-tuning Dataset: Trained on the laion/r2egym-unified-316 dataset (referenced in the training configuration via a local Hugging Face hub cache path), indicating a specialization for tasks relevant to this data source.
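For reference, a minimal sketch of loading the checkpoint with the Hugging Face transformers library follows. The dtype and generation settings are illustrative assumptions, not values from the original README.

```python
# Minimal loading sketch using the standard Transformers AutoModel API.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "laion/r2egym-316-opt1k__Qwen3-8B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumption: bf16 to fit an 8B model on one GPU
    device_map="auto",
)

# Prompts may use up to the model's 32,768-token context window.
prompt = "Summarize the purpose of a unit test."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```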
Training Details
The fine-tuning process used a learning rate of 4e-05 with a cosine learning rate scheduler and a warmup ratio of 0.1, trained over 7 epochs. The total batch size was 96, distributed across 32 GPUs, and the run used the ADAMW_TORCH_FUSED optimizer.
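As a rough illustration, these hyperparameters map onto Hugging Face TrainingArguments as sketched below. The actual training framework and script are not specified in the README, so this is a reconstruction under stated assumptions, not the original configuration.

```python
# Hypothetical mapping of the reported hyperparameters onto
# transformers.TrainingArguments; the original training script is unknown.
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="r2egym-316-opt1k__Qwen3-8B",  # hypothetical output path
    learning_rate=4e-5,
    lr_scheduler_type="cosine",
    warmup_ratio=0.1,
    num_train_epochs=7,
    # Total batch size 96 across 32 GPUs => 3 samples per device per step,
    # assuming no gradient accumulation (the README does not say).
    per_device_train_batch_size=3,
    optim="adamw_torch_fused",
    bf16=True,  # assumption: bf16 mixed precision, common for Qwen3 fine-tunes
)
```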
Intended Use
Specific intended uses are not detailed in the original README. However, the fine-tuning on a specialized dataset suggests the model is best suited to applications aligned with the characteristics and content of the r2egym-unified-316 dataset. Developers should evaluate its performance on tasks closely related to that domain before deployment.