Name: laion/r2egym-100000-opt100k__Qwen3-8B API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: laion

Model Overview

This model, laion/r2egym-100000-opt100k__Qwen3-8B, is an 8 billion parameter language model derived from the Qwen/Qwen3-8B architecture. It has been specifically fine-tuned on the /e/data1/datasets/playground/ot/hf_hub/datasets--laion--r2egym-unified-100000 dataset, indicating a focus on tasks related to reasoning and evaluation within the domain represented by this dataset.

Training Details

The fine-tuning process utilized a learning rate of 4e-05 with an AdamW optimizer and a cosine learning rate scheduler with a 0.1 warmup ratio. Training was conducted over 5.0 epochs with a total batch size of 96 across 32 GPUs, accumulating gradients over 3 steps. The model leverages a substantial context length of 32768 tokens, which is beneficial for processing longer inputs and maintaining conversational coherence.

Potential Use Cases

Given its fine-tuning on a specialized dataset, this model is likely suitable for:

Domain-specific reasoning tasks: Where the training data provides relevant patterns and knowledge.
Evaluation and analysis: Tasks that align with the 'r2egym-unified' dataset's characteristics.
Applications requiring extended context: Benefiting from its 32K token context window.

Overview

Model Overview

Training Details

Potential Use Cases

Full Model Card (README)