DCAgent/g1_original_1k_8b
DCAgent/g1_original_1k_8b is an 8 billion parameter language model fine-tuned from Qwen/Qwen3-8B. It was trained on a dataset derived from 'g1_min_episodes_e1_gpt_long_d1_original_40k_glm47_traces_1k', which suggests specialization in processing and generating content related to that trace data. Its 32K context length supports long input sequences for detailed analysis or generation tasks.
DCAgent/g1_original_1k_8b Overview
DCAgent/g1_original_1k_8b is an 8 billion parameter language model fine-tuned from the base Qwen/Qwen3-8B architecture. It was trained on the dataset at /e/scratch/jureap59/raoof1/sft_data/hf_hub/datasets--DCAgent--g1_min_episodes_e1_gpt_long_d1_original_40k_glm47_traces_1k/snapshots/09c22b498460fd0ed83413eec6dbf62be30d205a_thinking_preprocessed, which suggests a focus on processing and understanding specific kinds of trace or episode data.
Key Training Details
- Base Model: Qwen/Qwen3-8B
- Learning Rate: 4e-05
- Optimizer: AdamW_Torch_Fused with betas=(0.9, 0.98) and epsilon=1e-08
- Epochs: 7.0
- Distributed Training: Multi-GPU setup with 16 devices
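Assuming PyTorch, the optimizer settings above correspond roughly to the following sketch. The toy model is illustrative only; the actual run used torch's fused AdamW ("adamw_torch_fused"), but fused=True requires CUDA, so it is omitted in this CPU-safe version:

```python
import torch
from torch import nn

# Toy stand-in for the 8B model, for illustration only.
model = nn.Linear(16, 16)

# AdamW with the hyperparameters listed in the card.
# The training run used torch's fused AdamW; fused=True needs CUDA,
# so it is left at the default here.
optimizer = torch.optim.AdamW(
    model.parameters(),
    lr=4e-05,
    betas=(0.9, 0.98),
    eps=1e-08,
)

# One illustrative optimization step.
loss = model(torch.randn(4, 16)).pow(2).mean()
loss.backward()
optimizer.step()
optimizer.zero_grad()
```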
Potential Use Cases
Given its specialized fine-tuning, this model is likely best suited for applications requiring:
- Analysis of specific trace data: Processing and interpreting the kind of 'g1_min_episodes' and 'glm47_traces' data it was trained on.
- Contextual understanding: Leveraging its 32K context length for tasks that require deep comprehension of long, detailed sequences related to its training domain.
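As a Qwen3-8B fine-tune, the model can presumably be loaded through the standard Hugging Face transformers API. The sketch below is an assumption based on the base model's usual usage, not a recipe documented by the developers; the prompt content is a placeholder:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "DCAgent/g1_original_1k_8b"  # repo id from this card

def load_model(model_id: str = MODEL_ID):
    """Load tokenizer and model; downloads ~8B-parameter weights."""
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype="auto",   # use the checkpoint's native dtype
        device_map="auto",    # place weights on available devices
    )
    return tokenizer, model

if __name__ == "__main__":
    tokenizer, model = load_model()
    # Placeholder prompt; real inputs would be trace data from the
    # model's training domain.
    messages = [{"role": "user", "content": "Summarize this trace: ..."}]
    inputs = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    out = model.generate(inputs, max_new_tokens=256)
    print(tokenizer.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True))
```

The 32K context window means long traces can be passed in a single prompt, though memory use grows with sequence length.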
Further details on specific capabilities, intended uses, and limitations would require more information from the model developers.