Overview
DCAgent/a1-quixbugs is an 8-billion-parameter language model fine-tuned from Qwen/Qwen3-8B. It was trained on the dataset located at /e/scratch/jureap59/raoof1/sft_data/hf_hub/datasets--DCAgent--exp_rpt_quixbugs-python_10k_glm_4.7_traces_jupiter/snapshots/fa5a4f81fae8698b39757c81f371471ce0265801_thinking_preprocessed. Fine-tuning ran for 7.0 epochs on a multi-GPU setup with 16 devices.
Key Training Details
- Base Model: Qwen/Qwen3-8B
- Learning Rate: 4e-05
- Optimizer: ADAMW_TORCH_FUSED
- Epochs: 7.0
- Distributed Training: Multi-GPU across 16 devices
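The hyperparameters above can be sketched as a Hugging Face `transformers.TrainingArguments` configuration. This is a hypothetical reconstruction, assuming the standard Trainer (or a TRL-style SFT stack) was used; the actual training script is not published, and `output_dir` and `bf16` here are assumptions, not values from the model card.

```python
from transformers import TrainingArguments

# Hypothetical sketch of the reported training setup -- not the actual script.
args = TrainingArguments(
    output_dir="a1-quixbugs-sft",  # assumption: illustrative path only
    learning_rate=4e-5,            # from the model card
    num_train_epochs=7.0,          # from the model card
    optim="adamw_torch_fused",     # ADAMW_TORCH_FUSED
    bf16=True,                     # assumption: common for 8B-scale fine-tunes
)
# The 16-device run would typically be launched with a distributed launcher,
# e.g. `torchrun --nnodes=2 --nproc_per_node=8 train.py` or `accelerate launch`.
```

The distributed topology (e.g. 2 nodes x 8 GPUs vs. other splits) is not stated in the card, so the launcher comment is illustrative only.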
Intended Use Cases
Given its fine-tuning on a dataset referencing "quixbugs-python_10k_glm_4.7_traces_jupiter" and "thinking_preprocessed", this model is likely specialized for tasks involving:
- Code Analysis: Understanding Python code structures and execution flows.
- Bug Identification: Potentially assisting in locating errors within code.
- Reasoning about Code: Generating or interpreting logical steps related to code behavior or debugging.
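QuixBugs-style tasks pair a small classic algorithm containing a single-line defect with its one-line repair. The buggy/fixed pair below is written here purely for illustration in that style; it is not claimed to come from the training data.

```python
def bitcount_buggy(n):
    """Count set bits in n -- contains a classic single-line defect."""
    count = 0
    while n:
        n ^= n - 1   # bug: XOR does not clear the lowest set bit
        count += 1   # (for odd n this loop never terminates)
    return count

def bitcount_fixed(n):
    """Corrected version: the one-line repair the model would be asked to find."""
    count = 0
    while n:
        n &= n - 1   # fix: AND with n-1 clears exactly one set bit per pass
        count += 1
    return count

print(bitcount_fixed(127))  # -> 7
```

A bug-identification prompt would present the buggy function and ask the model to locate and repair the defective line.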
Users should consider this model for applications requiring deep understanding of Python code, particularly where tracing and reasoning about program execution are critical.