laion/nemosci-tasrep-a1mfc-dev1-maxeps__Qwen3-8B

Text generation · Model size: 8B · Quantization: FP8 · Context length: 32k · Concurrency cost: 1 · Published: Apr 17, 2026 · License: other · Architecture: Transformer

laion/nemosci-tasrep-a1mfc-dev1-maxeps__Qwen3-8B is an 8 billion parameter language model fine-tuned from Qwen/Qwen3-8B. It was trained on a diverse set of scientific computing and agent-based trace datasets, including nemotron-terminal-scientific_computing and various DCAgent traces. The model is specialized for scientific computing tasks and for understanding agent interactions, and its 32,768-token context length allows it to handle long, complex sequences.


Model Overview

This model, laion/nemosci-tasrep-a1mfc-dev1-maxeps__Qwen3-8B, is an 8 billion parameter language model built upon the robust Qwen3-8B architecture. It has been specifically fine-tuned on a collection of specialized datasets, primarily focusing on scientific computing and agent interaction traces.

Key Training Data

The fine-tuning process utilized several distinct datasets, indicating a focus on specific domains:

  • nemotron-terminal-scientific_computing: Suggests an emphasis on scientific text, code, or terminal interactions.
  • DCAgent datasets (e.g., exp_tas_repetition_penalty_1.05_traces, a1_multifile_composition, exp_tas_max_episodes_512_traces, dev_set_part1_10k_glm_4.7_traces_jupiter): These datasets imply training on traces or logs from agent-based systems, potentially for understanding decision-making, task execution, or multi-file composition within an agentic context.

Training Configuration

The model was trained for 5 epochs with a learning rate of 4e-05, using a total batch size of 96 across 32 GPUs. An AdamW optimizer with cosine learning rate scheduling and a warmup ratio of 0.1 was employed. This configuration suggests a thorough fine-tuning process aimed at adapting the base Qwen3-8B model to its specialized data.
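As a concrete reference, the reported hyperparameters can be collected into a single configuration sketch. This is a hypothetical reconstruction, not the actual training script: the values come from this card, and the per-GPU batch size is derived (total batch size divided by GPU count), not stated directly.

```python
# Hypothetical summary of the fine-tuning setup described above.
# All values are from the model card; per_device_batch_size is derived.
num_gpus = 32
total_batch_size = 96
per_device_batch_size = total_batch_size // num_gpus  # 96 / 32 = 3 sequences per GPU

config = {
    "base_model": "Qwen/Qwen3-8B",
    "epochs": 5,
    "learning_rate": 4e-5,
    "optimizer": "adamw",
    "lr_scheduler": "cosine",
    "warmup_ratio": 0.1,
    "total_batch_size": total_batch_size,
    "per_device_batch_size": per_device_batch_size,
}
print(per_device_batch_size)  # → 3
```

With a cosine schedule and warmup ratio of 0.1, the learning rate ramps up over the first 10% of training steps before decaying toward zero.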

Potential Use Cases

Given its training data, this model is likely well-suited for applications requiring an understanding of:

  • Scientific text analysis and generation.
  • Interpreting or generating agent-based system logs and traces.
  • Tasks involving multi-file code composition or scientific workflows.
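For the use cases above, the model would typically be queried through a chat interface. The sketch below builds a request payload in the OpenAI-compatible format commonly used to serve Qwen-family models (for example via vLLM); the model name matches this card, but the endpoint, prompt, and parameters are purely illustrative assumptions.

```python
import json

# Hypothetical chat-completions payload for an OpenAI-compatible serving
# endpoint. Only the model identifier comes from this card; the system
# prompt, user message, and max_tokens are illustrative.
payload = {
    "model": "laion/nemosci-tasrep-a1mfc-dev1-maxeps__Qwen3-8B",
    "messages": [
        {"role": "system", "content": "You are a scientific-computing assistant."},
        {"role": "user", "content": "Summarize what this NumPy trace does."},
    ],
    "max_tokens": 512,
}
print(json.dumps(payload, indent=2))
```

The long context window is the main reason to prefer this payload shape for trace analysis: entire multi-file agent logs can often be placed in a single user message rather than chunked across turns.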