laion/nemosci-tasrep-a1unix-a1mfc-gfistaqc-scaff-maxeps-swes-r2eg-32b__Qwen3-32B
This model is a 32-billion-parameter fine-tune of Qwen/Qwen3-32B developed by laion. It was trained on a diverse collection of scientific computing, Unix sandbox, multi-file code composition, and agent-trace datasets. This specialization points to complex technical problem-solving, code generation, and reasoning about intricate system interactions as its primary uses, and the training mix suggests strength in following detailed technical instructions and producing structured responses.
Model Overview
This model, nemosci-tasrep-a1unix-a1mfc-gfistaqc-scaff-maxeps-swes-r2eg-32b__Qwen3-32B, is a 32-billion-parameter language model fine-tuned from the base Qwen/Qwen3-32B model. It was adapted through training on an extensive set of datasets focused on specialized technical domains, summarized below.
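As a minimal sketch of basic usage (assuming the checkpoint is published under the repository id above and ships with the stock Qwen3 chat template), the model can be loaded through the standard Hugging Face transformers API. The prompt here is a hypothetical example in the model's Unix-oriented domain:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "laion/nemosci-tasrep-a1unix-a1mfc-gfistaqc-scaff-maxeps-swes-r2eg-32b__Qwen3-32B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # ~64 GB of weights at 32B params; device_map shards across GPUs
    device_map="auto",
)

# Hypothetical prompt in the model's Unix-oriented domain.
messages = [
    {"role": "user", "content": "Explain what `find / -xdev -size +100M` does."}
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```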
Key Training Datasets
The fine-tuning process leveraged a combination of datasets, including:
- Scientific Computing: nemotron-terminal-scientific_computing
- Agent Traces: exp_tas_repetition_penalty_1.05_traces, exp_tas_max_episodes_512_traces
- Unix Sandboxes: stackexchange-unix-sandboxes_glm_4.7_traces_jupiter, swesmith-sandboxes-with_tests-gpt-5-mini-passed_glm_4.7_traces
- Code Composition: a1_multifile_composition, a1_repo_scaffold
- Embedding and GFI: exp-gfi-staqc-embedding-mean-filtered-10K_glm_4.7_traces_jupiter
- R2E Gym Sandboxes: Kimi-2.5-r2egym_sandboxes-maxeps-32k
Training Configuration
The model was trained with a learning rate of 4e-05 and a total batch size of 96 across 96 devices (one sample per device). The optimizer was AdamW_Torch_Fused with a cosine learning-rate schedule and a warmup ratio of 0.1, run for 7 epochs.
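Concretely, this optimizer-and-schedule combination maps onto PyTorch's fused AdamW plus the cosine warmup schedule from transformers. The sketch below is illustrative only: the tiny stand-in model, dummy loss, and step counts are placeholders, and only the learning rate, warmup ratio, and epoch count come from the card.

```python
import torch
from torch import nn
from transformers import get_cosine_schedule_with_warmup

# From the card: lr, warmup ratio, epochs. Step counts are placeholders.
LR = 4e-5
EPOCHS = 7
WARMUP_RATIO = 0.1
steps_per_epoch = 100                 # depends on dataset size / global batch of 96
total_steps = EPOCHS * steps_per_epoch

model = nn.Linear(16, 16).cuda()      # tiny stand-in; fused AdamW requires CUDA params
optimizer = torch.optim.AdamW(model.parameters(), lr=LR, fused=True)  # AdamW_Torch_Fused
scheduler = get_cosine_schedule_with_warmup(
    optimizer,
    num_warmup_steps=int(WARMUP_RATIO * total_steps),  # 0.1 warmup ratio
    num_training_steps=total_steps,
)

for step in range(total_steps):
    loss = model(torch.randn(4, 16, device="cuda")).pow(2).mean()  # dummy loss
    loss.backward()
    optimizer.step()
    scheduler.step()
    optimizer.zero_grad()
```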
Potential Use Cases
Given its specialized training on datasets related to scientific computing, Unix environments, multi-file code composition, and agent traces, this model is likely well-suited for:
- Technical Problem Solving: Assisting with complex scientific or engineering tasks.
- Code Generation & Analysis: Generating or understanding code within specific technical contexts, especially for Unix-like systems.
- Agent-based Systems: Potentially useful in developing or analyzing AI agents that interact with complex environments.
- System Interaction: Understanding and responding to queries related to system commands, configurations, and debugging.
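For quick experiments against these use cases, the transformers pipeline API offers a compact alternative to the explicit loading shown earlier. The debugging question below is a hypothetical prompt matching the model's Unix sandbox training, and the generation settings are illustrative rather than tuned:

```python
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="laion/nemosci-tasrep-a1unix-a1mfc-gfistaqc-scaff-maxeps-swes-r2eg-32b__Qwen3-32B",
    torch_dtype="auto",
    device_map="auto",
)

# Hypothetical system-interaction query; settings are illustrative, not tuned.
messages = [{
    "role": "user",
    "content": (
        "My cron job writes to /var/log/myjob.log but the file stays empty, "
        "even though running the script by hand works. What should I check?"
    ),
}]
result = generator(messages, max_new_tokens=512)
print(result[0]["generated_text"][-1]["content"])  # assistant reply
```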