Name: SlowGuess/ABForge-Qwen3-8B-Task1-RL API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: SlowGuess

ABForge-Qwen3-8B-Task1-RL: Ablation Objective Generation

This model, developed by SlowGuess, is an 8 billion parameter Qwen3-based language model specifically fine-tuned for Task 1: Ablation Objective Generation within the ABForge post-training pipeline. Unlike other related models, this checkpoint was trained using GRPO (Gradient-based Reward Policy Optimization) directly from Qwen/Qwen3-8B, without an initial supervised warm-start, optimizing a fixed rubric-based reward.

Key Capabilities

Ablation Objective Proposal: Given the ablation-free context of a research paper, the model generates candidate ablation objectives.
Structured Output: Each proposed objective consists of a Target Module (the component to ablate) and a corresponding Research Question it aims to answer.
Specialized Training: Trained on the train/RL_task1_30K.jsonl dataset from SlowGuess/abforge-data, which is derived from CC-licensed research papers.

Use Cases

Research Paper Analysis: Assisting researchers in identifying potential ablation studies for their work.
Automated Experiment Design: Generating structured ablation objectives to guide experimental setups.
Academic Tooling: Serving as a component in larger systems for scientific discovery and analysis.

Evaluation of this model can be reproduced using the SlowGuess/Abforge_1 code, scoring predictions against the held-out AblationBench split using a Claude judge.

Overview

ABForge-Qwen3-8B-Task1-RL: Ablation Objective Generation

Key Capabilities

Use Cases

Full Model Card (README)