lihaoxin2020/qwen3-4b-refiner-gpt54-rubric-v3-2-rl-lr5e-6-step50
lihaoxin2020/qwen3-4b-refiner-gpt54-rubric-v3-2-rl-lr5e-6-step50 is a 4-billion-parameter, Qwen3-based language model. It is a GRPO checkpoint, specifically a refiner model, trained from lihaoxin2020/qwen3-4b-refiner-gpt54-ep2. The model is designed for refinement tasks, using its 32768-token context window to process and improve text against a GPT-54 rubric.
Model Overview
This model, lihaoxin2020/qwen3-4b-refiner-gpt54-rubric-v3-2-rl-lr5e-6-step50, is a 4-billion-parameter language model built on the Qwen3 architecture. It functions as a refiner and was trained as a GRPO (Group Relative Policy Optimization) checkpoint. Training initiated from lihaoxin2020/qwen3-4b-refiner-gpt54-ep2, indicating a specialized fine-tuning lineage.
Key Characteristics
- Architecture: Qwen3-based, 4 billion parameters.
- Training Method: GRPO checkpoint, suggesting a reinforcement learning approach for refinement.
- Origin: Derived from lihaoxin2020/qwen3-4b-refiner-gpt54-ep2.
- Context Length: A substantial 32768-token context window, enabling processing of longer inputs for refinement tasks.
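Given these characteristics, the model should load like any Qwen3 causal-LM checkpoint via Hugging Face transformers. The sketch below is illustrative, not documented behavior: it assumes the standard AutoTokenizer/AutoModelForCausalLM APIs apply, and adds a small hypothetical helper that checks a prompt-plus-generation budget against the 32768-token window.

```python
def fits_in_context(prompt_tokens: int, max_new_tokens: int, limit: int = 32768) -> bool:
    """Check that the prompt plus the generation budget stays within
    the model's 32768-token context window."""
    return prompt_tokens + max_new_tokens <= limit


def load_refiner(
    model_id: str = "lihaoxin2020/qwen3-4b-refiner-gpt54-rubric-v3-2-rl-lr5e-6-step50",
):
    """Load the refiner as a standard Qwen3 checkpoint (sketch; requires the
    transformers library and downloads the weights on first call)."""
    from transformers import AutoModelForCausalLM, AutoTokenizer  # lazy import

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype="auto", device_map="auto"
    )
    return tokenizer, model


# Budget checks run without loading any weights:
print(fits_in_context(30000, 2048))  # 32048 <= 32768
print(fits_in_context(31000, 2048))  # 33048 > 32768
```

The helper and the model ID default are assumptions for illustration; in practice the tokenizer's own length accounting should be the source of truth for the prompt-token count.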
Potential Use Cases
- Text Refinement: Ideal for tasks requiring iterative improvement or correction of text based on specific criteria.
- Quality Enhancement: Can be applied to enhance the quality of generated or existing text by aligning it with a predefined rubric (e.g., GPT-54 rubric).
- Post-processing: Suitable for post-processing outputs from other language models to meet higher standards or specific stylistic requirements.
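As a concrete post-processing sketch, a caller might wrap a draft and the rubric criteria into a single refinement prompt before handing it to the model. The template below is hypothetical; the model card does not document the input format the refiner was actually trained on.

```python
def build_refinement_prompt(draft: str, rubric_criteria: list[str]) -> str:
    """Assemble a rubric-guided refinement prompt (hypothetical template,
    not the format documented for this model)."""
    criteria = "\n".join(f"{i}. {c}" for i, c in enumerate(rubric_criteria, start=1))
    return (
        "Refine the draft below so it satisfies every rubric criterion.\n\n"
        f"Rubric:\n{criteria}\n\n"
        f"Draft:\n{draft}\n\n"
        "Refined version:"
    )


prompt = build_refinement_prompt(
    draft="The results was good.",
    rubric_criteria=["Grammatical correctness", "Concise phrasing"],
)
print(prompt)
```

A prompt built this way would then be tokenized and passed to the model's generate call, keeping the combined prompt and output budget under the 32768-token window.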