CL-From-Nothing/Qwen3-1-7B-SSD-RLVE-Eval20-N20-global-step-500

Text Generation · Concurrency Cost: 1 · Model Size: 2B · Quant: BF16 · Ctx Length: 32k · Published: Apr 24, 2026 · License: MIT · Architecture: Transformer · Open Weights

CL-From-Nothing/Qwen3-1-7B-SSD-RLVE-Eval20-N20-global-step-500 is a 1.7-billion-parameter Qwen3-based causal language model with a 32,768-token context length. It is a merged checkpoint from a VERL FSDP SFT training run, captured at global step 500 and fine-tuned on a specialized 16k-row SFT corpus, which makes it best suited to the conversational and evaluative tasks represented in that data.


Model Overview

CL-From-Nothing/Qwen3-1-7B-SSD-RLVE-Eval20-N20-global-step-500 is a 1.7-billion-parameter language model built on the Qwen3 architecture, with a 32,768-token context window. It represents a specific checkpoint (global step 500) from a VERL FSDP Supervised Fine-Tuning (SFT) run.
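Assuming the merged checkpoint follows the standard Hugging Face repository layout (an assumption, since the card shows no usage code), it could be loaded with the transformers AutoModel API. The repo id below is taken from the card, and the dtype matches the listed BF16 quantization:

```python
# Sketch: loading the checkpoint with Hugging Face transformers.
# The repo id comes from this card; BF16 matches the listed quantization.
MODEL_ID = "CL-From-Nothing/Qwen3-1-7B-SSD-RLVE-Eval20-N20-global-step-500"

def load_model(model_id: str = MODEL_ID):
    """Return (tokenizer, model); transformers is imported lazily so the
    sketch can be inspected without the library installed."""
    from transformers import AutoModelForCausalLM, AutoTokenizer
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype="bfloat16",  # card lists BF16 weights
        device_map="auto",       # requires accelerate; an assumption
    )
    return tokenizer, model
```

At 1.7B parameters in BF16, the weights occupy roughly 3.4 GB, so the model fits comfortably on a single consumer GPU.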

Key Characteristics

  • Architecture: Based on the Qwen3 model family.
  • Parameter Count: 1.7 billion parameters, balancing capability against computational cost.
  • Context Length: Supports a 32,768-token context, enabling long inputs and conversational coherence over extended interactions.
  • Training Data: Fine-tuned on a specialized 16,000-row Parquet SFT corpus, specifically the CL-From-Nothing/RLVE-Eval20-Qwen3-1.7B-SSD-N20-SFT-Train dataset. This dataset's structure, featuring a messages column, suggests a focus on conversational or instruction-following tasks.
  • Development Stage: This is a checkpoint from an ongoing training run, captured after 500 optimizer steps, corresponding to one epoch of the training schedule.
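The messages column noted above typically stores each training row as a list of role-tagged turns. The concrete schema below is a hypothetical illustration of that layout, not an excerpt from the dataset:

```python
# Hypothetical row layout for a conversational SFT corpus with a
# `messages` column; the actual dataset contents are not published here.
example_row = {
    "messages": [
        {"role": "user", "content": "Explain what global step 500 means."},
        {"role": "assistant", "content": "It is the 500th optimizer update."},
    ]
}

def to_prompt(messages):
    """Flatten role-tagged turns into a plain-text prompt (illustrative;
    real inference would use the tokenizer's chat template instead)."""
    return "\n".join(f"{m['role']}: {m['content']}" for m in messages)
```

Keeping rows in this role/content form lets the same corpus drive either SFT loss masking (train only on assistant turns) or chat-template rendering at inference time.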

Intended Use Cases

This model is particularly well-suited for applications that align with its fine-tuning data, likely involving:

  • Conversational AI: Generating responses in dialogue systems.
  • Instruction Following: Executing specific commands or answering questions based on provided instructions.
  • Evaluation-focused tasks: The "RLVE-Eval20" designation in the dataset name suggests settings where model responses are scored against defined criteria.
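For the conversational and instruction-following uses above, a single dialogue turn can be sketched as follows; the helper name, sampling settings, and use of the tokenizer's chat template are assumptions, not part of the card:

```python
# Sketch of one dialogue turn; `model` and `tokenizer` are assumed to come
# from a prior transformers load of this checkpoint.
def chat(model, tokenizer, user_message, max_new_tokens=256):
    messages = [{"role": "user", "content": user_message}]
    input_ids = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    output = model.generate(input_ids, max_new_tokens=max_new_tokens)
    # Decode only the newly generated tokens, not the echoed prompt.
    return tokenizer.decode(
        output[0][input_ids.shape[-1]:], skip_special_tokens=True
    )
```

For multi-turn use, the `messages` list would simply accumulate prior user and assistant turns before each call.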