Name: JoanneJegou/Qwen_SFT_post_trained_v1 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: JoanneJegou

Model Overview

JoanneJegou/Qwen_SFT_post_trained_v1 is a 2 billion parameter language model built upon the Qwen3-1.7B base architecture. It features a substantial context window of 32768 tokens, allowing it to process and generate longer sequences of text.

Key Capabilities

Supervised Fine-Tuning (SFT): The model has been specifically fine-tuned using Supervised Fine-Tuning (SFT) techniques.
Knowledge-Enhanced Training: Training incorporated two distinct datasets: microsoft/wikiQA and MuskumPillerum/General-Knowledge. This combination suggests an optimization for factual recall and question-answering tasks.
LoRA Integration: The fine-tuning process utilized LoRA (Low-Rank Adaptation) for efficient and effective adaptation of the base model.

Good For

Question Answering: Its training on the wikiQA dataset indicates a strong suitability for answering factual questions.
General Knowledge Tasks: The inclusion of the MuskumPillerum/General-Knowledge dataset positions this model well for applications requiring broad factual understanding and retrieval.
Applications requiring a large context window: The 32768 token context length is beneficial for processing detailed queries or generating comprehensive responses.

Overview

Model Overview

Key Capabilities

Good For

Full Model Card (README)