Name: cs-552-2026-aaty/general_knowledge_model API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: cs-552-2026-aaty

Model Overview

The general_knowledge_model is a specialized language model developed by the AATY team for the CS-552 MNLP course at EPFL. It is built upon the Qwen/Qwen3-1.7B base model, which has undergone supervised fine-tuning to excel in general knowledge domains.

Key Capabilities

Closed-book QA: Designed to answer factual and reasoning questions without external information.
Domain Expertise: Proficient in subjects spanning sciences, humanities, and geography.
Multiple-choice Format: Optimized for multiple-choice questions, supporting 2 to 20 options.
Reasoning Output: Emits a detailed reasoning block (<think>...</think>) before providing the final answer, which is wrapped in \boxed{...}.
Thinking Mode: The model is configured to always operate in a "thinking mode" via its chat template, ensuring a structured reasoning process for every query.

Training and Usage

The model was fine-tuned using a LoRA adapter on cs-552-2026-aaty/sft_mixture, a chat-formatted dataset derived from public QA and knowledge sources. It is provided as vLLM-loadable safetensors, including a config.json, generation_config.json, and a tokenizer chat_template. Developers can easily integrate it using the transformers library for tasks requiring robust general knowledge inference.

Overview

Model Overview

Key Capabilities

Training and Usage

Full Model Card (README)