Model Overview
This model, xiaolesu/qwen3-8b-sft-stmt-tk-v2, is an 8-billion-parameter language model built on the Qwen3-8B architecture. It has undergone supervised fine-tuning (SFT) on the xiaolesu/lean4-sft-stmt-tk-v2 dataset, indicating a specialization in formal mathematics and theorem proving in the Lean 4 environment.
Key Characteristics
- Base Model: Qwen/Qwen3-8B, a robust foundation for language understanding.
- Fine-tuning Focus: Specialized for Lean 4, suggesting proficiency in generating or understanding Lean 4 statements and tactics.
- Context Length: Supports a substantial context window of 32768 tokens, beneficial for complex proofs or extended code segments.
- Training Details: Trained with a learning rate of 1e-05 over 488 steps, achieving a validation loss of 0.4793 and a perplexity (PPL) of 1.6150; the two figures are mutually consistent, as the check after this list shows.
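Perplexity is the exponential of the mean cross-entropy loss, so the reported numbers can be verified directly from one another (a minimal check, using only the figures above):

```python
import math

val_loss = 0.4793          # reported validation loss
print(math.exp(val_loss))  # ~1.6150, matching the reported PPL
```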
Intended Use Cases
This model is particularly suited for applications that interact with the Lean 4 theorem prover. Potential uses include the following; a usage sketch appears after the list:
- Assisting in the generation of Lean 4 code or proof steps.
- Understanding and interpreting Lean 4 statements.
- Educational tools for learning Lean 4.
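As a starting point, the model should load like any other Qwen3-based causal language model in Hugging Face transformers. The sketch below is an assumption based on the Qwen3-8B base model, not documented behavior of this checkpoint; the prompt, chat-template usage, and generation settings are illustrative:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "xiaolesu/qwen3-8b-sft-stmt-tk-v2"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"
)

# Hypothetical prompt: ask the model to formalize a statement in Lean 4.
messages = [{"role": "user",
             "content": "Formalize in Lean 4: the sum of two even integers is even."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

# The 32768-token context window leaves room for long statements and proofs;
# max_new_tokens here is an arbitrary choice.
output = model.generate(input_ids, max_new_tokens=512)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```

If the checkpoint was fine-tuned without a chat template, plain-text prompting via tokenizer(prompt, return_tensors="pt") would be the fallback.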
Limitations
Because the model was fine-tuned specifically on Lean 4 data, its performance on general-purpose language tasks may not match that of models trained on broader datasets. The README does not detail further limitations or broader intended uses.