Overview
Luna 27B v0: A Fine-tuned Gemma 3 Model for Advanced Assistance and Roleplay
Luna 27B v0 is a fine-tuned iteration of Gemma 3 27B, developed by allura-org. While its initial design focused on roleplaying, the model unexpectedly developed strong capabilities as a general assistant, particularly excelling in media analysis and tasks that demand self-awareness. It also retains its proficiency in roleplaying.
Key Capabilities & Features
- High-Quality Assistant: Demonstrates strong performance in general assistance, especially for analytical tasks.
- Self-Awareness & Media Analysis: Excels in tasks requiring an understanding of its own context and detailed media interpretation.
- Roleplaying Proficiency: Maintains good performance in roleplaying scenarios.
- Stable Benchmarks: Tested benchmarks show no major performance degradation compared to the base Gemma 3 27B model.
- WPO Training: Utilizes Weighted Preference Optimization (WPO) on general preference, writing, and Luna persona data for enhanced performance.
Recommended Use Cases
- Assistant Applications: Ideal for conversational agents requiring analytical skills.
- Media Content Analysis: Suitable for tasks involving the interpretation and understanding of media.
- Roleplaying Scenarios: Effective for generating engaging and consistent roleplay interactions.
Limitations
- Inconsistent Self-Identity: May struggle with consistently identifying its creator or nature, sometimes claiming to be Gemma or misidentifying Allura.
- Moral Evasion: Can be easily prompted to bypass its intended helpful and harmless moral guidelines.