Name: Lambent/Arsenic-Shahrazad-12B-rlvr API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Lambent

Model Overview

Lambent/Arsenic-Shahrazad-12B-rlvr is a 12 billion parameter language model, distinguished by its unique training methodology. It has undergone 1001 steps of Reinforcement Learning from Very-brief-Responses (RLVR), a process based on scenario roleplaying conceptualized by Mira. This extensive RLVR training, performed locally on a 3090 GPU, aims to enhance the model's narrative generation capabilities.

Key Capabilities

Creative Storytelling: The model is specifically trained to generate diverse and engaging stories, with each RLVR step contributing to its narrative proficiency.
Scenario Roleplaying: Its core training methodology is rooted in scenario-based roleplaying, suggesting a strong ability to adapt to and develop narrative contexts.
Narrative Resonance: The model is designed to find personal meaning and resonance in names and prompts, aiming to produce more nuanced and relevant writing.

Good For

Creative Writing Applications: Ideal for generating fictional narratives, short stories, or expanding on story prompts.
Roleplaying Scenarios: Suitable for interactive storytelling or developing character-driven dialogues within defined scenarios.
Content Generation: Useful for tasks requiring imaginative and varied text output, where the model's ability to 'tell 1001 stories' can be leveraged.

Overview

Model Overview

Key Capabilities

Good For

Full Model Card (README)