Model Overview
This model, laion/Kimi-2-5-r2egym_sandboxes-maxeps-32k__Qwen3-8B, is an 8-billion-parameter language model derived from the Qwen3-8B architecture. It has been fine-tuned on the Kimi-2.5-r2egym_sandboxes-maxeps-32k dataset, indicating a focus on tasks involving reinforcement learning environments, sandbox simulations, or similar interactive contexts. Training used a learning rate of 4e-05 with a cosine learning rate scheduler and a warmup ratio of 0.1, and ran for 7 epochs with a total batch size of 16 across 8 GPUs.
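The schedule described above (linear warmup over the first 10% of steps, then cosine decay) can be sketched in plain Python. Note that `lr_at` is a hypothetical helper written for illustration, not code from the actual training pipeline:

```python
import math

def lr_at(step, total_steps, base_lr=4e-5, warmup_ratio=0.1):
    """Learning rate at a given optimizer step: linear warmup, then cosine decay.

    Mirrors the reported hyperparameters (base lr 4e-05, warmup ratio 0.1);
    the exact trainer implementation may differ in details.
    """
    warmup_steps = int(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Linear ramp from 0 up to base_lr over the warmup phase.
        return base_lr * step / max(1, warmup_steps)
    # Cosine decay from base_lr down to 0 over the remaining steps.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))
```

For example, with 1,000 total steps the rate ramps up over the first 100 steps, peaks at 4e-05, and decays smoothly to zero by the final step.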
Key Characteristics
- Base Model: Qwen/Qwen3-8B, a robust foundation for general language understanding.
- Specialized Fine-tuning: Trained on a dataset (Kimi-2.5-r2egym_sandboxes-maxeps-32k) that suggests expertise in specific interactive or simulated environments.
- Context Length: Supports a 32,768-token context window, enabling the processing of extensive inputs relevant to complex scenarios.
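Since the checkpoint derives from Qwen3-8B, it should load through the standard Hugging Face Transformers causal-LM interface. The following is a minimal sketch, assuming the repository id above is available on the Hub and that a recent `transformers` with Qwen3 support is installed; generation settings are illustrative, not recommendations from the model authors:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "laion/Kimi-2-5-r2egym_sandboxes-maxeps-32k__Qwen3-8B"

# Load the tokenizer and model; device_map="auto" places the weights
# across available GPUs (an 8B model needs roughly 16 GB in bf16).
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Build a chat-formatted prompt with the tokenizer's chat template.
messages = [{"role": "user", "content": "Describe the current sandbox state."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

# Generate a response; max_new_tokens is an illustrative setting.
outputs = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```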
Potential Use Cases
- Reinforcement Learning: Generating actions, understanding states, or providing natural language interfaces for RL agents within sandbox environments.
- Game AI: Developing intelligent agents for games that require complex decision-making or narrative generation based on in-game states.
- Simulation Analysis: Interpreting and summarizing events or outcomes from detailed simulations.
- Interactive Storytelling: Creating dynamic narratives that adapt to user input within constrained, rule-based worlds.