Ameb1/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-feline_stinky_walrus
Ameb1/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-feline_stinky_walrus is a 0.5 billion parameter instruction-tuned language model, fine-tuned from unsloth/Qwen2.5-0.5B-Instruct. This model was trained using the GRPO method, which is designed to enhance mathematical reasoning capabilities. With a context length of 32768 tokens, it is optimized for tasks requiring robust reasoning, particularly in mathematical contexts. It is suitable for applications where a compact yet capable model for instruction-following and reasoning is needed.
Model Overview
Ameb1/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-feline_stinky_walrus is a 0.5 billion parameter instruction-tuned language model, building upon the unsloth/Qwen2.5-0.5B-Instruct base. This model has been fine-tuned using GRPO (Group Relative Policy Optimization), the reinforcement-learning method introduced in the DeepSeekMath paper, to enhance its reasoning capabilities.
Key Characteristics
- Base Model: Fine-tuned from unsloth/Qwen2.5-0.5B-Instruct.
- Parameter Count: 0.5 billion parameters, offering a compact footprint.
- Context Length: Supports a substantial context window of 32768 tokens.
- Training Method: Utilizes GRPO, a technique aimed at improving mathematical and general reasoning.
- Frameworks: Trained with TRL, Transformers, PyTorch, Datasets, and Tokenizers.
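The core idea behind GRPO is group-relative advantages: several completions are sampled per prompt, each is scored by a reward function, and each reward is normalized against its own group's statistics instead of a learned value model. A minimal, illustrative sketch of that normalization step (not the actual training code used for this model):

```python
import statistics

def group_relative_advantages(rewards):
    """Normalize each reward against its group's mean and std, GRPO-style.

    `rewards` holds the scores of all completions sampled for one prompt.
    """
    mean = statistics.mean(rewards)
    std = statistics.pstdev(rewards) or 1.0  # guard against all-equal groups
    return [(r - mean) / std for r in rewards]

# Example: four sampled answers to one math prompt, scored 1.0 if correct, else 0.0.
print(group_relative_advantages([1.0, 0.0, 0.0, 1.0]))  # [1.0, -1.0, -1.0, 1.0]
```

Correct completions thus receive positive advantages and incorrect ones negative, relative only to their sibling samples.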
Use Cases
This model is particularly well-suited for:
- Instruction Following: Designed to accurately follow user instructions.
- Reasoning Tasks: Benefits from GRPO training for improved logical and mathematical reasoning.
- Resource-Constrained Environments: Its small size makes it efficient for deployment where computational resources are limited.
- Prototyping and Development: A good choice for quickly experimenting with instruction-tuned models that require some reasoning capacity.
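A minimal inference sketch using the Transformers library (assumes `transformers` and `torch` are installed; the chat template is inherited from Qwen2.5, and the prompt here is just a placeholder):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Ameb1/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-feline_stinky_walrus"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")

# Build a chat-formatted prompt from a single user message.
messages = [{"role": "user", "content": "What is 17 * 24?"}]
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=256)

# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(
    outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True
))
```

Given the 0.5B parameter count, this runs comfortably on CPU, though generation is faster on a GPU.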