v2ray/GPT4chan-24B

Text Generation · Concurrency Cost: 2 · Model Size: 24B · Quant: FP8 · Ctx Length: 32k · Published: Feb 4, 2025 · License: MIT · Architecture: Transformer · Open Weights

GPT4chan-24B by v2ray is a 24-billion-parameter language model, merged from mistralai/Mistral-Small-24B-Base-2501 and v2ray/GPT4chan-24B-QLoRA and trained for approximately 5 epochs. It has a 32768-token context length and is intended for mentally sane generations and research purposes. The model uses a specific prompt format for board-style content generation.


GPT4chan-24B Overview

GPT4chan-24B is a 24-billion-parameter language model developed by v2ray, built by merging mistralai/Mistral-Small-24B-Base-2501 with v2ray/GPT4chan-24B-QLoRA. It was trained on 8x H100 GPUs with a global batch size of 64 and a learning rate of 2e-4 for 4000 steps, which corresponds to approximately 5 epochs. The model supports a context length of 32768 tokens.
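As a quick orientation, here is a minimal loading sketch using the standard Hugging Face transformers API. The repo id comes from this card; the dtype choice, prompt string, and sampling parameters are illustrative assumptions, not settings from the model card:

```python
# Minimal sketch for loading GPT4chan-24B locally with transformers.
# Assumptions: bf16 as a safe local default (the hosted deployment advertises
# FP8), and illustrative sampling parameters.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "v2ray/GPT4chan-24B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

prompt = "g"  # board name; see the prompt format described below
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128, do_sample=True, temperature=0.8)
print(tokenizer.decode(output[0], skip_special_tokens=False))
```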

Key Characteristics

  • Architecture: Merged model based on Mistral-Small-24B-Base-2501.
  • Training: Fine-tuned for 4000 steps (approx. 5 epochs) on 8x H100 GPUs.
  • Prompt Format: Uses a board<|start_header_id|>id<|end_header_id|>content structure for board-style thread generation; see the sketch after this list.
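The helper below illustrates how a prompt in that format might be assembled. The function name and the example board, post id, and content are hypothetical; only the board<|start_header_id|>id<|end_header_id|>content layout itself comes from this card:

```python
# Hypothetical helper for the prompt format described above; the id and
# content conventions shown here are assumptions, not verified against
# the upstream model card.
def format_post(board: str, post_id: str, content: str) -> str:
    """Assemble one post in the board<|start_header_id|>id<|end_header_id|>content layout."""
    return f"{board}<|start_header_id|>{post_id}<|end_header_id|>{content}"

prompt = format_post("g", "10000001", "What do you think of local language models?\n")
print(prompt)
```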

Usage Guidelines

This model is intended for:

  • Mentally sane generations.
  • Research purposes only.
  • Promoting positive interactions.

Users are explicitly advised not to use the model for dead internet theory activities, for generating inharmonious content, or with forbidden terms such as "gex".