vyomaalabs/verixa-3b

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kPublished:May 24, 2026Architecture:Transformer Warm

vyomaalabs/verixa-3b is a 3.1 billion parameter causal language model developed by vyomaalabs, fine-tuned for general text generation tasks. This model leverages a 32768 token context length, making it suitable for processing longer inputs and generating coherent, extended responses. It is designed for applications requiring versatile text generation capabilities across various prompts.

Loading preview...

Model Overview

vyomaalabs/verixa-3b is a 3.1 billion parameter causal language model developed by vyomaalabs. It has been fine-tuned using the TRL (Transformers Reinforcement Learning) library, indicating a focus on improving its performance through advanced training techniques.

Key Capabilities

  • General Text Generation: The model is capable of generating coherent and contextually relevant text based on diverse prompts.
  • Extended Context Handling: With a context length of 32768 tokens, it can process and generate longer sequences of text, maintaining consistency over extended conversations or documents.
  • Instruction Following: The model is fine-tuned to respond to user instructions, as demonstrated by its quick start example for question answering.

Training Details

This model was trained using Supervised Fine-Tuning (SFT) methods. The development utilized several key framework versions, including TRL 1.5.0, Transformers 5.9.0, Pytorch 2.5.1, Datasets 4.8.5, and Tokenizers 0.22.2.

When to Use This Model

This model is suitable for developers looking for a moderately sized language model (3.1B parameters) that offers good performance in general text generation tasks and can handle substantial context. It can be integrated into applications requiring conversational AI, content creation, or question-answering systems where a balance between model size and capability is desired.