EpistemeAI/ReasoningCore-3B-RE1-V2C Overview
ReasoningCore-3B-RE1-V2C is an experimental, 3.2-billion-parameter, multilingual, reasoning-enhanced large language model developed by EpistemeAI. It is fine-tuned from EpistemeAI/ReasoningCore-3B-RE1-V2B and built on an optimized transformer architecture. The model incorporates specialized reasoning pathways and has been fine-tuned using Group Relative Policy Optimization (GRPO), supervised learning, and reinforcement learning from human feedback (RLHF) to align with human expectations for clarity, accuracy, and safety.
Key Capabilities & Features
- Reasoning Enhancement: Instruction-tuned to excel at nuanced reasoning, dialogue management, retrieval, and summarization tasks.
- Multilingual Support: Officially supports English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
- Optimized Architecture: Auto-regressive language model with an optimized transformer architecture and a 32,768-token context length.
- GRPO Post-Training: Uses Group Relative Policy Optimization (GRPO) to enhance performance on extended reasoning tasks, particularly mathematical problem-solving.
- Safety & Alignment: Incorporates built-in safety guardrails, adversarial prompt training, and iterative fine-tuning to mitigate risks and ensure responsible deployment.
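To illustrate the GRPO idea mentioned above: instead of scoring each sampled response against a learned value model, GRPO normalizes the reward of each response against the mean and standard deviation of its own sampling group. The sketch below is a toy illustration of that group-relative advantage computation only, not EpistemeAI's training code; the reward values are made up.

```python
# Toy sketch of the group-relative advantage at the heart of GRPO
# (Group Relative Policy Optimization). Rewards for several sampled
# responses to the same prompt are normalized against the group's own
# mean and standard deviation; assumes the rewards are not all identical.
from statistics import mean, stdev

def group_relative_advantages(rewards):
    """Normalize each reward against its group's mean and sample std."""
    mu = mean(rewards)
    sigma = stdev(rewards)
    return [(r - mu) / sigma for r in rewards]

# Four hypothetical sampled answers to one math prompt, scored by a
# reward function (e.g. 1.0 for a correct final answer, partial credit
# for a correct method with an arithmetic slip):
rewards = [1.0, 0.0, 0.5, 0.0]
advantages = group_relative_advantages(rewards)
```

Responses scoring above the group mean receive positive advantages and are reinforced; below-mean responses are pushed down, with the group itself serving as the baseline.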
Intended Use Cases
- Conversational AI: For assistant-like interactions.
- Knowledge Retrieval & Summarization: Dynamic extraction and condensation of information.
- Mobile AI-Powered Writing Assistants: Query reformulation and natural language generation.
- General Natural Language Generation: Applications benefiting from advanced reasoning abilities, including mathematical problem-solving with specific prompting strategies.
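As a concrete example of the prompting strategies mentioned above, a step-by-step reasoning request can be expressed as a standard chat-format message list. The system prompt below is illustrative, not an official EpistemeAI recommendation.

```python
# Hypothetical prompting sketch for eliciting step-by-step mathematical
# reasoning. The message list follows the standard chat format accepted
# by tokenizer.apply_chat_template in Hugging Face transformers.

def build_reasoning_prompt(question: str) -> list[dict]:
    """Assemble a chat-format message list with a reasoning system prompt."""
    system = (
        "You are a careful mathematical reasoner. Think step by step, "
        "show your working, and put the final answer on its own line."
    )
    return [
        {"role": "system", "content": system},
        {"role": "user", "content": question},
    ]

messages = build_reasoning_prompt(
    "A train covers 120 km in 1.5 hours. What is its average speed in km/h?"
)
```

These messages would then be rendered with the model's chat template (`tokenizer.apply_chat_template(messages, add_generation_prompt=True)`) and passed to `model.generate(...)`.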