TheFinAI/Fin-o1-8B

Warm
Public
8B
FP8
32768
1
May 15, 2025
License: apache-2.0
Hugging Face
Overview

Overview

Fin-o1-8B is an 8 billion parameter language model developed by TheFinAI, built upon the Qwen3-8B architecture. It has been specifically fine-tuned using Supervised Fine-Tuning (SFT) and Grouped Reinforcement Learning with Policy Optimization (GRPO) to excel in financial reasoning tasks. The training utilized a specialized dataset, TheFinAI/FinCoT, which is derived from various financial benchmarks including FinQA, TATQA, DocMath-Eval, Econ-Logic, and BizBench-QA.

Key Capabilities

  • Enhanced Financial Reasoning: Optimized for complex financial mathematical reasoning and analytical tasks.
  • Specialized Training: Benefits from SFT and GRPO on a diverse financial dataset, improving its understanding and generation of financial insights.
  • Qwen3-8B Base: Inherits the robust capabilities and tokenizer of its Qwen3-8B base model.

Good For

  • Financial Analysis: Applications requiring detailed financial calculations and reasoning.
  • Quantitative Finance: Tasks involving the interpretation and processing of financial data.
  • Research: Academic or industry research focused on financial language understanding and generation.