Finance Entity Extractor (FinEE) v1.0
FinEE is a specialized, production-grade financial entity extraction model designed for Indian bank messages. Developed by Ranjit Behera, it employs a hybrid architecture that prioritizes efficiency and accuracy. By default, it operates with a highly optimized regex engine, delivering sub-millisecond latency and 87% accuracy without requiring any model download. For more complex or ambiguous cases, it can optionally engage a 3.8 billion parameter Phi-3 LLM, boosting accuracy to 94.5% with a one-time 7GB download.
Key Capabilities
- Hybrid Performance: Achieves <1ms latency with regex for 87% accuracy, and ~50ms with the LLM for 94.5% accuracy.
- Offline Operation: The regex component runs 100% offline, with the LLM also running locally after initial download.
- Guaranteed Output Schema: Provides a consistent JSON output for extracted entities like
amount, currency, type, merchant, and category. - Optimized for Indian Banks: Specifically trained and tested for messages from major Indian banks (HDFC, ICICI, SBI, Axis, Kotak).
- Robust Edge Case Handling: Designed to correctly parse challenging inputs, including varied currency formats, missing spaces, and multiple values.
- Multi-platform Support: Compatible with Apple Silicon (MLX), NVIDIA GPUs (PyTorch/CUDA), and CPUs (llama.cpp/GGUF).
Good For
- Extracting structured data from unstructured financial SMS or push notifications.
- Applications requiring high-speed, accurate parsing of transaction details.
- Developers needing a reliable, locally deployable solution for financial entity recognition in an Indian context.