Name: Abdullahu5mani/flowscribe-qwen2.5-0.5b-v2 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Abdullahu5mani

FlowScribe: Qwen2.5-0.5B Speech Transcript Formatter

Abdullahu5mani/flowscribe-qwen2.5-0.5b-v2 is a specialized fine-tuned version of the Qwen2.5-0.5B-Instruct model, designed to address the common issues with raw speech-to-text (STT) outputs. Tools like Whisper often produce transcripts filled with filler words, self-corrections, and lacking proper punctuation or formatting. This model acts as a post-processor, transforming these raw inputs into polished, readable text.

Key Capabilities

Intelligent Text Formatting: Removes filler words ("um", "uh"), handles self-corrections, fixes grammar, and applies appropriate punctuation and structure.
Multi-Style Output: Supports various formatting styles to match specific use cases:
- Auto: An intelligent default for general cleanup and structuring.
- Professional: Formal tone, structured layout, perfect grammar.
- Casual: Preserves speaker's voice, light cleanup, contractions.
- Verbatim: Strips only "um"/"uh" and applies spoken formatting commands.
- Software_Dev: Formats code terms, variable names (e.g., camelCase, snake_case), and technical jargon.
- Enthusiastic: High energy, exclamation marks, positive phrasing.
Efficient Processing: At 0.5 billion parameters, it prioritizes speed and local deployment, making it suitable for resource-constrained environments.
Quantized Version Available: Includes a Q4_K_M quantized GGUF version for fast CPU/GPU inference via llama.cpp.

Training Details

FlowScribe was fine-tuned using LoRA on approximately 27,400 synthetically generated examples from the flowscribe-dataset. This dataset was created using Google Gemini and other OpenRouter models across 10 diverse domain scenarios, ensuring a broad understanding of formatting requirements.

Limitations

Optimized exclusively for English language processing.
Relies on synthetically generated training data, which may not cover all real-world dictation edge cases.
The small parameter size (0.5B) prioritizes speed and local deployment over raw linguistic capability, meaning complex reasoning tasks are not its primary focus.