StarWhisper

StarWhisper — Free Download. Offline voice transcription
StarWhisper is a speech-to-text conversion solution that operates without an internet connection. The application processes audio directly on the user's Windows device, employing GPU acceleration for real-time transcriptions. All audio and text data remains on the local computer, without being transmitted to external servers. The software is built upon the optimized Whisper.cpp engine, providing accurate speech recognition with support for multiple languages and models of various sizes.
5.0(1 ratings)

Download StarWhisper (Official links)
File size: 2.4 MB
The latest version of StarWhisper is: 1.3.105
Operating system: Windows
Languages: English
Price: $0.00 USD

  • Real-time transcription. This function converts speech into text immediately as the user speaks. The engine processes the audio stream with minimal latency, displaying the text in the main interface. This feature is designed for continuous dictation in word processors, email clients, or any text input field within the Windows system.
  • Fully offline operation. All speech recognition processing runs locally on the user's machine. No internet connection is required for core functionality. Language models are stored on the hard drive and loaded into system memory or VRAM during use.
  • GPU acceleration. The application offloads most of the computational workload to the graphics processing unit when available. This implementation significantly reduces CPU usage and enables real-time performance, even with the largest and most accurate recognition models.
  • Configurable language models. The user can select from different Whisper models, ranging from 'tiny' to 'large'. Smaller models offer higher transcription speed, while larger models provide superior accuracy, especially for complex audio or specific accents.
  • Push-to-dictate mode. A modality where transcription is only activated while the user holds down a configurable key. This method is suitable for inserting short phrases or commands without needing to manually toggle continuous dictation on and off.
  • Minimalist floating window. The user interface consists of a transparent window that remains always visible on top of other applications. It displays the transcription status, recent converted text, and basic controls without visual distractions.
  • Customizable keyboard shortcuts. Users can define key combinations to start and stop recording, activate push-to-dictate mode, pause recognition, or show/hide the application window. Shortcuts function globally within the system.
  • Automatic formatting and punctuation. The engine not only transcribes words but also inserts punctuation marks like periods, commas, question marks, and capitalizes the beginning of sentences. This post-processing improves the readability of the generated text.
  • Audio file transcription. Capability to upload pre-existing audio files in common formats (WAV, MP3, FLAC) and generate a complete text transcription. The function processes the entire file and saves the result in an editable text document.
  • Manual and automatic language selection. The user can set the input language to improve accuracy, or let the model detect it automatically. Automatic language detection analyzes the first few seconds of audio to determine the most likely linguistic setting.
  • CPU compatibility mode. A fallback mechanism that activates automatically on systems without a dedicated GPU or with problematic drivers. In this mode, all neural network calculations run on the central processor, maintaining full offline functionality.
  • Visual status indicators. The interface features icons and color changes that inform the user about the current status: standby, recording, processing, or paused. These indicators provide immediate feedback on system activity.
  • Transcription history. A log that automatically saves recent dictation sessions. Users can review, copy, or export previously transcribed texts from a dedicated section of the application.
  • Basic noise reduction. Pre-processing of the input audio that applies filters to minimize constant ambient noise before the signal reaches the recognition model. This processing improves results in non-ideal environments.

The development of StarWhisper began in 2023 as a native Windows implementation of the open-source project Whisper.cpp, which itself is a C++ port of OpenAI's Whisper model. The developers are an independent team focused on creating productivity tools with built-in privacy. The application is primarily written in C++ for the processing core and uses the Qt framework for the graphical user interface. The choice of C++ ensures near-metal performance and efficient resource consumption, while Qt provides a cross-platform foundation for potential future development on other operating systems.


Alternatives to StarWhisper:

Glimp — Free Download. Real-time AI assistant for job interviews

Glimp

Glimp is an artificial intelligence interview copilot that provides real-time assistance during virtual job interviews.
Price: Free   Size: 25 MB   Version: 0.1.7   OS: Windows
Speakey — Free Download. Local and private voice dictation

Speakey

Speakey is a real-time dictation application that processes speech directly on the user's computer, without relying on cloud services.
Price: $45   Size: 356 MB   Version: 1.3.0   OS: Windows
RocketWhisper — Free Download. Offline AI transcription with GPU

RocketWhisper

RocketWhisper is a desktop application for speech recognition and transcription based on the OpenAI Whisper engine.
Price: $32   Size: 110 MB   Version: 1.2.0   OS: Windows, Linux, MacOS
VoiceOS — Free Download. Instant voice-to-text in every app

VoiceOS

VoiceOS is a universal voice interface that instantly transcribes natural speech into polished, written text in any application.
Price: Free   Size: 117 MB   Version: 1.0   OS: Windows
Typeless — Free Download. AI Voice Dictation

Typeless

In the competitive world of voice dictation, Typeless positions itself as an artificial intelligence tool that not only transcribes but also edits and polishes text in real-time.
Price: Free   Size: 129 MB   Version: 0.9.6   OS: Windows, Mac OS, Android, iOS
BB Recorder — Free Download. Local Recording and Private Transcription

BB Recorder

BB Recorder is a meeting and call recording application that operates entirely on the users device.
Price: Free   Size: 22 MB   Version: 1.0.0   OS: MacOS, iOS
Vowen — Free Download. Speech-to-Text and Voice control software

Vowen

Productivity software that converts speech into text and commands executed locally on macOS and Windows.
Price: Free   Size: 156 MB   Version: 0.1.12   OS: Windows, MacOS
VoiceInk — Free Download. Local Voice Transcription for macOS

VoiceInk

VoiceInk is a voice dictation and transcription application that uses local AI models to convert speech to text with precision, operating completely offline and respecting user privacy.
Price: Free   Size: 12.61 MB   Version: 1.70   OS: MacOS
OpenWispr — Free Download. Local and cloud voice transcription

OpenWispr

This program is an open-source desktop dictation application that converts speech into text.
Price: Free   Size: 115 MB   Version: 1.0.14   OS: Windows, Linux, MacOS
Pipit — Free Download. Local voice transcription for macOS

Pipit

The Pipit app converts speech to text in real time using AI models that run completely on the device.
Price: Free   Size: 8.2 MB   Version: 1.05   OS: MacOS
Whispering Tiger — Free Download. Transcription, translation and voice synthesis

Whispering Tiger

Whispering Tiger is a comprehensive application for speech-to-text conversion, text processing, text extraction from images, and other tasks.
Price: Free   Size: 13.2 MB   Version: 1.3.9.8   OS: Windows