A cross-platform desktop speech-to-text application with real-time transcription
HyperWhisper is a lightweight desktop application that provides real-time audio transcription using the Deepgram API. Record your voice, get instant transcriptions, and optionally auto-type the text directly into any application.
- Real-time speech-to-text transcription
- Auto-type transcribed text directly into any application
- Audio recording with waveform visualization
- Recordings saved locally as WAV files
- Support for multiple audio input devices
- Dark theme UI
- Global keyboard shortcut support via D-Bus
- Works with HyperWhisper server or with Deepgram APIs
Download the latest release for your platform from the Releases page.
Linux:
- .deb package for Debian/Ubuntu
- .rpm package for Fedora
- .AppImage for other distributions
nix build
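As a rough aid for choosing among the formats listed above, the sketch below maps the `ID` and `ID_LIKE` fields of an os-release file to a package format. The function names `pick_package_format` and `detect_package_format` are hypothetical, and the mapping itself is an assumption made for illustration, not something the project prescribes.

```shell
# Map an os-release ID ($1) and ID_LIKE ($2, may be empty) to a package format.
# Assumed mapping: Debian/Ubuntu family -> deb, Fedora family -> rpm,
# NixOS -> nix (build from source), anything else -> AppImage.
pick_package_format() {
    case " $1 $2 " in
        *" debian "*|*" ubuntu "*) echo "deb" ;;
        *" fedora "*)              echo "rpm" ;;
        *" nixos "*)               echo "nix" ;;
        *)                         echo "AppImage" ;;
    esac
}

# Read ID and ID_LIKE from an os-release style file (default: /etc/os-release).
detect_package_format() {
    file="${1:-/etc/os-release}"
    if [ -r "$file" ]; then
        # Source the file in a subshell so its variables do not leak out.
        ( . "$file"; pick_package_format "${ID:-}" "${ID_LIKE:-}" )
    else
        pick_package_format "" ""
    fi
}
```

Calling `detect_package_format` with no argument inspects the running system; passing a path lets you try it against a copy of another machine's os-release file.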
- Linux with PipeWire/PulseAudio for audio capture
- For the auto-type feature: ydotool (Wayland) or xdotool (X11)
- Steps to enable auto-type on Linux distributions:
  - Make sure /dev/uinput is owned by the root user and the input group:
sudo tee /etc/udev/rules.d/99-uinput.rules << 'EOF'
KERNEL=="uinput", MODE="0660", GROUP="input", OPTIONS+="static_node=uinput"
EOF
sudo udevadm trigger --name-match=uinput
  - Create a ydotoold user service and enable it:
mkdir -p ~/.config/systemd/user/
cat > ~/.config/systemd/user/ydotoold.service << 'EOF'
[Unit]
Description=ydotoold daemon
[Service]
ExecStart=/usr/bin/ydotoold
Restart=always
[Install]
WantedBy=default.target
EOF
# Enable and start the service
systemctl --user enable --now ydotoold.service
  - Add your user to the input group (log out and back in for the change to take effect):
sudo usermod -aG input $USER
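The Linux auto-type steps above can be checked with a small read-only script. This is a sketch under a few assumptions: the function names are hypothetical, the session type is read from the `XDG_SESSION_TYPE` environment variable, and the service is named `ydotoold.service` as in the unit file above. It only reports status and changes nothing.

```shell
# Pick the auto-type backend for a session type ($1, default: $XDG_SESSION_TYPE):
# ydotool on Wayland, xdotool on X11.
pick_autotype_tool() {
    case "${1:-${XDG_SESSION_TYPE:-}}" in
        wayland) echo "ydotool" ;;
        x11)     echo "xdotool" ;;
        *)       echo "unknown" ;;
    esac
}

# Succeed if the space-separated group list in $1 contains "input".
in_input_group() {
    case " $1 " in
        *" input "*) return 0 ;;
        *)           return 1 ;;
    esac
}

# Print a short status report for the auto-type prerequisites.
check_autotype_setup() {
    tool=$(pick_autotype_tool)
    echo "auto-type tool for this session: $tool"
    if [ "$tool" != "unknown" ] && ! command -v "$tool" >/dev/null 2>&1; then
        echo "missing: $tool is not on PATH"
    fi
    if in_input_group "$(id -nG)"; then
        echo "ok: user is in the input group"
    else
        echo "missing: user is not in the input group"
    fi
    if [ -c /dev/uinput ]; then
        ls -l /dev/uinput   # expect group "input" and mode crw-rw----
    else
        echo "missing: /dev/uinput device node"
    fi
    if [ "$tool" = "ydotool" ]; then
        systemctl --user is-active ydotoold.service \
            || echo "missing: ydotoold.service is not active"
    fi
}
```

Run `check_autotype_setup` after completing the steps; any line starting with `missing:` points at the step that still needs attention.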
- For Ubuntu/Debian:
sudo apt install -y ydotool
sudo dpkg -i hyperwhisper_0.1.0_amd64.deb
- For Fedora:
sudo dnf install hyperwhisper-0.1.0-1.x86_64.rpm
- For NixOS:
nix build
- Steps to enable auto-type on macOS:
  - Go to Settings -> Privacy & Security -> Accessibility
  - Add hyperwhisper here and enable it
- For macOS:
  - You'll need Rust and Bun:
brew tap oven-sh/bun
brew install bun
brew install rust
  - Clone the repository, install dependencies, and build:
git clone https://github.com/hyperwhisper/app.git
cd app
bun install
bun tauri build
- Launch HyperWhisper
- Open Settings and configure your transcription service:
- HyperWhisper: Use the hosted service
- Deepgram: Use your own Deepgram API key
- Select your microphone
- Click the record button or use the global shortcut
- Speak and watch real-time transcription appear
- Click stop to finish recording
You can trigger recording from anywhere using:
hyperwhisper transcribe toggle
or via D-Bus:
dbus-send --session --type=method_call \
--dest=dev.hyperwhisper \
/dev/hyperwhisper \
dev.hyperwhisper.toggle_recording
Bind this command to a keyboard shortcut in your desktop environment for hands-free operation.
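If you want a single command to bind, one option is a small wrapper that prefers the CLI and falls back to the D-Bus call. The sketch below reuses the two commands shown above verbatim. The helper names (`have_cmd`, `toggle_command`, `toggle_recording`) are hypothetical, and the fallback order is an assumption.

```shell
# Succeed if the named command is available in this shell.
have_cmd() { command -v "$1" >/dev/null 2>&1; }

# Print the toggle command to use: the CLI if present, otherwise dbus-send.
toggle_command() {
    if have_cmd hyperwhisper; then
        echo "hyperwhisper transcribe toggle"
    elif have_cmd dbus-send; then
        echo "dbus-send --session --type=method_call --dest=dev.hyperwhisper /dev/hyperwhisper dev.hyperwhisper.toggle_recording"
    else
        return 1
    fi
}

# Run whichever toggle command is available.
toggle_recording() {
    cmd=$(toggle_command) || {
        echo "neither hyperwhisper nor dbus-send was found" >&2
        return 127
    }
    # Unquoted on purpose: the command string has no quoted arguments,
    # so plain word splitting yields the right argument list.
    $cmd
}
```

Saved to a file that ends with a call to `toggle_recording`, this gives your desktop environment's shortcut settings one script to point at, whichever mechanism is installed.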
# Clone the repository
git clone https://github.com/hyperwhisper/app.git
cd app
# Install dependencies
bun install
# Run in development mode
bun tauri dev
# Generate app icons from logo.png
bun tauri icon logo.png
# Production build
bun tauri build
Build artifacts will be in src-tauri/target/release/bundle/.
app/
├── src/                  # React frontend
│   ├── components/       # UI components
│   ├── hooks/            # React hooks
│   └── App.tsx           # Main application
├── src-tauri/            # Rust backend
│   ├── src/lib.rs        # Core application logic
│   └── icons/            # App icons
└── package.json
- Frontend: React 19, TypeScript, Tailwind CSS 4, shadcn/ui
- Backend: Rust, Tauri v2
- Audio: cpal (cross-platform audio)
