GitHub - Beerspitnight/sound2text: Whisper-based Audio Transcription App - Desktop GUI - A simple 🖥️ app that turns 🔉 into text. Built because I needed a straightforward way to transcribe 🫛 episodes for video captions. It uses OpenAI's Whisper model to handle the transcription—you feed it an audio file, and it returns a .SRT file, ready for use.

Sound2Text

GUI Audio Transcriber

This is a simple, minimalist desktop app that transcribes audio files using OpenAI's Whisper API. The script provides a graphical user interface to select an audio file and save the resulting transcription file.

* Output: an .srt subtitle file chunked and timestamped by word, including punctuation, which was a pain in the butt to sort out.
* Users can choose to include or exclude line numbers.

Features

* Select local audio files (.mp3, .wav, etc.).
* Generates punctuated, timestamped transcriptions.
* Saves output in the standard .srt format (transcribe_logic_line_numbs.py).
* Saves output in .srt format without line numbers, which is useful for OpenShot (transcribe_logic.py).
* Simple, minimalist user interface.
* timestamp_modifier.py: run separately once transcription is complete to ensure each timestamp has a duration of at least 300 ms.

How to Use

Ensure you have Python 3 installed.
Install the required dependencies: pip install -r requirements.txt
Set your OpenAI API key as an environment variable named OPENAI_API_KEY.
Run the application: python transcribe_gui.py
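
If the app fails at startup, a missing API key is a likely cause. The sketch below shows one way to check for it early; `get_api_key` is a hypothetical helper, not a function from this repository, and only the variable name OPENAI_API_KEY comes from the steps above.

```python
import os


def get_api_key(env=os.environ):
    """Hypothetical helper: return the key or fail with a clear message."""
    key = env.get("OPENAI_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "OPENAI_API_KEY is not set; export it before running transcribe_gui.py"
        )
    return key
```

Failing early like this gives a readable error before any audio file is selected, rather than an authentication error partway through a transcription.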

The GUI shows a checkbox that allows users to choose whether they want to exclude line numbers in their SRT output.
The command line version supports the --no-line-numbers flag for the same functionality.
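
As a rough sketch of how the line-number option could work, the example below wires a `--no-line-numbers` flag to an .srt writer. The positional `audio_file` argument and the names `parse_args`, `build_srt`, and `format_timestamp` are assumptions for illustration; the repository's own scripts may be organized differently.

```python
import argparse


def format_timestamp(seconds):
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def build_srt(cues, line_numbers=True):
    """Build SRT text from (start, end, text) tuples.

    With line_numbers=False the index line is left out of every cue.
    """
    blocks = []
    for index, (start, end, text) in enumerate(cues, start=1):
        lines = []
        if line_numbers:
            lines.append(str(index))
        lines.append(f"{format_timestamp(start)} --> {format_timestamp(end)}")
        lines.append(text)
        blocks.append("\n".join(lines))
    return "\n\n".join(blocks) + "\n"


def parse_args(argv=None):
    """Hypothetical CLI: one audio file plus the --no-line-numbers flag."""
    parser = argparse.ArgumentParser(description="Transcribe audio to .srt")
    parser.add_argument("audio_file", help="path to the audio file (assumed)")
    parser.add_argument(
        "--no-line-numbers",
        action="store_true",
        help="omit the numeric index line from each SRT cue",
    )
    return parser.parse_args(argv)
```

For example, `build_srt(cues, line_numbers=not parse_args().no_line_numbers)` would produce numbered cues by default and unnumbered cues when the flag is passed.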

Acknowledgements

This tool is powered by the OpenAI Whisper API.
The code for this project was developed with some assistance from Google's Gemini.
Bruno's You Don't Have To - The short-form pod: sub-4-minutes of irreverent nonsense

