Multimodel deepfake detection
Updated Jul 21, 2025 - Jupyter Notebook
This code uses well-known datasets, such as the Toronto dataset available on Kaggle, to build a sentiment analysis model from human voice. The model is based on HuBERT, a BERT-style self-supervised speech model.
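A minimal sketch of what such a HuBERT-based voice classifier can look like with Hugging Face transformers. The checkpoint name and the seven-class label count are illustrative assumptions, not details taken from this repository:

```python
# Illustrative sketch, not this repository's code. The checkpoint and the
# label count (7 emotion classes) are assumptions; adjust to the dataset.
import torch
from transformers import HubertForSequenceClassification, Wav2Vec2FeatureExtractor

feature_extractor = Wav2Vec2FeatureExtractor.from_pretrained("facebook/hubert-base-ls960")
model = HubertForSequenceClassification.from_pretrained(
    "facebook/hubert-base-ls960",
    num_labels=7,  # classification head is freshly initialized and needs fine-tuning
)

waveform = torch.randn(16000)  # stand-in for 1 second of 16 kHz speech
inputs = feature_extractor(waveform.numpy(), sampling_rate=16000, return_tensors="pt")
with torch.no_grad():
    logits = model(input_values=inputs.input_values).logits
predicted_emotion = logits.argmax(dim=-1).item()
```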
Tone.me helps users improve their Mandarin pronunciation.
A multimodal model that takes text, audio, and video to predict turn-taking; that is, whether the speaker in a discussion is about to change.
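One common way to combine three modalities for a prediction like this is late fusion of per-modality embeddings. The sketch below is a generic illustration under assumed embedding sizes, not this repository's architecture:

```python
# Generic late-fusion sketch, assuming precomputed per-modality embeddings;
# all dimensions and names are illustrative assumptions.
import torch
import torch.nn as nn

class TurnTakingFusion(nn.Module):
    def __init__(self, text_dim=768, audio_dim=768, video_dim=512, hidden=256):
        super().__init__()
        self.classifier = nn.Sequential(
            nn.Linear(text_dim + audio_dim + video_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, 1),  # one logit: will the speaker change?
        )

    def forward(self, text_emb, audio_emb, video_emb):
        fused = torch.cat([text_emb, audio_emb, video_emb], dim=-1)
        return self.classifier(fused)  # raw logit; apply sigmoid for a probability

model = TurnTakingFusion()
logit = model(torch.randn(1, 768), torch.randn(1, 768), torch.randn(1, 512))
prob_turn_change = torch.sigmoid(logit)
```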
Generates section-wise topics and transcriptions for lecture videos, and controls lecture-video playback based on the generated topic-wise timestamps.
PyTorch implementation of MixCap, a multimodal video captioning model fusing BLIP-2 (visual) and Wav2Vec2 (audio). Features a novel Dual-Target MixUp strategy for low-resource training.
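For readers unfamiliar with MixUp: in its standard classification form, two inputs are interpolated and the loss is computed against both of their targets. The sketch below shows only that generic formulation; how MixCap's Dual-Target variant applies it to caption sequences is not shown, and the function and names here are assumptions:

```python
# Generic MixUp with a dual-target loss (the standard formulation; assumed to
# be related to, but not necessarily identical to, MixCap's strategy).
import torch
import torch.nn.functional as F
from torch.distributions import Beta

def mixup_step(model, x_a, x_b, y_a, y_b, alpha=0.4):
    lam = Beta(alpha, alpha).sample().item()  # mixing coefficient in (0, 1)
    x_mixed = lam * x_a + (1 - lam) * x_b     # interpolate the two inputs
    logits = model(x_mixed)
    # Weighted loss against both original targets
    return lam * F.cross_entropy(logits, y_a) + (1 - lam) * F.cross_entropy(logits, y_b)

model = torch.nn.Linear(16, 4)  # stand-in classifier for the demo
x_a, x_b = torch.randn(8, 16), torch.randn(8, 16)
y_a, y_b = torch.randint(0, 4, (8,)), torch.randint(0, 4, (8,))
loss = mixup_step(model, x_a, x_b, y_a, y_b)
loss.backward()
```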
Low-resource multimodal hate speech detection leveraging acoustic and textual representations for robust moderation in Telugu.
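A typical first step in such acoustic-plus-textual pipelines is extracting utterance-level embeddings with a pretrained Wav2Vec2 encoder. The checkpoint and the mean-pooling choice below are assumptions for illustration, not details from the repository:

```python
# Illustrative acoustic-embedding extraction with Wav2Vec2; checkpoint and
# pooling are assumptions, not taken from the repository.
import torch
from transformers import Wav2Vec2FeatureExtractor, Wav2Vec2Model

extractor = Wav2Vec2FeatureExtractor.from_pretrained("facebook/wav2vec2-base")
encoder = Wav2Vec2Model.from_pretrained("facebook/wav2vec2-base")

waveform = torch.randn(16000)  # stand-in for 1 second of 16 kHz speech
inputs = extractor(waveform.numpy(), sampling_rate=16000, return_tensors="pt")
with torch.no_grad():
    hidden = encoder(input_values=inputs.input_values).last_hidden_state  # (1, T, 768)
acoustic_emb = hidden.mean(dim=1)  # mean-pool over time -> one utterance vector
```

The resulting utterance vector could then be concatenated with a text embedding for classification, along the lines of the fusion sketch above.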