whisper-model

Here are 14 public repositories matching this topic...

shhossain / BanglaSpeech2Text

BanglaSpeech2Text: An open-source offline speech-to-text package for Bangla language. Fine-tuned on the latest whisper speech to text model for optimal performance.

machine-learning deep-learning speech pytorch transformer voice-recognition speech-recognition bangla speech-to-text hacktoberfest whisper bangla-asr bangla-speech-recognition bangla-speech-to-text bangla-automatic-speech-recognition whisper-model bangla-voice-recognition

Updated Mar 1, 2025
Python

jim-schwoebel / nala_assistant

Star

🔊😊 A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.

Updated Jan 15, 2024
JavaScript

hemangjoshi37a / French_audio_transcription_using_gradio

Star

French audio transcription using gradio

machine-learning speech-recognition gradio audio-processing french-language audio-transcription audio-to-text transcription-tool whisper-model french-audio-transcription

Updated Sep 22, 2024
Jupyter Notebook

krithicswaroopan / AI-Voice-Assistance-Pipeline

Star

A real-time voice-to-text and text-to-speech AI pipeline using Whisper, an LLM, and Edge-TTS with tunable parameters for low-latency audio processing and response generation.

python natural-language-processing text-to-speech speech-recognition speech-to-text real-time-processing conversational-ai voice-activity-detection ai-ml hugging-face-transformers large-language-models whisper-model edge-tts

Updated Sep 24, 2024
Python

franckferman / Whisper_Transcriber

Star

📝 Turn audio into text effortlessly. Audio transcription powered by OpenAI's Whisper API.

Updated Mar 15, 2025
Python

dvorobiev / subtitles_project

Star

Subtitles Generator: Автоматический генератор субтитров для видео с поддержкой перевода на различные языки, использующий модель Whisper от OpenAI.

python machine-learning subtitles video-processing audio-transcription whisper-model

Updated Mar 19, 2025
Python

Avinraj01 / SHL-Grammar-Scoring-Engine-for-Voice-Samples

Star

This model predicts grammar scores (1–5) from audio files. It uses Whisper to transcribe speech to text, cleans the text, and extracts features with TF-IDF. A Random Forest Regressor is trained to learn grammar score patterns. Evaluation via Pearson Correlation showed good results.

machine-learning random-forest speech-recognition tf-idf nlp-machine-learning model-evaluation pearson-correlation text-preprocessing regression-model audio-to-text whisper-model grammar-scoring submission-pipeline

Updated Apr 21, 2025
Jupyter Notebook

Xza85hrf / Whisper-Subtitle-Generator

Star

The Whisper Subtitle Generator leverages OpenAI's Whisper model to generate subtitles from audio and video files. This Python-based tool supports multiple languages and employs advanced audio processing techniques to ensure high accuracy in transcription.

python ffmpeg speech-recognition openai gpu-acceleration noise-reduction audio-processing subtitle-generator audio-to-text video-subtitles transcription-tool whisper-model multilingual-transcription srt-output vtt-output

Updated Apr 23, 2024
Python

seccanj / generate-subtitle-llm

Star

Generates subtitles from a video speech (Whisper OpenAI LLM) or extracts existing subtitles, translates them into a different language using Mistral LLM and adds them to the video. Uses ffmpeg for extracting and encoding

machine-learning video ai ffmpeg python3 video-processing subtitles-generator llms whisper-model mistral-7b subtitles-translator mistral-ai

Updated Jan 28, 2025
Python

otonomee / youtube-to-transcript

Star

Convert YouTube videos to text files. Why spend 30 minutes watching a video when you can skim the transcript in a couple minutes?

python machine-learning openai youtube-downloader speech-to-text transcription pytube video-to-text audio-transcription whisper-model

Updated Jul 30, 2024
Python

sushant1827 / CrewAI-Agents-MinutesOfMeeting-Gmail

Star

MinutesOfMeeting and Gmail is a collaborative crew of AI agents that autonomously understand audio, transcripts, summarizes, writes and drafts an email in Gmail account.