Hugging Face Audio coursework
Updated Sep 7, 2023 - Jupyter Notebook
Created an ASR (Automatic Speech Recognition) system that takes in individual recordings. Each recording is a sentence of 5-10 spoken English digits separated by clear pauses. The system segments the sentence using a classifier that distinguishes foreground speech from background sound.
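The foreground/background segmentation step described above can be sketched as a simple frame-energy classifier. This is an illustrative approach, not the repository's actual code; the function name and the `threshold_ratio` parameter are assumptions for the example:

```python
import numpy as np

def segment_foreground(signal, frame_len=400, threshold_ratio=0.1):
    """Split a 1-D audio signal into foreground segments by frame energy.

    Frames whose RMS energy exceeds threshold_ratio * (max frame RMS)
    are labeled foreground; runs of consecutive foreground frames are
    merged into (start_sample, end_sample) segments.
    """
    n_frames = len(signal) // frame_len
    frames = signal[:n_frames * frame_len].reshape(n_frames, frame_len)
    rms = np.sqrt((frames ** 2).mean(axis=1))
    active = rms > threshold_ratio * rms.max()

    segments, start = [], None
    for i, is_active in enumerate(active):
        if is_active and start is None:
            start = i                     # a new foreground run begins
        elif not is_active and start is not None:
            segments.append((start * frame_len, i * frame_len))
            start = None                  # the run ended at this frame
    if start is not None:                 # signal ended mid-run
        segments.append((start * frame_len, n_frames * frame_len))
    return segments
```

Each returned segment would then be passed to the digit recognizer on its own; a real system would typically smooth the frame decisions (e.g. with a minimum-duration rule) rather than thresholding raw energy.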
Whisper Transcription Service
ASR course past paper revision work for the University of Edinburgh
A compilation of libraries, case studies, resources, and research papers revolving around deep learning/machine learning for audio
Speech Recording Tool
Guttural and scream automatic speech recognition (ASR) system using a fine-tuned version of OpenAI's Whisper model
Baidu TTS (Text-To-Speech) and ASR (Automatic Speech Recognition) demo for PC
Timestamped ASR microservice
CMUSphinx Website
[UAI 2024 paper] DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution.
Trained Transformer model for Speech Recognition
Comparator of WER, MER, and WIL scores for the Whisper, Vosk, and Google transcription services
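Of the three metrics in the entry above, WER (Word Error Rate) is the most common: the word-level edit distance between reference and hypothesis, divided by the reference length. A minimal sketch (not the comparator's actual code; libraries such as jiwer compute this, along with MER and WIL, in practice):

```python
def wer(reference, hypothesis):
    """Word Error Rate = (substitutions + deletions + insertions) / len(reference),
    computed via word-level Levenshtein distance with dynamic programming."""
    ref, hyp = reference.split(), hypothesis.split()
    # d[i][j] = edit distance between ref[:i] and hyp[:j]
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i                       # delete all i reference words
    for j in range(len(hyp) + 1):
        d[0][j] = j                       # insert all j hypothesis words
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,         # deletion
                          d[i][j - 1] + 1,         # insertion
                          d[i - 1][j - 1] + cost)  # substitution (or match)
    return d[len(ref)][len(hyp)] / len(ref)
```

Note that WER can exceed 1.0 when the hypothesis contains many insertions, which is one motivation for MER (Match Error Rate), a variant bounded to [0, 1].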
Different Task Guides for Audio Data
Bangla Automatic Speech Recognition
Supplementary files for the sequential routing framework
Text-To-Speech-Text (TTST) turns written text into spoken words, reading any input aloud to promote accessibility. English-to-target-language TTST helps localize computer applications and improve user understanding.
Material for my lecture on Automatic Speech Recognition
This project builds Automatic Speech Recognition (ASR, or voice recognition) using the pretrained Whisper and Wav2Vec2 models, from the Indonesia AI NLP Bootcamp.