Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
-
Updated
Oct 19, 2023
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Main objective of this model is to develop Automatic Speech Recognition using Deep Neural Network.
A simple CRDNN based ASR model for my own understanding of how ASR works and are trained. (Work in progress) If anyone finds any error or have any suggestion please do let me know.
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Extract mfcc vectors and phones from TIMIT dataset
Speaker verification using Gaussian Mixture Model (GMM)
The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
Sum-Product Networks (SPNs) for Robust Automatic Speaker Identification.
End-to-end ASR system on TIMIT
Keyword spotting using RNNs + Edit distance
Build speech enhancement dataset.
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Python implementation of pre-processing for End-to-End speech recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Add a description, image, and links to the timit-dataset topic page so that developers can more easily learn about it.
To associate your repository with the timit-dataset topic, visit your repo's landing page and select "manage topics."