speech

Here are 1,626 public repositories matching this topic...

dusty-nv / NanoLLM

Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.

speech multimodal rag edge-ai vector-database vision-transformer llm-inference

Updated Jun 6, 2024
Python

huggingface / datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

nlp machine-learning natural-language-processing computer-vision deep-learning tensorflow numpy speech pandas pytorch datasets hacktoberfest

Updated Jun 6, 2024
Python

praat / praat

Praat: Doing Phonetics By Computer

speech phonetics acoustics speech-analysis

Updated Jun 6, 2024
C

IAHispano / Applio

VITS-based Voice Conversion focused on simplicity, quality and performance.

text-to-speech ai voice speech pytorch rvc voice-conversion vc voice-cloning speech-to-speech vits voice-clone applio

Updated Jun 6, 2024
Python

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

python nlp science machine-learning deep-learning cv speech multi-modal

Updated Jun 6, 2024
Python

metavoiceio / metavoice-src

Foundational model for human-like, expressive TTS

text-to-speech ai deep-learning speech pytorch tts speech-synthesis voice-clone zero-shot-tts

Updated Jun 6, 2024
Python

hanifabd / voice-activity-detection-vad-realtime

Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription

machine-learning websockets voice speech speech-recognition speech-to-text speech-processing web-service voice-assistant voice-bot live-transcript realtime-transcribe

Updated Jun 6, 2024
Python

ictnlp / StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Updated Jun 6, 2024
Python

speechsuper / SpeechSuper-API-Samples

Deep learning based speech and pronunciation assessment API for 8 languages.

learning recognition study german japanese speech assessment english speech-recognition spanish chinese russian korean french eval speechassessment speech-assessment

Updated Jun 6, 2024
C#

Aktyn / aktyn-assistant

General purpose assistant powered by OpenAI API

nodejs api recognition voice speech desktop assistant openai synthesis chat-bot assistants chatgpt

Updated Jun 6, 2024
TypeScript

grecosalvatore / drift-lens

Drift-Lens: an Unsupervised Drift Detection Framework for Deep Learning Classifiers on Unstructured Data

nlp computer-vision deep-learning speech concept-drift mlops drift-detection data-drift unsupervised-drift-detection

Updated Jun 6, 2024
Jupyter Notebook

balisujohn / tortoise.cpp

A ggml (C++) re-implementation of tortoise-tts. Under construction and seeking contributors.

text-to-speech text speech tts to tortoise-tts ggml

Updated Jun 6, 2024
C++

HumeAI / hume-python-sdk

Python client for Hume AI APIs

audio sdk recognition ai analysis detection voice sentiment speech emotion expression face hume

Updated Jun 6, 2024
Python

pytorch / audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

audio python machine-learning speech pytorch io audio-processing

Updated Jun 6, 2024
Python

wyy511511 / Chinese-Phonetic-Dictionary-Dataset

Chinese Phonetic Dataset with Homophone Clustering

audio python speech audio-visualizer chinese audio-classification audio-processing

Updated Jun 6, 2024
HTML

avinashkranjan / Amazing-Python-Scripts

🚀 Curated collection of amazing Python scripts, from basic to advanced, including task-automation scripts.

python machine-learning projects speech artificial-intelligence webcam python-scripts hacktoberfest python-projects

Updated Jun 6, 2024
Jupyter Notebook

voidful / Codec-SUPERB

Audio Codec Speech processing Universal PERformance Benchmark

audio speech codec audio-codec superb

Updated Jun 6, 2024
Python

mishra-ankit / modi-speeches

Dataset of Narendra Modi speeches released to encourage research and analysis

politics speech dataset india politicians modi

Updated Jun 6, 2024
JavaScript

OvidijusParsiunas / deep-chat

Fully customizable AI chatbot component for your website

react chat files angular image ai component vue solid nextjs chatbot speech svelte openai cohere huggingface ai-chatbot react-chatbot chatgpt

Updated Jun 5, 2024
TypeScript

tensorflow / lingvo

Lingvo

nlp research translation tensorflow machine-translation speech distributed tts speech-synthesis mnist speech-recognition lm seq2seq speech-to-text gpu-computing language-model asr

Updated Jun 5, 2024
Python
