# speech-recognition

Here are 4,624 public repositories matching this topic...

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.

Updated May 29, 2024
Python

openvinotoolkit / openvino

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference

nlp natural-language-processing ai computer-vision deep-learning transformers inference speech-recognition yolo recommendation-system performance-boost good-first-issue openvino diffusion-models stable-diffusion generative-ai llm-inference optimize-ai deploy-ai

Updated May 29, 2024
C++

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated May 29, 2024
Python

leon-ai / leon

🧠 Leon is your open-source personal assistant.

Updated May 29, 2024
Python

inworld-ai / inworld-web-sdk

Web SDK for Inworld.ai. Integrate AI characters into your browser.

ai character tts speech-recognition npc asr

Updated May 29, 2024
TypeScript

thevickypedia / Jarvis

Fully Functional Voice Based Natural Language UI

Updated May 29, 2024
Python

alibaba-damo-academy / FunClip

Open-source, accurate, and easy-to-use video speech recognition and clipping tool with integrated LLM-based AI clipping.

speech-recognition speech-to-text gradio video-clip subtitles-generator video-subtitles llm gradio-python-llm

Updated May 29, 2024
Python

AleferReinert / nlw-expert-notes

speech-recognition rocketseat nlw-expert

Updated May 29, 2024
TypeScript

argmaxinc / WhisperKit

Swift native on-device speech recognition with Whisper for Apple Silicon

macos swift ios watchos transformers inference speech-recognition pretrained-models whisper visionos

Updated May 29, 2024
Swift

DmitryRyumin / ICASSP-2023-24-Papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated May 29, 2024
Python

espnet / espnet

End-to-End Speech Processing Toolkit

deep-learning chainer end-to-end machine-translation pytorch speech-synthesis speech-recognition kaldi voice-conversion speaker-diarization speech-separation speech-enhancement spoken-language-understanding speech-translation singing-voice-synthesis

Updated May 29, 2024
Python

Chenyme / Chenyme-AAVT

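A fully automatic (audio) video translation project. It uses Whisper to recognize speech, a large AI model to translate the subtitles, and then merges the subtitles into the video to produce the translated video.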

speech-recognition whisper video-translation gpt-4 faster-whisper gpt-4o

Updated May 29, 2024
Python

savbell / whisper-writer

💬📝 A small dictation app using OpenAI's Whisper speech recognition model.

speech-recognition openai speech-to-text dictation whisper typing-assistant openai-api openai-whisper faster-whisper

Updated May 28, 2024
Python

jaoafa / ChatWatcher

🗣 Discord voice-chat speech recognition

discord-bot voice-recognition speech-recognition discord-voice

Updated May 28, 2024
Java

piaseckijulian / Sentinel

🚀AI Voice Chatbot

ai sentinel speech-recognition

Updated May 28, 2024
Python

speakworldlanguages / birdildahakonus

Clone of speakworldlanguages app to be served to users who can read Kishi language a.k.a. Turkish [Türkçe]

progressive-web-app speech-recognition javascript-game voice-control educational-software

Updated May 28, 2024
JavaScript

speakworldlanguages / hanaserutoiiyone

Clone of the speakworldlanguages app for users from Hitoland (Hitoland=Japan)

progressive-web-app speech-recognition javascript-game voice-control educational-software

Updated May 28, 2024
JavaScript

speakworldlanguages / speakworldlanguages.github.io

Progressive Web Application that teaches you world languages. Development is in progress.

progressive-web-app speech-recognition javascript-game voice-control educational-software

Updated May 28, 2024
JavaScript

semperai / amica

Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition.

ai computer-vision tts speech-recognition assistant-chat-bots llm

Updated May 29, 2024
TypeScript

Mahak008 / Smart-Calculator

Smart Calculator enabled with Speech Recognition System.

python pygame speech-recognition tkinter

Updated May 28, 2024
Python
