🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
-
Updated
May 29, 2024 - Python
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
🧠 Leon is your open-source personal assistant.
Fully Functional Voice Based Natural Language UI
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
Swift native on-device speech recognition with Whisper for Apple Silicon
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
End-to-End Speech Processing Toolkit
这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
🗣 Discord voice-chat speech recognition
Clone of speakworldlanguages app to be served to users who can read Kishi language a.k.a. Turkish [Türkçe]
Clone of the speakworldlanguages app for users from Hitoland (Hitoland=Japan)
Progressive Web Application that teaches you world languages. Development is in progress.
Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition.
Smart Calculator enabled with Speech Recognition System.
Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."