KeywordsSpotting-EfficientNet-A0

Keyword spotting in continuous speech using convolutional neural network

This is a PyTorch implementation of several popular CNN architectures for keyword spotting, including Deep Residual Models, Convolutional Neural Networks for Keyword Spotting, and our proposed architecture based on EfficientNet. All models are trained on our new Persian Keyword Spotting Dataset, which you can download from Football Keywords Dataset. For more details, please see our paper, Keyword spotting in continuous speech using convolutional neural network / DOI.

This repository is based on the Honk repository. Honk models are trained on the Speech Commands Dataset and can identify simple commands (e.g., "stop" and "go"). Our work adds the following improvements and advantages:

We use a modified version of EfficientNet, a state-of-the-art image classification architecture, as the base model.
Improved performance in "continuous speech" mode through our proposed continuous speech synthesis method.
Improved robustness to noise in real samples by training with various noise types, such as babble and stadium noise.
Better generalization by using SpecAugment.
A new Persian Keyword Spotting Dataset, which let us use this project in real scenarios and projects.
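
The noise and SpecAugment items above can be illustrated with a minimal sketch. It is not the code used in this repository: the function names `mix_at_snr` and `spec_augment`, their parameters, and the use of plain Python lists instead of PyTorch tensors are all assumptions made to keep the example dependency-free. The first function adds a noise recording to a speech clip at a chosen signal-to-noise ratio; the second applies only the masking part of SpecAugment (one frequency mask and one time mask) and omits time warping.

```python
import math
import random


def mix_at_snr(speech, noise, snr_db):
    """Add `noise` to `speech` (lists of float samples) at the requested SNR in dB."""
    # Loop the noise if it is shorter than the speech, then trim it to length.
    repeats = len(speech) // len(noise) + 1
    noise = (noise * repeats)[:len(speech)]
    speech_power = sum(s * s for s in speech) / len(speech)
    noise_power = sum(n * n for n in noise) / len(noise)
    if noise_power == 0.0:
        return list(speech)  # silent noise: nothing to add
    # Scale the noise so that speech_power / scaled_noise_power == 10^(snr_db / 10).
    scale = math.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return [s + scale * n for s, n in zip(speech, noise)]


def spec_augment(spec, max_freq_width, max_time_width, rng=random):
    """Zero one random frequency band and one random time span of `spec`.

    `spec` is a list of rows (frequency bins), each a list of time frames.
    The mask widths must not exceed the spectrogram dimensions.
    """
    n_freq, n_time = len(spec), len(spec[0])
    out = [row[:] for row in spec]  # copy so the input stays untouched

    # Frequency mask: blank `f_width` consecutive frequency bins.
    f_width = rng.randint(0, max_freq_width)
    f_start = rng.randint(0, n_freq - f_width)
    for f in range(f_start, f_start + f_width):
        out[f] = [0.0] * n_time

    # Time mask: blank `t_width` consecutive frames in every bin.
    t_width = rng.randint(0, max_time_width)
    t_start = rng.randint(0, n_time - t_width)
    for row in out:
        row[t_start:t_start + t_width] = [0.0] * t_width
    return out
```

In a training pipeline, `mix_at_snr` would typically run on the raw waveform before feature extraction and `spec_augment` on the resulting spectrogram, each with freshly sampled parameters per example.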

Demo Application

Use the instructions below to run the demo application (shown in the above video) yourself! Currently, PyTorch has official support for only Linux and OS X. Thus, Windows users will not be able to run this demo easily.

To deploy the demo, run the following commands:

Change directory to the KSM repository.
If you do not have PyTorch installed, please see the PyTorch website for installation instructions.
Install Python dependencies: pip install -r requirements.txt
Start the PyTorch server: python .
Run the demo: python -m utils.speech_demo_tk

If you need to adjust options, such as turning off CUDA or changing the trained model file, please edit config.json.
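
If you prefer to change options from a script rather than by hand, a small helper along the following lines could work. This is only a sketch: `update_config` is a hypothetical helper, not part of this repository, and it only replaces top-level keys. Any key names you pass to it must match the ones that actually appear in your config.json.

```python
import json
from pathlib import Path


def update_config(path, overrides):
    """Load a JSON config file, apply top-level `overrides`, and write it back."""
    config_path = Path(path)
    config = json.loads(config_path.read_text(encoding="utf-8"))
    # Shallow update: nested sections are replaced wholesale, not merged.
    config.update(overrides)
    config_path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```

For example, `update_config("config.json", {"no_cuda": True})` would disable CUDA only if the file really uses a `no_cuda` key; that key name is an assumption here, so check config.json first.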

Pre-trained Models

We will release the KSM trained models in our repository as soon as possible. There are several pre-trained models for PyTorch.

Contact Us

Feel free to contact us for any further information via the channels below.

Amirmohammad Rostami

Ali Karimi

Mohammad Ali Akhaee