grammar in Kaldi #1534

Open
YangangCao opened this issue Mar 7, 2024 · 2 comments

Comments

@YangangCao

YangangCao commented Mar 7, 2024

Hi, dear author, setting a grammar in Vosk is very useful, so I copied UpdateGrammarFst() into Kaldi and tested it with an open-source chain model (http://kaldi-asr.org/models/m13). The results are bad: background noise is recognized as words, and some extra words are inserted. I set faster_decode_opts_.beam = 1000, and I also tried other decoders and models, with equally bad results. For example:

The speaker said "magnets can be found on a can opener".
start end
WORDS: 0.02 0.03
WORDS: 0.12 0.24 can
WORDS: 0.24 0.52
WORDS: 1.4 2.06 magnets
WORDS: 2.06 2.33 can
WORDS: 2.33 2.45 be
WORDS: 2.45 2.5 a
WORDS: 2.5 2.93 found
WORDS: 3 3.19 on
WORDS: 3.19 3.26 a
WORDS: 3.26 3.64 can
WORDS: 3.64 4.2 opener

But I get the correct result in Vosk. Is there any way to make the model less sensitive?

Or, if setting a grammar is already supported in Kaldi, could you please give me some tips? Thanks!

@nshmyrev
Collaborator

nshmyrev commented Mar 7, 2024

Feels like you have the wrong self-loop-scale, and probably the wrong acoustic weight as well.
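
For readers wondering why these scales matter: a Kaldi decoder ranks paths by graph cost plus the negated, scaled acoustic log-likelihood, so a mismatched acoustic scale can change which hypothesis wins. Kaldi recipes conventionally use an acoustic scale and self-loop scale of 1.0 for chain models, compared with 0.1 for older model types. The sketch below only illustrates the arithmetic. The function names and the cost values are hypothetical, chosen for illustration, and are not taken from Kaldi, Vosk, or the audio in this issue.

```python
# Illustrative only: names and numbers are assumptions, not decoder output.

def total_cost(graph_cost: float, acoustic_loglike: float, acoustic_scale: float) -> float:
    """Combined path cost: graph cost plus the negated, scaled acoustic log-likelihood."""
    return graph_cost - acoustic_scale * acoustic_loglike


def best_hypothesis(hyps: dict, acoustic_scale: float) -> str:
    """Return the name of the lowest-cost hypothesis for the given acoustic scale."""
    return min(hyps, key=lambda name: total_cost(*hyps[name], acoustic_scale))


# Hypothetical (graph_cost, acoustic_loglike) pairs for one noisy stretch of audio.
HYPS = {
    "silence": (1.0, -12.0),  # cheap in the graph, slightly worse acoustic fit
    "can": (4.0, -10.0),      # costlier in the graph, slightly better acoustic fit
}

if __name__ == "__main__":
    for scale in (0.1, 1.0, 10.0):
        print(scale, best_hypothesis(HYPS, scale))
```

With these made-up values the preferred hypothesis flips from "silence" to "can" once the acoustic scale is large enough, which is the kind of sensitivity a scale mismatch between graph and decoder could introduce.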

@YangangCao
Author

I think I found the solution: the problem was that I wasn't using lookahead. #1509
