Online Kaldi Decoding

Posted by in Speech Recognition

Thanks to this marvelous framework, a trained model is now available with a WER of zero percent on a 10-minute continuous speech file. The final piece of this puzzle is implementing a semi-online decoding tool using GStreamer. As always, useful links for further inspection: GStreamer – Dynamic pipelines. Functions that save lives!…
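For context on the WER figure: word error rate is usually defined as the number of word substitutions, deletions, and insertions needed to turn the hypothesis into the reference, divided by the number of reference words. Below is a minimal Python sketch of that definition. It is not Kaldi's own scoring tool, and the function name `word_error_rate` is just an illustrative choice, not something from the post.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    print(word_error_rate("the cat sat", "the cat sat"))
    print(word_error_rate("the cat sat", "the cat sit"))
```

A WER of zero means the decoded transcript matches the reference word for word. Note that WER can exceed 1.0 when the hypothesis contains many inserted words.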


HaLseY and TaLoN!

Posted by in Speech Recognition

So the third year has passed. Halsey was the one who pushed me to never fucking give up. Over the last year I mostly killed time developing a handful of hardware projects. Meanwhile, I tried to loosen up and solve some PDE and electromagnetic problems; silly me, but it helped…


Lost in the Vast Ocean of Speech Recognition

Posted by in Speech Recognition

Here I am, once more pursuing old-fashioned machine learning. I’ll keep it short and write down useful links. Books: Dan Povey – HTK Book; Ian Goodfellow – Deep Learning. Papers: IEEE – Uncertainty Decoding with SPLICE for Noise Robust Speech Recognition. YouTube: Hannes van Lier – Basic Introduction to Speech Recognition (HMM & Neural…

