gologo13の日記

speech

memo speech paper research

IEEE Xplore Abstract - Automatic lecture transcription by exploiting presentation slide information for language model adap...

講義音声認識
講義全体のトピックを適応させたPLSAモデル*1（グローバルに適応）
- satomacoto: PythonでPLSAを実装してみる
各スライドの内容語に適応させたキャッシュモデル（ローカルに適応）
PLSAとキャッシュモデルの線形補完した言語モデルを用いると単語正解精度が向上
この方法はリアルタイムで認識OK

IEEE Xplore Abstract - Language modeling and transcription of the TED corpus lectures

IEEE Xplore Abstract - Statistical Transformation of Language and Pronunciation Models for Spontaneous Speech Recognition

IEEE Xplore Abstract - Language model and speaking rate adaptation for spontaneous presentation speech recognition

*1:トピックモデルで思い出したけど，LDAとかそこらへんも勉強したい