Abstract: End-to-end automatic speech recognition (ASR) systems have gained popularity given ... we propose the factorized attention-based encoder-decoder (Factorized AED) model whose decoder takes as ...
The project focuses on building an automatic speech recognition (ASR) system that ... is to implement a Transformer-based model for speech-to-text transcription, following an encoder-decoder ...
BiGRU encoder + attention decoder, based on "Listen, Attend and Spell" [1]. The acoustic features are 80-dimensional filter banks. They are stacked every 3 consecutive frames, so the time resolution is ...
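
The frame stacking described in this snippet can be illustrated with a minimal sketch. It assumes the filter-bank features arrive as a NumPy array of shape (frames, 80) and that trailing frames that do not fill a complete group of 3 are dropped; the function name `stack_frames` and the random input are hypothetical and are not taken from the original project.

```python
import numpy as np


def stack_frames(features: np.ndarray, n_stack: int = 3) -> np.ndarray:
    """Concatenate every n_stack consecutive frames into one wider frame.

    Assumption: leftover frames at the end (fewer than n_stack) are dropped.
    """
    num_frames, feat_dim = features.shape
    usable = (num_frames // n_stack) * n_stack
    # Row-major reshape places frames 3i, 3i+1, 3i+2 side by side in row i.
    return features[:usable].reshape(usable // n_stack, n_stack * feat_dim)


# Hypothetical input: 100 frames of 80-dimensional filter-bank features.
fbank = np.random.default_rng(0).standard_normal((100, 80))
stacked = stack_frames(fbank)
print(stacked.shape)  # (33, 240)
```

Each output row is three times as wide as an input frame, and the sequence seen by the encoder is about three times shorter, which is the usual motivation for stacking before a recurrent encoder.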
To fill this gap, we propose two attention-mechanism-based encoder–decoder models that incorporate multisource information: one is MAEDDI, which can predict drug–drug interactions (DDIs), and the other is MAEDDIE, which can ...
Overall, the work’s decomposition of hidden states into input- and time-independent components provides novel and valuable insights into the inner workings of attention-based encoder-decoder networks.