News
However, the challenge is the data limitation and modality gap between text and image. In this paper, we propose a novel end-to-end model, namely Modal contrastive learning based End-to-end Text Image ...
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow ... The final architecture is shown in the following figure. (Some images are ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results