Speech Processing
-
- CONFERENCE (INTERNATIONAL)
- Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models
- Tianzi Wang (Johns Hopkins University), Yuya Fujita, Xuankai Chang (Carnegie Mellon University), Shinji Watanabe (Carnegie Mellon University)
- INTERSPEECH 2021
- September 02, 2021
-
- CONFERENCE (INTERNATIONAL)
- Toward Streaming ASR with Non-Autoregressive Insertion-based Model
- Yuya Fujita, Tianzi Wang (Johns Hopkins University), Shinji Watanabe (Carnegie Mellon University), Motoi Omachi
- INTERSPEECH 2021
- September 02, 2021
-
- CONFERENCE (INTERNATIONAL)
- Acoustic Event Detection with Classifier Chains
- Tatsuya Komatsu, Shinji Watanabe (Carnegie Mellon University), Koichi Miyazaki (Nagoya University), Tomoki Hayashi (Human Dataware Lab.)
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- August 30, 2021
-
- CONFERENCE (INTERNATIONAL)
- Efficient and Stable Adversarial Learning Using Unpaired Data for Unsupervised Multichannel Speech Separation
- Yu Nakagome (Waseda University), Masahito Togami, Tetsuji Ogawa (Waseda University), Tetsunori Kobayashi (Waseda University)
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- August 30, 2021
-
- CONFERENCE (INTERNATIONAL)
- High-fidelity Parallel WaveGAN with Multi-band Harmonic-plus-Noise Model
- Min-Jae Hwang (Search Solutions Inc), Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER)
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- August 30, 2021
-
- CONFERENCE (INTERNATIONAL)
- Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis
- Kosuke Futamata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- August 30, 2021
-
- CONFERENCE (INTERNATIONAL)
- Relaxing the Conditional Independence Assumption of CTC-based ASR by Conditioning on Intermediate Predictions
- Jumon Nozaki (Kyoto University), Tatsuya Komatsu
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- August 30, 2021
-
- CONFERENCE (INTERNATIONAL)
- Sound Source Localization with Majorization Minimization
- Masahito Togami, Robin Scheibler
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- August 30, 2021
-
- CONFERENCE (INTERNATIONAL)
- Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021
- Takashi Maekaku, Xuankai Chang (Carnegie Mellon University), Yuya Fujita, Li, Shinji Watanabe (Carnegie Mellon University), Alexander Rudnicky (Carnegie Mellon University)
- INTERSPEECH 2021
- August 30, 2021
-
- CONFERENCE (INTERNATIONAL)
- Multi-Source Domain Adaptation with Sinkhorn Barycenter
- Tatsuya Komatsu, Tomoko Matsui (The Institute of Statistical Mathematics), Junbin Gao (The University of Sydney)
- 29th European Signal Processing Conference (EUSIPCO 2021)
- August 23, 2021
-
- CONFERENCE (INTERNATIONAL)
- Multichannel Separation and Classification of Sound Events
- Robin Scheibler, Tatsuya Komatsu, Masahito Togami
- 29th European Signal Processing Conference (EUSIPCO 2021)
- August 23, 2021
-
- OTHERS (INTERNATIONAL)
- Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models
- Tianzi Wang (Johns Hopkins Univ.), Yuya Fujita, Xuankai Chang (Carnegie Mellon Univ.), Shinji Watanabe (Carnegie Mellon Univ.)
- arXiv.org
- July 20, 2021
-
- OTHERS (INTERNATIONAL)
- Toward Streaming ASR with Non-Autoregressive Insertion-based Model
- Yuya Fujita, Tianzi Wang (Johns Hopkins Univ.), Shinji Watanabe (Carnegie Mellon Univ.), Motoi Omachi
- arXiv.org
- July 16, 2021
-
- WORKSHOP (INTERNATIONAL)
- Improved Parallel WaveGAN with perceptually weighted spectrogram loss
- Eunwoo Song (NAVER), Ryuichi Yamamoto, Min-Jae Hwang (NAVER), Jin-Seob Kim (NAVER), Ohsung Kwon (NAVER), Jae-Min Kim (NAVER)
- 2021 IEEE Spoken Language Technology Workshop (SLT 2021)
- June 19, 2021
-
- CONFERENCE (INTERNATIONAL)
- Disentangled Speaker and Language Representations Using Mutual Information Minimization and Domain Adaptation for Cross-Lingual TTS
- Detai Xin (The University of Tokyo), Tatsuya Komatsu, Shinnosuke Takamichi (The University of Tokyo), Hiroshi Saruwatari (The University of Tokyo)
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- June 06, 2021
-
- CONFERENCE (INTERNATIONAL)
- End To End Learning For Convolutive Multi-Channel Wiener Filtering
- Masahito Togami
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- June 06, 2021
-
- CONFERENCE (INTERNATIONAL)
- End-to-end ASR to jointly predict transcriptions and linguistic annotations
- Motoi Omachi, Yuya Fujita, Shinji Watanabe (Johns Hopkins University), Matthew Wiesner (Johns Hopkins University)
- The 2021 North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2021)
- June 06, 2021
-
- CONFERENCE (INTERNATIONAL)
- Joint Dereverberation and Separation With Iterative Source Steering
- Taishi Nakashima (Tokyo Metropolitan University), Robin Scheibler, Masahito Togami, Nobutaka Ono (Tokyo Metropolitan University)
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- June 06, 2021
-
- CONFERENCE (INTERNATIONAL)
- Parallel Waveform Synthesis Based on Generative Adversarial Networks with Voicing-Aware Conditional Discriminators
- Ryuichi Yamamoto, Eunwoo Song (NAVER), Min-Jae Hwang (Search Solutions Inc), Jae-Min Kim (NAVER)
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- June 06, 2021
-
- CONFERENCE (INTERNATIONAL)
- Refinement of Direction of Arrival Estimators by Majorization-Minimization Optimization on the Array Manifold
- Robin Scheibler, Masahito Togami
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- June 06, 2021