Speech Processing
-
- CONFERENCE (INTERNATIONAL)
- Insertion-Based Modeling for End-to-End Automatic Speech Recognition
- Yuya Fujita, Shinji Watanabe (Johns Hopkins University), Motoi Omachi, Xuankai Chang (Johns Hopkins University)
- INTERSPEECH 2020
- October 25, 2020
-
- CONFERENCE (INTERNATIONAL)
- Mentoring-Reverse Mentoring for Unsupervised Multi-channel Speech Source Separation
- Yu Nakagome (Waseda University), Masahito Togami, Tetsuji Ogawa (Waseda University), Tetsunori Kobayashi (Waseda University)
- The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
- October 25, 2020
-
- CONFERENCE (INTERNATIONAL)
- Neural Text-to-Speech with a Modeling-by-Generation Excitation Vocoder
- Eunwoo Song (NAVER), Min-Jae Hwang (Search Solutions Inc.), Ryuichi Yamamoto, Jin-Seob Kim (NAVER), Ohsung Kwon (NAVER), Jae-Min Kim (NAVER)
- The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
- October 25, 2020
-
- CONFERENCE (INTERNATIONAL)
- Sparseness-Aware DOA Estimation with Majorization Minimization
- Masahito Togami, Robin Scheibler
- The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
- October 25, 2020
-
- CONFERENCE (DOMESTIC)
- 単語の表記と素性を同時出力するend-to-end音声認識
- 大町 基, 藤田 悠哉, 渡部 晋治 (Johns Hopkins University), Xuankai Chang (Johns Hopkins University)
- 日本音響学会2020年秋季研究発表会 (音響学会)
- September 11, 2020
-
- CONFERENCE (DOMESTIC)
- A Generalized Minimal Distortion Principle to Solve the Scale Ambiguity in Blind Source Separation
- シャイブラー ロビン
- 日本音響学会 2020年秋季研究発表会 (ASJ 2020 autumn)
- September 09, 2020
-
- CONFERENCE (DOMESTIC)
- Mentoring-Reverse Mentoring:多チャンネル音源分離における教師なし学習のための知識伝搬フレームワーク
- 中込 優 (早稲田大学), 戸上 真人, 小川 哲司 (早稲田大学), 小林 哲則 (早稲田大学)
- 日本音響学会 2020年秋季研究発表会 (ASJ 2020 autumn)
- September 09, 2020
-
- CONFERENCE (DOMESTIC)
- 挿入操作に基づく End-to-End 音声認識
- 藤田 悠哉, 渡部 晋治 (Johns Hopkins Univ.), 大町 基, Xuankai Chang (Johns Hopkins Univ.)
- 日本音響学会2020年秋季研究発表会 (音響学会)
- September 09, 2020
-
- OTHERS (INTERNATIONAL)
- Insertion-Based Modeling for End-to-End Automatic Speech Recognition
- Yuya Fujita, Shinji Watanabe (Johns Hopkins University), Motoi Omachi, Xuankai Chang (Johns Hopkins University)
- arXiv.org
- May 27, 2020
-
- CONFERENCE (INTERNATIONAL)
- Attention-based ASR with Lightweight and Dynamic Convolutions
- Yuya Fujita, Aswin Shanmugam Subramanian (Johns Hopkins University), Motoi Omachi, Shinji Watanabe (Johns Hopkins University)
- 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020)
- May 08, 2020
-
- CONFERENCE (INTERNATIONAL)
- Consistency-Aware Multi-Channel Speech Enhancement Using Deep Neural Networks
- Yoshiki Masuyama (Waseda University), Masahito Togami, Tatsuya Komatsu
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Deep Speech Extraction with Time-Varying Spatial Filtering Guided By Desired Direction Attractor
- Yu Nakagome (Waseda University), Masahito Togami, Tetsuji Ogawa (Waseda University), Tetsunori Kobayashi (Waseda University)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit
- Tomoki Hayashi (Nagoya University), Ryuichi Yamamoto, Katsuki Inoue (Okayama University), Takenori Yoshimura (Nagoya University), Shinji Watanabe (Johns Hopkins University), Tomoki Toda (Nagoya University), Kazuya Takeda (Nagoya University), Yu Zhang (Google AI), Xu Tan (Microsoft Research)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Fast Start-Up Algorithm for Adaptive Noise Cancellers with Novel SNR Estimation and Stepsize Control
- Akihiko Sugiyama
- International Conference on Acoustics, Speech, and Signal Processing 2020 (ICASSP2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Improving LPCNet-Based Text-to-Speech with Linear Prediction-Structured Mixture Density Network
- Min-Jae Hwang (Search Solutions Inc), Eunwoo Song (NAVER), Ryuichi Yamamoto, Frank Soong (Microsoft Research Asia), Hong-Goo Kang (Yonsei University)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Joint Training of Deep Neural Networks for Multi-Channel Dereverberation and Speech Source Separation
- Masahito Togami
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Multi-Channel Speech Source Separation and Dereverberation With Sequential Integration of Determined and Underdetermined Models
- Masahito Togami
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Parallel WaveGAN: A Fast Waveform Generation Model Based on Generative Adversarial Networks with Multi-Resolution Spectrogram
- Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Scene-Dependent Acoustic Event Detection with Scene Conditioning and Fake-Scene-Conditioned Loss
- Tatsuya Komatsu, Keisuke Imoto (Ritsumeikan University), Masahito Togami
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Semi-Supervised Speaker Adaptation for End-to-End Speech Synthesis with Pretrained Models
- Katsuki Inoue (Okayama University), Sunao Hara (Okayama University), Masanobu Abe (Okayama University), Tomoki Hayashi (Nagoya University), Ryuichi Yamamoto, Shinji Watanabe (Johns Hopkins University)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020