-
- カンファレンス (国際)
- A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech
- Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- 2022.9.18
-
- カンファレンス (国際)
- Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History
- Yuto Nishimura (The University of Tokyo), Yuki Saito (The University of Tokyo), Shinnosuke Takamichi (The University of Tokyo), Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- 2022.9.18
-
- カンファレンス (国際)
- Better Intermediates Improve CTC Inference
- Tatsuya Komatsu, Yusuke Fujita, Jaesong Lee (NAVER), Lukas Lee (NAVER), Shinji Watanabe (Carnegie Mellon University), Yusuke Kida
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- 2022.9.18
-
- カンファレンス (国際)
- Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation
- Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song (NAVER), Yuma Shirahata, Hyun-Wook Yoon (NAVER), Jae-Min Kim (NAVER), Kentaro Tachibana
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- 2022.9.18
-
- カンファレンス (国際)
- DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning
- Takaaki Saeki (The University of Tokyo), Kentaro Tachibana, Ryuichi Yamamoto
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- 2022.9.18
-
- カンファレンス (国際)
- ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding
- Yen-Ju Lu (Academia Sinica), Xuankai Chang (CMU), Chenda Li (SJTU), Wangyou Zhang (SJTU), Samuele Cornell (Universit`a Politecnica delle Marche), Zhaoheng Ni (Meta AI), Yoshiki Masuyama (CMU/TMU), Brian Yan (CMU), Robin Scheibler, Zhong-Qiu Wang (CMU), Yu Tsao (Academica Sinica), Yanmin Qian (SJTU), Shinji Watanabe (CMU)
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- 2022.9.18
-
- カンファレンス (国際)
- Independence-based Joint Dereverberation and Separation with Neural Source Model
- Kohei Saijo (Waseda University), Robin Scheilbler
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- 2022.9.18
-
- カンファレンス (国際)
- InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
- Yu Nakagome, Tatsuya Komatsu, Yusuke Fujita, Shuta Ichimura, Yusuke Kida
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- 2022.9.18
-
- カンファレンス (国際)
- Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
- Hyun-Wook Yoon (NAVER), Ohsung Kwon (NAVER), Hoyeon Lee (NAVER), Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER), Min-Jae Hwang (NAVER)
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- 2022.9.18
-
- ワークショップ (国際)
- Leveraging Context-dependent Click Model for Off-Policy Evaluation of Ranking Policies
- Haruka Kiyohara (Tokyo Institute of Technology), Nobuyuki Shimizu, Yasuo Yamamoto
- The 16th ACM Recommender Systems Conference Consequences Workshop (RecSys Consequences Workshop)
- 2022.9.18
-
- カンファレンス (国際)
- Minimum Latency Training of Sequence Transducers for Streaming End-to-End Speech Recognition
- Yusuke Shinohara, Shinji Watanabe (Carnegie Mellon University)
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- 2022.9.18
-
- カンファレンス (国際)
- Solving Diversity-Aware Maximum Inner Product Search Efficiently and Effectively
- Kohei Hirata (Osaka univ.), Daichi Amagata (Osaka univ.), Takahiro Hara (Osaka univ.), Sumio Fujita
- 16th ACM Conference on Recommender Systems (RecSys 2022)
- 2022.9.18
-
- カンファレンス (国際)
- Spatial Loss for Unsupervised Multi-channel Source Separation
- Kohei Saijo (Waseda University), Robin Scheilbler
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- 2022.9.18
-
- カンファレンス (国際)
- STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent
- Yuki Saito (The University of Tokyo), Yuto Nishimura (The University of Tokyo), Shinnosuke Takamichi (The University of Tokyo), Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- 2022.9.18
-
- カンファレンス (国際)
- TTS-by-TTS 2: Data-selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder
- Eunwoo Song (NAVER), Ryuichi Yamamoto, Ohsung Kwon (NAVER), Chan-Ho Song, Min-Jae Hwang (NAVER), Suhyeon Oh (NAVER), Hyun-Wook Yoon (NAVER), Jin-Seob Kim (NAVER), Jae-Min Kim (NAVER)
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- 2022.9.18
-
- カンファレンス (国内)
- CTC ベース音声認識モデルにおける中間層ロスと条件付けが与える影響の考察
- 市村 収太, 中込 優, 藤田 雄介, 小松 達也, 木田 祐介
- 日本音響学会 2022年秋季研究発表会 (ASJ 2022 autumn)
- 2022.9.14
-
- カンファレンス (国内)
- End-to-end Automatic Speech Recognition with Independent Vector Analysis Frontend
- シャイブラー ロビン, Zhang Wangyou (Shanghai Jiao Tong University), Chang Xuankai (Shanghai Jiao Tong University), 渡部 晋治 (Carnegie Mellon University), Qian Yanmin (Shanghai Jiao Tong University)
- 日本音響学会 2022年秋季研究発表会 (ASJ 2022 autumn)
- 2022.9.14
-
- カンファレンス (国内)
- 中間層予測にビームサーチを用いた新しい CTC 推論
- 小松 達也, 藤田 雄介, Lee Jaesong (NAVER), Lee Lukas (NAVER), 渡部 晋治 (Carnegie Mellon University), 木田 祐介
- 日本音響学会 2022年秋季研究発表会 (ASJ 2022 autumn)
- 2022.9.14
-
- カンファレンス (国内)
- 中間層予測に対するノイズ付与による CTC 音声認識の頑健性向上
- 中込 優, 小松 達也, 藤田 雄介, 市村 収太, 木田 祐介
- 日本音響学会 2022年秋季研究発表会 (ASJ 2022 autumn)
- 2022.9.14
-
- カンファレンス (国内)
- 中間層予測に音節と表記を用いる日本語音声認識
- 藤田 雄介, 小松 達也, 木田 祐介
- 日本音響学会 2022年秋季研究発表会 (ASJ 2022 autumn)
- 2022.9.14