-
- CONFERENCE (INTERNATIONAL)
- Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation
- Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song (NAVER), Yuma Shirahata, Hyun-Wook Yoon (NAVER), Jae-Min Kim (NAVER), Kentaro Tachibana
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- September 18, 2022
-
- CONFERENCE (INTERNATIONAL)
- DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning
- Takaaki Saeki (The University of Tokyo), Kentaro Tachibana, Ryuichi Yamamoto
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- September 18, 2022
-
- CONFERENCE (INTERNATIONAL)
- ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding
- Yen-Ju Lu (Academia Sinica), Xuankai Chang (CMU), Chenda Li (SJTU), Wangyou Zhang (SJTU), Samuele Cornell (Universit`a Politecnica delle Marche), Zhaoheng Ni (Meta AI), Yoshiki Masuyama (CMU/TMU), Brian Yan (CMU), Robin Scheibler, Zhong-Qiu Wang (CMU), Yu Tsao (Academica Sinica), Yanmin Qian (SJTU), Shinji Watanabe (CMU)
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- September 18, 2022
-
- CONFERENCE (INTERNATIONAL)
- Independence-based Joint Dereverberation and Separation with Neural Source Model
- Kohei Saijo (Waseda University), Robin Scheilbler
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- September 18, 2022
-
- CONFERENCE (INTERNATIONAL)
- InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
- Yu Nakagome, Tatsuya Komatsu, Yusuke Fujita, Shuta Ichimura, Yusuke Kida
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- September 18, 2022
-
- CONFERENCE (INTERNATIONAL)
- Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
- Hyun-Wook Yoon (NAVER), Ohsung Kwon (NAVER), Hoyeon Lee (NAVER), Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER), Min-Jae Hwang (NAVER)
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- September 18, 2022
-
- WORKSHOP (INTERNATIONAL)
- Leveraging Context-dependent Click Model for Off-Policy Evaluation of Ranking Policies
- Haruka Kiyohara (Tokyo Institute of Technology), Nobuyuki Shimizu, Yasuo Yamamoto
- The 16th ACM Recommender Systems Conference Consequences Workshop (RecSys Consequences Workshop)
- September 18, 2022
-
- CONFERENCE (INTERNATIONAL)
- Minimum Latency Training of Sequence Transducers for Streaming End-to-End Speech Recognition
- Yusuke Shinohara, Shinji Watanabe (Carnegie Mellon University)
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- September 18, 2022
-
- CONFERENCE (INTERNATIONAL)
- Solving Diversity-Aware Maximum Inner Product Search Efficiently and Effectively
- Kohei Hirata (Osaka univ.), Daichi Amagata (Osaka univ.), Takahiro Hara (Osaka univ.), Sumio Fujita
- 16th ACM Conference on Recommender Systems (RecSys 2022)
- September 18, 2022
-
- CONFERENCE (INTERNATIONAL)
- Spatial Loss for Unsupervised Multi-channel Source Separation
- Kohei Saijo (Waseda University), Robin Scheilbler
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- September 18, 2022
-
- CONFERENCE (INTERNATIONAL)
- STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent
- Yuki Saito (The University of Tokyo), Yuto Nishimura (The University of Tokyo), Shinnosuke Takamichi (The University of Tokyo), Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- September 18, 2022
-
- CONFERENCE (INTERNATIONAL)
- TTS-by-TTS 2: Data-selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder
- Eunwoo Song (NAVER), Ryuichi Yamamoto, Ohsung Kwon (NAVER), Chan-Ho Song, Min-Jae Hwang (NAVER), Suhyeon Oh (NAVER), Hyun-Wook Yoon (NAVER), Jin-Seob Kim (NAVER), Jae-Min Kim (NAVER)
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- September 18, 2022
-
- CONFERENCE (DOMESTIC)
- CTC ベース音声認識モデルにおける中間層ロスと条件付けが与える影響の考察
- 市村 収太, 中込 優, 藤田 雄介, 小松 達也, 木田 祐介
- 日本音響学会 2022年秋季研究発表会 (ASJ 2022 autumn)
- September 14, 2022
-
- CONFERENCE (DOMESTIC)
- End-to-end Automatic Speech Recognition with Independent Vector Analysis Frontend
- シャイブラー ロビン, Zhang Wangyou (Shanghai Jiao Tong University), Chang Xuankai (Shanghai Jiao Tong University), 渡部 晋治 (Carnegie Mellon University), Qian Yanmin (Shanghai Jiao Tong University)
- 日本音響学会 2022年秋季研究発表会 (ASJ 2022 autumn)
- September 14, 2022
-
- CONFERENCE (DOMESTIC)
- 中間層予測にビームサーチを用いた新しい CTC 推論
- 小松 達也, 藤田 雄介, Lee Jaesong (NAVER), Lee Lukas (NAVER), 渡部 晋治 (Carnegie Mellon University), 木田 祐介
- 日本音響学会 2022年秋季研究発表会 (ASJ 2022 autumn)
- September 14, 2022
-
- CONFERENCE (DOMESTIC)
- 中間層予測に対するノイズ付与による CTC 音声認識の頑健性向上
- 中込 優, 小松 達也, 藤田 雄介, 市村 収太, 木田 祐介
- 日本音響学会 2022年秋季研究発表会 (ASJ 2022 autumn)
- September 14, 2022
-
- CONFERENCE (DOMESTIC)
- 中間層予測に音節と表記を用いる日本語音声認識
- 藤田 雄介, 小松 達也, 木田 祐介
- 日本音響学会 2022年秋季研究発表会 (ASJ 2022 autumn)
- September 14, 2022
-
- CONFERENCE (DOMESTIC)
- 微分可能な信号処理に基づく音声合成器を用いた DNN 音声パラメータ推定の検討
- 松永 裕太 (LINE/東京大学), 寺島 涼, 橘 健太郎
- 日本音響学会 2022年秋季研究発表会 (ASJ 2022 autumn)
- September 14, 2022
-
- CONFERENCE (DOMESTIC)
- 音響シーン認識のためのサブアレイ間相関特徴量の検討
- 河村 隆生 (東京都立大学), 木下 裕磨 (東京都立大学/東海大学), 小野 順貴 (東京都立大学), シャイブラー ロビン
- 日本音響学会 2022年秋季研究発表会 (ASJ 2022 autumn)
- September 14, 2022
-
- CONFERENCE (DOMESTIC)
- 多様性を考慮した最大内積探索
- 平田 皓平 (大阪大学), 天方 大地 (大阪大学), 原 隆浩 (大阪大学), 藤田 澄男
- 第21回情報科学技術フォーラム (FIT2022)
- September 13, 2022