生成AI
-
- カンファレンス (国際)
- ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit
- Tomoki Hayashi (Nagoya University), Ryuichi Yamamoto, Katsuki Inoue (Okayama University), Takenori Yoshimura (Nagoya University), Shinji Watanabe (Johns Hopkins University), Tomoki Toda (Nagoya University), Kazuya Takeda (Nagoya University), Yu Zhang (Google AI), Xu Tan (Microsoft Research)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- 2020.5.4
-
- カンファレンス (国際)
- Improving LPCNet-Based Text-to-Speech with Linear Prediction-Structured Mixture Density Network
- Min-Jae Hwang (Search Solutions Inc), Eunwoo Song (NAVER), Ryuichi Yamamoto, Frank Soong (Microsoft Research Asia), Hong-Goo Kang (Yonsei University)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- 2020.5.4
-
- カンファレンス (国際)
- Parallel WaveGAN: A Fast Waveform Generation Model Based on Generative Adversarial Networks with Multi-Resolution Spectrogram
- Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- 2020.5.4
-
- カンファレンス (国際)
- Semi-Supervised Speaker Adaptation for End-to-End Speech Synthesis with Pretrained Models
- Katsuki Inoue (Okayama University), Sunao Hara (Okayama University), Masanobu Abe (Okayama University), Tomoki Hayashi (Nagoya University), Ryuichi Yamamoto, Shinji Watanabe (Johns Hopkins University)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- 2020.5.4
-
- カンファレンス (国際)
- Probability Density Distillation with Generative Adversarial Networks for High-Quality Parallel Waveform Generation
- Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER)
- The 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
- 2019.9.15