People
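Developer and researcher in speech processing. Completed the master's program at the Graduate School of Nagoya Institute of Technology in 2013. After working at teamLab Inc., joined LINE Corporation (now LY Corporation) in February 2018 (current position). From September 2018 to July 2019, conducted speech research on the Clova Voice team at NAVER Corp. Engaged in research and development of speech synthesis. Has released numerous open-source software projects for speech synthesis, notably for models such as WaveNet and Tacotron. Author of the book 「Pythonで学ぶ音声合成 機械学習実践シリーズ」 (Speech Synthesis with Python, Machine Learning Practice Series), published by Impress.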
Publications
-
- Other (International)
- Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control
- Ryuichi Yamamoto, Yuma Shirahata, Masaya Kawamura, Kentaro Tachibana
- arXiv.org
- 2024.9.27
-
- Conference (International)
- Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data
- Yuma Shirahata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana
- The 25th Annual Conference of the International Speech Communication Association
- 2024.9.4
-
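- Conference (Domestic)
- A Comparison of Alignment Methods for Emotional Speech Synthesis
- Takuya Hasumi, Yuma Shirahata, Welly Naptali, Ryuichi Yamamoto, Eunwoo Song (NAVER Cloud), Kentaro Tachibana, Jae-Min Kim (NAVER Cloud)
- Acoustical Society of Japan, 2024 Autumn Meeting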
- 2024.9.4
-
- Conference (International)
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning
- Masaya Kawamura, Ryuichi Yamamoto, Yuma Shirahata, Takuya Hasumi, Kentaro Tachibana
- The 25th Annual Conference of the International Speech Communication Association
- 2024.9.3
-
- Conference (International)
- Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment
- Takuto Igarashi (The University of Tokyo), Yuki Saito (The University of Tokyo), Kentaro Seki (The University of Tokyo), Shinnosuke Takamichi (The University of Tokyo), Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)
- The 25th Annual Conference of the International Speech Communication Association
- 2024.9.1
-
- Conference (International)
- SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark
- Yuki Saito (The University of Tokyo), Takuto Igarashi (The University of Tokyo), Kentaro Seki (The University of Tokyo), Shinnosuke Takamichi (The University of Tokyo), Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)
- The 25th Annual Conference of the International Speech Communication Association
- 2024.9.1
-
- Conference (International)
- Enhancing Multilingual TTS with Voice Conversion Based Data Augmentation and Posterior Embedding
- Hyun-Wook Yoon (NAVER Cloud), Jin-Seob Kim (NAVER Cloud), Ryuichi Yamamoto, Ryo Terashima, Chan-Ho Song (NAVER Cloud), Jae-Min Kim (NAVER Cloud), Eunwoo Song (NAVER Cloud)
- 2024 IEEE International Conference on Acoustics, Speech and Signal Processing
- 2024.4.14
-
- Conference (International)
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions
- Reo Shimizu (Tohoku University), Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana
- 2024 IEEE International Conference on Acoustics, Speech and Signal Processing
- 2024.4.14
-
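- Conference (Domestic)
- Noise-Robust Voice Conversion by Denoising Training Conditioned on Latent Variables of Speech Quality and Acoustic Environment
- Takuto Igarashi (The University of Tokyo), Yuki Saito (The University of Tokyo), Kentaro Seki (The University of Tokyo), Shinnosuke Takamichi (The University of Tokyo), Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)
- IEICE / Acoustical Society of Japan, Technical Committee on Speech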
- 2024.2.22
-
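- Conference (Domestic)
- NNSVS: Open-Source Software for Neural Network-Based Singing Voice Synthesis
- Ryuichi Yamamoto (Nagoya University/LINE), Reo Yoneyama (Nagoya University), Tomoki Toda (Nagoya University)
- Acoustical Society of Japan, 2023 Autumn Meeting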
- 2023.9.26
-
- Conference (Domestic)
- Foley Sound Synthesis with a Class-Conditioned Latent Diffusion Model and FAD-Based Post-filtering
- Robin Scheibler, Takuya Hasumi, Yusuke Fujita, Tatsuya Komatsu, Ryuichi Yamamoto, Kentaro Tachibana
- Acoustical Society of Japan, 2023 Autumn Meeting
- 2023.9.26
-
- Workshop (International)
- Foley Sound Synthesis with a Class-conditioned Latent Diffusion Model
- Robin Scheibler, Takuya Hasumi, Yusuke Fujita, Tatsuya Komatsu, Ryuichi Yamamoto, Kentaro Tachibana
- Detection and Classification of Acoustic Scenes and Events
- 2023.9.20
-
- Conference (International)
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
- Masaya Kawamura (The University of Tokyo), Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing
- 2023.6.4
-
- Conference (International)
- NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit
- Ryuichi Yamamoto (LINE/Nagoya University), Reo Yoneyama (Nagoya University), Tomoki Toda (Nagoya University)
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing
- 2023.6.4
-
- Conference (International)
- Non-parallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs
- Reo Yoneyama (Nagoya University), Ryuichi Yamamoto, Kentaro Tachibana
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing
- 2023.6.4
-
- Conference (International)
- Period VITS: Variational Inference With Explicit Pitch Modeling For End-to-End Emotional Speech
- Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song (NAVER), Ryo Terashima, Jae-Min Kim (NAVER), Kentaro Tachibana
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing
- 2023.6.4
-
- Conference (International)
- A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech
- Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana
- The 23rd Annual Conference of the International Speech Communication Association
- 2022.9.18
-
- Conference (International)
- Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation
- Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song (NAVER), Yuma Shirahata, Hyun-Wook Yoon (NAVER), Jae-Min Kim (NAVER), Kentaro Tachibana
- The 23rd Annual Conference of the International Speech Communication Association
- 2022.9.18
-
- Conference (International)
- DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning
- Takaaki Saeki (The University of Tokyo), Kentaro Tachibana, Ryuichi Yamamoto
- The 23rd Annual Conference of the International Speech Communication Association
- 2022.9.18
-
- Conference (International)
- Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
- Hyun-Wook Yoon (NAVER), Ohsung Kwon (NAVER), Hoyeon Lee (NAVER), Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER), Min-Jae Hwang (NAVER)
- The 23rd Annual Conference of the International Speech Communication Association
- 2022.9.18