People
Ryuichi Yamamoto Researcher
A software engineer/researcher working on speech synthesis. His research interests include statistical speech synthesis, voice conversion, singing voice synthesis, and machine learning. Before joining LY Corporation (formerly LINE Corporation), he worked in music signal processing, music information retrieval, and computer vision.
Publications
-
- OTHERS (INTERNATIONAL)
- Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control
- Ryuichi Yamamoto, Yuma Shirahata, Masaya Kawamura, Kentaro Tachibana
- arXiv.org
- September 27, 2024
-
- CONFERENCE (INTERNATIONAL)
- Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data
- Yuma Shirahata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana
- The 25th Annual Conference of the International Speech Communication Association
- September 04, 2024
-
- CONFERENCE (DOMESTIC)
- 感情音声合成のためのアラインメント手法の比較
- 蓮実 拓也, 白旗 悠真, Welly Naptali, 山本 龍一, Eunwoo Song (NAVER Cloud), 橘 健太郎, Jae-Min Kim (NAVER Cloud)
- 日本音響学会 2024年秋季研究発表会
- September 04, 2024
-
- CONFERENCE (INTERNATIONAL)
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning
- Masaya Kawamura, Ryuichi Yamamoto, Yuma Shirahata, Takuya Hasumi, Kentaro Tachibana
- The 25th Annual Conference of the International Speech Communication Association
- September 03, 2024
-
- CONFERENCE (INTERNATIONAL)
- Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment
- Takuto Igarashi (The University of Tokyo), Yuki Saito (The University of Tokyo), Kentaro Seki (The University of Tokyo), Shinnosuke Takamichi (The University of Tokyo), Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)
- The 25th Annual Conference of the International Speech Communication Association
- September 01, 2024
-
- CONFERENCE (INTERNATIONAL)
- SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark
- Yuki Saito (The University of Tokyo), Takuto Igarashi (The University of Tokyo), Kentaro Seki (The University of Tokyo), Shinnosuke Takamichi (The University of Tokyo), Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)
- The 25th Annual Conference of the International Speech Communication Association
- September 01, 2024
-
- CONFERENCE (INTERNATIONAL)
- Enhancing Multilingual TTS with Voice Conversion Based Data Augmentation and Posterior Embedding
- Hyun-Wook Yoon (NAVER Cloud), Jin-Seob Kim (NAVER Cloud), Ryuichi Yamamoto, Ryo Terashima, Chan-Ho Song (NAVER Cloud), Jae-Min Kim (NAVER Cloud), Eunwoo Song (NAVER Cloud)
- 2024 IEEE International Conference on Acoustics, Speech and Signal Processing
- April 14, 2024
-
- CONFERENCE (INTERNATIONAL)
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions
- Reo Shimizu (Tohoku University), Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana
- 2024 IEEE International Conference on Acoustics, Speech and Signal Processing
- April 14, 2024
-
- CONFERENCE (DOMESTIC)
- 音声品質と音響環境の潜在変数で条件付けたDenoising Trainingによるノイズロバスト音声変換
- 五十嵐 琢斗 (東京大学), 齋藤 佑樹 (東京大学), 関 健太郎 (東京大学), 高道 慎之介 (東京大学), 山本 龍一, 橘 健太郎, 猿渡 洋 (東京大学)
- 電子情報通信学会/日本音響学会 音声研究会
- February 22, 2024
-
- CONFERENCE (DOMESTIC)
- NNSVS: ニューラルネットワークに基づく歌声合成のためのオープンソースソフトウェア
- 山本 龍一 (名古屋大学/LINE), 米山 怜於 (名古屋大学), 戸田 智基 (名古屋大学)
- 日本音響学会 2023年秋季研究発表会
- September 26, 2023
-
- CONFERENCE (DOMESTIC)
- Foley Sound Synthesis with a Class-Conditioned Latent Diffusion Model and FAD-Based Post-filtering
- シャイブラー ロビン, 蓮実 拓也, 藤田 雄介, 小松 達也, 山本 龍一, 橘 健太郎
- 日本音響学会 2023年秋季研究発表会
- September 26, 2023
-
- WORKSHOP (INTERNATIONAL)
- Foley Sound Synthesis with a Class-conditioned Latent Diffusion Model
- Robin Scheibler, Takuya Hasumi, Yusuke Fujita, Tatsuya Komatsu, Ryuichi Yamamoto, Kentaro Tachibana
- Detection and Classification of Acoustic Scenes and Events
- September 20, 2023
-
- CONFERENCE (INTERNATIONAL)
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
- Masaya Kawamura (The University of Tokyo), Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing
- June 04, 2023
-
- CONFERENCE (INTERNATIONAL)
- NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit
- Ryuichi Yamamoto (LINE/Nagoya University), Reo Yoneyama (Nagoya University), Tomoki Toda (Nagoya University)
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing
- June 04, 2023
-
- CONFERENCE (INTERNATIONAL)
- Non-parallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs
- Reo Yoneyama (Nagoya University), Ryuichi Yamamoto, Kentaro Tachibana
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing
- June 04, 2023
-
- CONFERENCE (INTERNATIONAL)
- Period VITS: Variational Inference With Explicit Pitch Modeling For End-to-End Emotional Speech
- Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song (NAVER), Ryo Terashima, Jae-Min Kim (NAVER), Kentaro Tachibana
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing
- June 04, 2023
-
- CONFERENCE (INTERNATIONAL)
- A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech
- Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana
- The 23rd Annual Conference of the International Speech Communication Association
- September 18, 2022
-
- CONFERENCE (INTERNATIONAL)
- Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation
- Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song (NAVER), Yuma Shirahata, Hyun-Wook Yoon (NAVER), Jae-Min Kim (NAVER), Kentaro Tachibana
- The 23rd Annual Conference of the International Speech Communication Association
- September 18, 2022
-
- CONFERENCE (INTERNATIONAL)
- DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning
- Takaaki Saeki (The University of Tokyo), Kentaro Tachibana, Ryuichi Yamamoto
- The 23rd Annual Conference of the International Speech Communication Association
- September 18, 2022
-
- CONFERENCE (INTERNATIONAL)
- Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
- Hyun-Wook Yoon (NAVER), Ohsung Kwon (NAVER), Hoyeon Lee (NAVER), Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER), Min-Jae Hwang (NAVER)
- The 23rd Annual Conference of the International Speech Communication Association
- September 18, 2022