People
Yuma Shirahata Software Engineer
Graduated with a Master's degree from the Department of Electrical Engineering and Information Systems, Graduate School of Engineering, The University of Tokyo, in 2021. Joined LINE Corporation as a new graduate in April of the same year. Since October 2023, I have been in my current position. I am mainly engaged in research and development of speech synthesis, working on the development of high-quality emotional speech synthesis models and foundational models for speech generation using large-scale unlabeled speech data.
Publications
-
- OTHERS (INTERNATIONAL)
- Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control
- Ryuichi Yamamoto, Yuma Shirahata, Masaya Kawamura, Kentaro Tachibana
- arXiv.org
- September 27, 2024
-
- CONFERENCE (INTERNATIONAL)
- Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data
- Yuma Shirahata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana
- The 25th Annual Conference of the International Speech Communication Association
- September 04, 2024
-
- CONFERENCE (DOMESTIC)
- 感情音声合成のためのアラインメント手法の比較
- 蓮実 拓也, 白旗 悠真, Welly Naptali, 山本 龍一, Eunwoo Song (NAVER Cloud), 橘 健太郎, Jae-Min Kim (NAVER Cloud)
- 日本音響学会 2024年秋季研究発表会
- September 04, 2024
-
- CONFERENCE (INTERNATIONAL)
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning
- Masaya Kawamura, Ryuichi Yamamoto, Yuma Shirahata, Takuya Hasumi, Kentaro Tachibana
- The 25th Annual Conference of the International Speech Communication Association
- September 03, 2024
-
- CONFERENCE (INTERNATIONAL)
- Universal Score-based Speech Enhancement with High Content Preservation
- Robin Scheibler, Yusuke Fujita, Yuma Shirahata, Tatsuya Komatsu
- The 25th Annual Conference of the International Speech Communication Association
- September 01, 2024
-
- CONFERENCE (INTERNATIONAL)
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions
- Reo Shimizu (Tohoku University), Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana
- 2024 IEEE International Conference on Acoustics, Speech and Signal Processing
- April 14, 2024
-
- CONFERENCE (INTERNATIONAL)
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
- Masaya Kawamura (The University of Tokyo), Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing
- June 04, 2023
-
- CONFERENCE (INTERNATIONAL)
- Period VITS: Variational Inference With Explicit Pitch Modeling For End-to-End Emotional Speech
- Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song (NAVER), Ryo Terashima, Jae-Min Kim (NAVER), Kentaro Tachibana
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing
- June 04, 2023
-
- CONFERENCE (INTERNATIONAL)
- Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation
- Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song (NAVER), Yuma Shirahata, Hyun-Wook Yoon (NAVER), Jae-Min Kim (NAVER), Kentaro Tachibana
- The 23rd Annual Conference of the International Speech Communication Association
- September 18, 2022