People
白旗 悠真 Yuma Shirahata ソフトウェアエンジニア
2021年東京大学大学院工学系研究科電気系工学専攻修士課程修了。同年4月にLINE株式会社に新卒入社し、2023年10月より現職。主に音声合成の研究開発に取り組んでおり、高品質な感情音声合成モデルや大規模ラベルなし音声データを用いた音声生成向け基盤モデルの開発などに従事。
Publications
-
- その他 (国際)
- Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control
- Ryuichi Yamamoto, Yuma Shirahata, Masaya Kawamura, Kentaro Tachibana
- arXiv.org
- 2024.9.27
-
- カンファレンス (国際)
- Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data
- Yuma Shirahata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana
- The 25th Annual Conference of the International Speech Communication Association
- 2024.9.4
-
- カンファレンス (国内)
- 感情音声合成のためのアラインメント手法の比較
- 蓮実 拓也, 白旗 悠真, Welly Naptali, 山本 龍一, Eunwoo Song (NAVER Cloud), 橘 健太郎, Jae-Min Kim (NAVER Cloud)
- 日本音響学会 2024年秋季研究発表会
- 2024.9.4
-
- カンファレンス (国際)
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning
- Masaya Kawamura, Ryuichi Yamamoto, Yuma Shirahata, Takuya Hasumi, Kentaro Tachibana
- The 25th Annual Conference of the International Speech Communication Association
- 2024.9.3
-
- カンファレンス (国際)
- Universal Score-based Speech Enhancement with High Content Preservation
- Robin Scheibler, Yusuke Fujita, Yuma Shirahata, Tatsuya Komatsu
- The 25th Annual Conference of the International Speech Communication Association
- 2024.9.1
-
- カンファレンス (国際)
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions
- Reo Shimizu (Tohoku University), Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana
- 2024 IEEE International Conference on Acoustics, Speech and Signal Processing
- 2024.4.14
-
- カンファレンス (国際)
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
- Masaya Kawamura (The University of Tokyo), Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing
- 2023.6.4
-
- カンファレンス (国際)
- Period VITS: Variational Inference With Explicit Pitch Modeling For End-to-End Emotional Speech
- Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song (NAVER), Ryo Terashima, Jae-Min Kim (NAVER), Kentaro Tachibana
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing
- 2023.6.4
-
- カンファレンス (国際)
- Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation
- Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song (NAVER), Yuma Shirahata, Hyun-Wook Yoon (NAVER), Jae-Min Kim (NAVER), Kentaro Tachibana
- The 23rd Annual Conference of the International Speech Communication Association
- 2022.9.18