People
Yuma Shirahata Software Engineer
Graduated with a Master's degree from the Department of Electrical Engineering and Information Systems, Graduate School of Engineering, The University of Tokyo, in 2021. Joined LINE Corporation as a new graduate in April of the same year. Since October 2023, I have been in my current position. I am mainly engaged in research and development of speech synthesis, working on the development of high-quality emotional speech synthesis models and foundational models for speech generation using large-scale unlabeled speech data.
Publications
-
- WORKSHOP (INTERNATIONAL)
- CAVIARES: Corpus for Audio-Visual Expressive Voice Agent
- Jinsheng Chen (The University of Tokyo), Yuki Saito (The University of Tokyo), Dong Yang (The University of Tokyo), Naoko Tanji (The University of Tokyo), Hironori Doi, Byeongseon Park, Yuma Shirahata, Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)
- 2025 IEEE Automatic Speech Recognition and Understanding Workshop
- December 09, 2025
-
- CONFERENCE (DOMESTIC)
- BitTTS: 1.58-bit量子化と重みインデキシングによる軽量なテキスト音声合成
- 川村 真也, 蓮実 拓也, 白旗 悠真, 山本 龍一
- 日本音響学会 2025年秋季研究発表会
- September 11, 2025
-
- CONFERENCE (DOMESTIC)
- 音声からの音素・韻律ラベルの獲得とその応用
- 白旗 悠真, 朴 炳宣, 山本 龍一
- 日本音響学会 2025年秋季研究発表会
- September 10, 2025
-
- CONFERENCE (INTERNATIONAL)
- BitTTS: Highly Compact Text-to-Speech Using 1.58-bit Quantization and Weight Indexing
- Masaya Kawamura, Takuya Hasumi, Yuma Shirahata, Ryuichi Yamamoto
- The 26th Annual Conference of the International Speech Communication Association
- August 21, 2025
-
- CONFERENCE (INTERNATIONAL)
- SLASH: Self-Supervised Speech Pitch Estimation Leveraging DSP-derived Absolute Pitch
- Ryo Terashima, Yuma Shirahata, Masaya Kawamura
- The 26th Annual Conference of the International Speech Communication Association
- August 19, 2025
-
- CONFERENCE (INTERNATIONAL)
- Grapheme-Coherent Phonemic and Prosodic Annotation of Speech by Implicit and Explicit Grapheme Conditioning
- Hien Ohnaka (Nara Institute of Science and Technology), Yuma Shirahata, Byeongseon Park, Ryuichi Yamamoto
- The 26th Annual Conference of the International Speech Communication Association
- August 17, 2025
-
- CONFERENCE (INTERNATIONAL)
- Description-Based Controllable Text-to-Speech With Cross-Lingual Voice Control
- Ryuichi Yamamoto, Yuma Shirahata, Masaya Kawamura, Kentaro Tachibana
- 2025 IEEE International Conference on Acoustics, Speech and Signal Processing
- April 06, 2025
-
- CONFERENCE (DOMESTIC)
- マルチモーダル共感的対話音声合成に向けたコーパスの構築
- 齋藤 佑樹 (東京大学), 陳 晋升 (東京大学), 楊 棟 (東京大学), 丹治 尚子 (東京大学), 土井 啓成, 白旗 悠真, 朴 炳宣, 橘 健太郎, 猿渡 洋 (東京大学)
- 日本音響学会 2025年春季研究発表会
- March 17, 2025
-
- OTHERS (INTERNATIONAL)
- Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control
- Ryuichi Yamamoto, Yuma Shirahata, Masaya Kawamura, Kentaro Tachibana
- arXiv.org
- September 27, 2024
-
- CONFERENCE (INTERNATIONAL)
- Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data
- Yuma Shirahata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana
- The 25th Annual Conference of the International Speech Communication Association
- September 04, 2024