LINEヤフーの研究開発

JP
EN

People

白旗悠真 Yuma Shirahata ソフトウェアエンジニア

2021年東京大学大学院工学系研究科電気系工学専攻修士課程修了。同年4月にLINE株式会社に新卒入社し、2023年10月より現職。主に音声合成の研究開発に取り組んでおり、高品質な感情音声合成モデルや大規模ラベルなし音声データを用いた音声生成向け基盤モデルの開発などに従事。

Publications

カンファレンス (国際)

CC-G2PNP: Streaming Grapheme-to-Phoneme and Prosody with Conformer-CTC for Unsegmented Languages

Yuma Shirahata, Ryuichi Yamamoto

2026 IEEE International Conference on Acoustics, Speech and Signal Processing

2026.5.7
カンファレンス (国際)

Wave-Trainer-Fit: Neural Vocoder With Trainable Prior And Fixed-Point Iteration Towards High-Quality Speech Generation From SSL Features

Hien Ohnaka (Nara Institute of Science and Technology), Yuma Shirahata, Masaya Kawamura

2026 IEEE International Conference on Acoustics, Speech and Signal Processing

2026.5.5
カンファレンス (国内)

ニューラルオーディオコーデック特徴量を用いた音声から話者特有の表情予測モデルの構築及び分析

朴浚鎔 (東京大学), 陳晋升 (東京大学), 土井啓成, 朴炳宣, 白旗悠真, 橘健太郎, 楊棟 (東京大学), 齋藤佑樹 (東京大学), 猿渡洋 (東京大学)

日本音響学会 2026年春季研究発表会

2026.3.19
ワークショップ (国際)

CAVIARES: Corpus for Audio-Visual Expressive Voice Agent

Jinsheng Chen (The University of Tokyo), Yuki Saito (The University of Tokyo), Dong Yang (The University of Tokyo), Naoko Tanji (The University of Tokyo), Hironori Doi, Byeongseon Park, Yuma Shirahata, Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)

2025 IEEE Automatic Speech Recognition and Understanding Workshop

2025.12.9
カンファレンス (国内)

BitTTS: 1.58-bit量子化と重みインデキシングによる軽量なテキスト音声合成

川村真也, 蓮実拓也, 白旗悠真, 山本龍一

日本音響学会 2025年秋季研究発表会

2025.9.11
カンファレンス (国内)

音声からの音素・韻律ラベルの獲得とその応用

白旗悠真, 朴炳宣, 山本龍一

日本音響学会 2025年秋季研究発表会

2025.9.10
カンファレンス (国際)

BitTTS: Highly Compact Text-to-Speech Using 1.58-bit Quantization and Weight Indexing

Masaya Kawamura, Takuya Hasumi, Yuma Shirahata, Ryuichi Yamamoto

The 26th Annual Conference of the International Speech Communication Association

2025.8.21
カンファレンス (国際)

SLASH: Self-Supervised Speech Pitch Estimation Leveraging DSP-derived Absolute Pitch

Ryo Terashima, Yuma Shirahata, Masaya Kawamura

The 26th Annual Conference of the International Speech Communication Association

2025.8.19
カンファレンス (国際)

Grapheme-Coherent Phonemic and Prosodic Annotation of Speech by Implicit and Explicit Grapheme Conditioning

Hien Ohnaka (Nara Institute of Science and Technology), Yuma Shirahata, Byeongseon Park, Ryuichi Yamamoto

The 26th Annual Conference of the International Speech Communication Association

2025.8.17
カンファレンス (国際)

Description-Based Controllable Text-to-Speech With Cross-Lingual Voice Control

Ryuichi Yamamoto, Yuma Shirahata, Masaya Kawamura, Kentaro Tachibana

2025 IEEE International Conference on Acoustics, Speech and Signal Processing

2025.4.6

VIEW ALL