People
藤田 悠哉 Yuya Fujita ソフトウェアエンジニア
音声認識技術の研究開発に携わっております。
Awards
Publications
-
- カンファレンス (国内)
- トピックモデルを用いた教師なし学習によるHuBERTの意味表現向上
- 前角 高史, Jiatong Shi (カーネギーメロン大学), Xuankai Chang (カーネギーメロン大学), 藤田 悠哉, 渡部 晋治 (カーネギーメロン大学)
- 日本音響学会 2024年秋季研究発表会
- 2024.9.4
-
- カンファレンス (国内)
- 離散トークン音声認識におけるドメイン適応の検討
- 石井 敬章, 小松 達也, 藤田 雄介, 藤田 悠哉
- 日本音響学会 2024年秋季研究発表会
- 2024.9.4
-
- 論文誌 (国際)
- MC-Whisper: Extending Speech Foundation Models to Multichannel Distant Speech Recognition
- Xuankai Chang (Carnegie Mellon University), Pengcheng Guo (Northwestern Polytechnical University), Yuya Fujita, Takashi Maekaku, Shinji Watanabe (Carnegie Mellon University)
- IEEE Signal Processing Letters
- 2024.8.26
-
- カンファレンス (国際)
- Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing
- Brian Yan (Carnegie Mellon University), Xuankai Chang (Carnegie Mellon University), Antonios Anastasopoulos (George Mason University), Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- 2024 IEEE International Conference on Acoustics, Speech and Signal Processing
- 2024.4.14
-
- その他 (国際)
- LV-CTC: Non-autoregressive ASR with CTC and latent variable models
- Yuya Fujita, Shinji Watanabe (Carnegie Mellon Univ.), Xuankai Chang (Carnegie Mellon Univ.), Takashi Maekaku
- arXiv.org
- 2024.3.28
-
- カンファレンス (国際)
- Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
- Xuankai Chang (Carnegie Mellon University), Brian Yan (Carnegie Mellon University), Kwanghee Choi (Carnegie Mellon University), Jee-Weon Jung (Carnegie Mellon University), Yichen Lu (Carnegie Mellon University), Soumi Maiti (Carnegie Mellon University), Roshan Sharma (Carnegie Mellon University), Jiatong Shi (Carnegie Mellon University), Jinchuan Tian (Carnegie Mellon University), Shinji Watanabe (Carnegie Mellon University), Yuya Fujita, Takashi Maekaku, Pengcheng Guo (Northwestern Polytechnical University), Yao-Fei Cheng (University of Washington), Pavel Denisov (University of Stuttgart), Kohei Saijo (Waseda University), Hsiu-Hsuan Wang (National Taiwan University)
- 2024 IEEE International Conference on Acoustics, Speech and Signal Processing
- 2024.3.20
-
- カンファレンス (国際)
- Hubertopic: Enhancing Semantic Representation of Hubert Through Self-Supervision Utilizing Topic Model
- Takashi Maekaku, Jiatong Shi (Carnegie Mellon University), Xuankai Chang (Carnegie Mellon University), Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- 2024 IEEE International Conference on Acoustics, Speech and Signal Processing
- 2024.3.20
-
- カンファレンス (国際)
- LV-CTC: Non-autoregressive ASR with CTC and Latent Variable Models
- Yuya Fujita, Shinji Watanabe (Carnegie Mellon Univ.), Xuankai Chang (Carnegie Mellon Univ.), Takashi Maekaku
- The 2023 IEEE Workshop on Automatic Speech Recognition and Understanding
- 2023.12.16
-
- その他 (国際)
- HuBERTopic: Enhancing Semantic Representation of HuBERT through Self-supervision Utilizing Topic Model
- Takashi Maekaku, Jiatong Shi (Carnegie Mellon University), Xuankai Chang (Carnegie Mellon University), Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- arXiv
- 2023.10.9
-
- カンファレンス (国内)
- 潜在変数モデルを用いたCTCによる非自己回帰型音声認識
- 藤田 悠哉, 渡部 晋治 (カーネギーメロン大学)
- 日本音響学会第150回(2023年秋季)研究発表会
- 2023.9.26