People
藤田 悠哉 Yuya Fujita ソフトウェアエンジニア
音声認識技術の研究開発に携わっております。
Publications
-
- カンファレンス (国際)
- Non-Autoregressive End-to-End Automatic Speech Recognition Incorporating Downstream Natural Language Processing
- Motoi Omachi, Yuya Fujita, Shinji Watanabe (Carnegie Mellon University), Tianzi Wang (Johns Hopkins University)
- 2022 IEEE International Conference on Acoustics, Speech and Signal Processing
- 2022.4.27
-
- その他 (国際)
- End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
- Xuankai Chang (Carnegie Mellon University), Takashi Maekaku, Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- arXiv.org
- 2022.4.1
-
- ワークショップ (国際)
- A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
- Yosuke Higuchi (Waseda University), Nanxin Chen (Johns Hopkins University), Yuya Fujita, Hirofumi Inaguma (Kyoto University), Tatsuya Komatsu (LINE Corporation), Jaesong Lee (Naver Corporation), Jumon Nozaki (Kyoto University, LINE Corporation), Tianzi Wang (Johns Hopkins University), Shinji Watanabe (Carnegie Mellon University)
- The 2021 IEEE Automatic Speech Recognition and Understanding Workshop
- 2021.12.14
-
- ワークショップ (国際)
- A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
- Yosuke Higuchi (Waseda University), Nanxin Chen (Johns Hopkins University), Yuya Fujita (Yahoo Japan Corporation), Hirofumi Inaguma (Kyoto University), Tatsuya Komatsu, Jaesong Lee (Naver Corporation), Jumon Nozaki (Kyoto University), Tianzi Wang (Johns Hopkins University), Shinji Watanabe (Carnegie Mellon University)
- 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)
- 2021.12.13
-
- その他 (国際)
- A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
- Yosuke Higuchi (Waseda University), Nanxin Chen (Johns Hopkins University), Yuya Fujita, Hirofumi Inaguma (Kyoto University), Tatsuya Komatsu (LINE Corporation), Jaesong Lee (Naver Corporation), Jumon Nozaki (Kyoto University, LINE Corporation), Tianzi Wang (Johns Hopkins University), Shinji Watanabe (Carnegie Mellon University)
- arXiv
- 2021.10.11
-
- カンファレンス (国内)
- Conformer CPCとDeep Cluster を用いたゼロリソース言語のための表現学習
- 前角 高史, Xuankai Chang (カーネギーメロン大学), 藤田 悠哉, Li-Wei Chen (カーネギーメロン大学), 渡部 晋治(カーネギーメロン大学), Alexander Rudnicky (カーネギーメロン大学)
- 日本音響学会2021年秋季研究発表会
- 2021.9.8
-
- カンファレンス (国内)
- 音声意味理解への応用を指向した非自己回帰型End-to-end音声認識
- 大町 基, 藤田 悠哉, 渡部 晋治 (Carnegie Mellon University), Tianzi Wang (Johns Hopkins University)
- 日本音響学会 2021年秋季研究発表会
- 2021.9.7
-
- カンファレンス (国際)
- Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models
- Tianzi Wang (Johns Hopkins University), Yuya Fujita, Xuankai Chang (Carnegie Mellon Univ.), Shinji Watanabe (Carnegie Mellon University)
- INTERSPEECH 2021
- 2021.9.2
-
- カンファレンス (国際)
- Toward Streaming ASR with Non-Autoregressive Insertion-based Model
- Yuya Fujita, Tianzi Wang (Johns Hopkins University), Shinji Watanabe (Carnegie Mellon University), Motoi Omach
- INTERSPEECH 2021
- 2021.9.2
-
- カンファレンス (国際)
- Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021
- Takashi Maekaku, Xuankai Chang (Carnegie Mellon University), Yuya Fujita, Li, Shinji Watanabe (Carnegie Mellon University), Alexander Rudnicky (Carnegie Mellon University)
- INTERSPEECH 2021
- 2021.8.30
-
- その他 (国際)
- Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models
- Tianzi Wang (Johns Hopkins Univ.), Yuya Fujita, Xuankai Chang (Carnegie Mellon Univ.), Shinji Watanabe (Carnegie Mellon Univ.)
- arXiv.org
- 2021.7.20
-
- その他 (国際)
- Toward Streaming ASR with Non-Autoregressive Insertion-based Model
- Yuya Fujita, Tianzi Wang (Johns Hopkins Univ.), Shinji Watanabe (Carnegie Mellon Univ.), Motoi Omachi
- arXiv.org
- 2021.7.16
-
- カンファレンス (国際)
- End-to-end ASR to jointly predict transcriptions and linguistic annotations
- Motoi Omachi, Yuya Fujita, Shinji Watanabe (Johns Hopkins University), Matthew Wiesner (Johns Hopkins University)
- The 2021 North American Chapter of the Association for Computational Linguistics : Human Language Technologies
- 2021.6.6
-
- カンファレンス (国内)
- 挿入操作に基づく End-to-End モデルによる音声認識と音声区間検出
- 藤田 悠哉, 渡部 晋治 (Johns Hopkins Univ.), 大町 基
- 日本音響学会2021年春季研究発表会
- 2021.3.10
-
- その他 (国際)
- End-to-End ASR and Audio Segmentation with Non-autoregressive Insertion-based model
- Yuya Fujita, Shinji Watanabe (Johns Hopkins Univ.), Motoi Omachi
- arXiv.org
- 2020.12.18
-
- カンファレンス (国際)
- End-to-End ASR with Adaptive Span Self-Attention
- Xuankai Chang (Johns Hopkins University), Aswin Shanmugam Subramanian (Johns Hopkins University), Pengcheng Guo (Northwestern Polytechnical University, Johns Hopkins University), Shinji Watanabe (Johns Hopkins University), Yuya Fujita, Motoi Omachi
- INTERSPEECH 2020
- 2020.10.25
-
- カンファレンス (国際)
- Insertion-Based Modeling for End-to-End Automatic Speech Recognition
- Yuya Fujita, Shinji Watanabe (Johns Hopkins University), Motoi Omachi, Xuankai Chang (Johns Hopkins University)
- INTERSPEECH 2020
- 2020.10.25
-
- カンファレンス (国内)
- 単語の表記と素性を同時出力するend-to-end音声認識
- 大町 基, 藤田 悠哉, 渡部 晋治 (Johns Hopkins University), Xuankai Chang (Johns Hopkins University)
- 日本音響学会2020年秋季研究発表会
- 2020.9.11
-
- カンファレンス (国内)
- 挿入操作に基づく End-to-End 音声認識
- 藤田 悠哉, 渡部 晋治 (Johns Hopkins Univ.), 大町 基, Xuankai Chang (Johns Hopkins Univ.)
- 日本音響学会2020年秋季研究発表会
- 2020.9.9
-
- その他 (国際)
- Insertion-Based Modeling for End-to-End Automatic Speech Recognition
- Yuya Fujita, Shinji Watanabe (Johns Hopkins University), Motoi Omachi, Xuankai Chang (Johns Hopkins University)
- arXiv.org
- 2020.5.27