People
前角 高史 Takashi Maekaku ソフトウェアエンジニア
2012年にヤフー株式会社(現LINEヤフー株式会社)に入社。音声認識を中心に音声処理関連の研究開発に取り組み、関連プロダクトの改善にも注力しています。
Publications
-
- カンファレンス (国内)
- トピックモデルを用いた教師なし学習によるHuBERTの意味表現向上
- 前角 高史, Jiatong Shi (カーネギーメロン大学), Xuankai Chang (カーネギーメロン大学), 藤田 悠哉, 渡部 晋治 (カーネギーメロン大学)
- 日本音響学会 2024年秋季研究発表会
- 2024.9.4
-
- 論文誌 (国際)
- MC-Whisper: Extending Speech Foundation Models to Multichannel Distant Speech Recognition
- Xuankai Chang (Carnegie Mellon University), Pengcheng Guo (Northwestern Polytechnical University), Yuya Fujita, Takashi Maekaku, Shinji Watanabe (Carnegie Mellon University)
- IEEE Signal Processing Letters
- 2024.8.26
-
- その他 (国際)
- LV-CTC: Non-autoregressive ASR with CTC and latent variable models
- Yuya Fujita, Shinji Watanabe (Carnegie Mellon Univ.), Xuankai Chang (Carnegie Mellon Univ.), Takashi Maekaku
- arXiv.org
- 2024.3.28
-
- カンファレンス (国際)
- Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
- Xuankai Chang (Carnegie Mellon University), Brian Yan (Carnegie Mellon University), Kwanghee Choi (Carnegie Mellon University), Jee-Weon Jung (Carnegie Mellon University), Yichen Lu (Carnegie Mellon University), Soumi Maiti (Carnegie Mellon University), Roshan Sharma (Carnegie Mellon University), Jiatong Shi (Carnegie Mellon University), Jinchuan Tian (Carnegie Mellon University), Shinji Watanabe (Carnegie Mellon University), Yuya Fujita, Takashi Maekaku, Pengcheng Guo (Northwestern Polytechnical University), Yao-Fei Cheng (University of Washington), Pavel Denisov (University of Stuttgart), Kohei Saijo (Waseda University), Hsiu-Hsuan Wang (National Taiwan University)
- 2024 IEEE International Conference on Acoustics, Speech and Signal Processing
- 2024.3.20
-
- カンファレンス (国際)
- Hubertopic: Enhancing Semantic Representation of Hubert Through Self-Supervision Utilizing Topic Model
- Takashi Maekaku, Jiatong Shi (Carnegie Mellon University), Xuankai Chang (Carnegie Mellon University), Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- 2024 IEEE International Conference on Acoustics, Speech and Signal Processing
- 2024.3.20
-
- カンファレンス (国際)
- LV-CTC: Non-autoregressive ASR with CTC and Latent Variable Models
- Yuya Fujita, Shinji Watanabe (Carnegie Mellon Univ.), Xuankai Chang (Carnegie Mellon Univ.), Takashi Maekaku
- The 2023 IEEE Workshop on Automatic Speech Recognition and Understanding
- 2023.12.16
-
- その他 (国際)
- HuBERTopic: Enhancing Semantic Representation of HuBERT through Self-supervision Utilizing Topic Model
- Takashi Maekaku, Jiatong Shi (Carnegie Mellon University), Xuankai Chang (Carnegie Mellon University), Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- arXiv
- 2023.10.9
-
- カンファレンス (国際)
- Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
- Xuankai Chang (Carnegie Mellon University), Brian Yan (Carnegie Mellon University), Yuya Fujita, Takashi Maekaku, Shinji Watanabe (Carnegie Mellon University)
- The 24th Annual Conference of the International Speech Communication Association
- 2023.8.20
-
- カンファレンス (国際)
- Fully Unsupervised Topic Clustering of Unlabelled Spoken Audio Using Self-Supervised Representation Learning and Topic Model
- Takashi Maekaku, Yuya Fujita, Xuankai Chang (Carnegie Mellon University), Shinji Watanabe (Carnegie Mellon University)
- The International Conference on Acoustics, Speech, & Signal Processing 2023
- 2023.6.7
-
- その他 (国際)
- Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
- Xuankai Chang (Carnegie Mellon University), Brian Yan (Carnegie Mellon University), Yuya Fujita, Takashi Maekaku, Shinji Watanabe (Carnegie Mellon University)
- arXiv
- 2023.5.29
-
- カンファレンス (国内)
- Transformerを用いた音声認識モデルにおける事前分布を用いた注意重みの平滑化の検討
- 前角 高史, 藤田 悠哉, Yifang Peng (Carnegie Mellon University), 渡部 晋治 (Carnegie Mellon University)
- 日本音響学会2023年春季研究発表会
- 2023.3.16
-
- カンファレンス (国際)
- Attention Weight Smoothing Using Prior Distributions for Transformer-Based End-to-End ASR
- Takashi Maekaku, Yuya Fujita, Yifan Peng (Carnegie Mellon University), Shinji Watanabe (Carnegie Mellon University)
- The 23rd Annual Conference of the International Speech Communication Association
- 2022.9.19
-
- カンファレンス (国際)
- End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
- Xuankai Chang (Carnegie Mellon University), Takashi Maekaku, Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- The 23rd Annual Conference of the International Speech Communication Association
- 2022.9.19
-
- カンファレンス (国際)
- An Exploration of Hubert with Large Number of Cluster Units and Model Assessment Using Bayesian Information Criterion
- Takashi Maekaku, Xuankai Chang (Carnegie Mellon University), Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- The International Conference on Acoustics, Speech, & Signal Processing 2022
- 2022.5.10
-
- その他 (国際)
- End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation
- Xuankai Chang (Carnegie Mellon University), Takashi Maekaku, Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- arXiv.org
- 2022.4.1
-
- ワークショップ (国際)
- An Exploration of Self-Supervised Pretrained Representations for End-to-End Speech Recognition
- Xuankai Chang (Carnegie Mellon University), Takashi Maekaku, Pengcheng Guo (Northwestern Polytechnical University), Jing Shi (Institute of Automation, Chinese Academy of Sciences), Yen, Aswin Shanmugam Subramanian (Johns Hopkins University), Tianzi Wang (Johns Hopkins University), Shu, Yu Tsao (Academia Sinica), Hung, Shinji Watanabe (Carnegie Mellon University)
- IEEE Automatic Speech Recognition and Understanding Workshop 2021
- 2021.12.15
-
- カンファレンス (国内)
- Conformer CPCとDeep Cluster を用いたゼロリソース言語のための表現学習
- 前角 高史, Xuankai Chang (カーネギーメロン大学), 藤田 悠哉, Li-Wei Chen (カーネギーメロン大学), 渡部 晋治(カーネギーメロン大学), Alexander Rudnicky (カーネギーメロン大学)
- 日本音響学会2021年秋季研究発表会
- 2021.9.8
-
- カンファレンス (国際)
- Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021
- Takashi Maekaku, Xuankai Chang (Carnegie Mellon University), Yuya Fujita, Li, Shinji Watanabe (Carnegie Mellon University), Alexander Rudnicky (Carnegie Mellon University)
- INTERSPEECH 2021
- 2021.8.30
-
- カンファレンス (国際)
- Simultaneous Detection and Localization of a Wake-Up Word using Multi-Task Learning of the Duration and Endpoint
- Takashi Maekaku, Yusuke Kida, Akihiko Sugiyama
- The 20th Annual Conference of the International Speech Communication Association
- 2019.9.19