Speech Processing
-
- OTHERS (INTERNATIONAL)
- End-to-End ASR and Audio Segmentation with Non-autoregressive Insertion-based model
- Yuya Fujita, Shinji Watanabe (Johns Hopkins Univ.), Motoi Omachi
- arXiv.org
- December 18, 2020
-
- CONFERENCE (INTERNATIONAL)
- A Study on More Realistic Room Simulation for Far-Field Keyword Spotting
- Eric Bezzam (EPFL/Sonos), Robin Scheibler, Cyril Cadoux (EPFL), Thibault Gisselbrecht (Sonos)
- Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020 (APSIPA 2020)
- December 07, 2020
-
- CONFERENCE (INTERNATIONAL)
- Computer-Resource-Aware Deep Speech Separation with a Run-Time-Specified Number of BLSTM Layers
- Masahito Togami, Yoshiki Masuyama (Waseda University), Tatsuya Komatsu, Kazuyoshi Yoshii (Kyoto University), Tatsuya Kawahara (Kyoto University)
- Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020 (APSIPA ASC 2020)
- December 07, 2020
-
- CONFERENCE (INTERNATIONAL)
- Integration of Semi-Blind Speech Source Separation and Voice Activity Detection for Flexible Spoken Dialogue
- Masaya Wake (Kyoto University), Masahito Togami, Kazuyoshi Yoshii (Kyoto University), Tatsuya Kawahara (Kyoto University)
- Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020 (APSIPA ASC 2020)
- December 07, 2020
-
- CONFERENCE (INTERNATIONAL)
- Over-determined Speech Source Separation and Dereverberation
- Masahito Togami, Robin Scheibler
- Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020 (APSIPA ASC 2020)
- December 07, 2020
-
- JOURNAL (INTERNATIONAL)
- Innovation, Standardization, and Business Success in Media Signal Processing
- Akihiko Sugiyama and Masahiro Serizawa (NEC Corporation)
- Institute of Electrical and Electronics Engineers, Consumer Electronics Magazine (MCE)
- November 03, 2020
-
- WORKSHOP (INTERNATIONAL)
- Conformer-based sound event detection with semi-supervised learning and data augmentation
- Koichi Miyazaki (Nagoya University), Tatsuya Komatsu, Tomoki Hayashi (Human Dataware Lab), Shinji Watanabe (Johns Hopkins University), Tomoki Toda (Nagoya University), Kazuya Takeda (Nagoya University)
- Detection and Classification of Acoustic Scenes and Events (DCASE 2020)
- November 02, 2020
-
- CONFERENCE (INTERNATIONAL)
- End-to-End ASR with Adaptive Span Self-Attention
- Xuankai Chang (Johns Hopkins University), Aswin Shanmugam Subramanian (Johns Hopkins University), Pengcheng Guo (Northwestern Polytechnical University, Johns Hopkins University), Shinji Watanabe (Johns Hopkins University), Yuya Fujita, Motoi Omachi
- INTERSPEECH 2020
- October 25, 2020
-
- CONFERENCE (INTERNATIONAL)
- Generalized Minimal Distortion Principle for Blind Source Separation
- Robin Scheibler
- The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
- October 25, 2020
-
- CONFERENCE (INTERNATIONAL)
- Insertion-Based Modeling for End-to-End Automatic Speech Recognition
- Yuya Fujita, Shinji Watanabe (Johns Hopkins University), Motoi Omachi, Xuankai Chang (Johns Hopkins University)
- INTERSPEECH 2020
- October 25, 2020
-
- CONFERENCE (INTERNATIONAL)
- Mentoring-Reverse Mentoring for Unsupervised Multi-channel Speech Source Separation
- Yu Nakagome (Waseda University), Masahito Togami, Tetsuji Ogawa (Waseda University), Tetsunori Kobayashi (Waseda University)
- The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
- October 25, 2020
-
- CONFERENCE (INTERNATIONAL)
- Neural Text-to-Speech with a Modeling-by-Generation Excitation Vocoder
- Eunwoo Song (NAVER), Min-Jae Hwang (Search Solutions Inc.), Ryuichi Yamamoto, Jin-Seob Kim (NAVER), Ohsung Kwon (NAVER), Jae-Min Kim (NAVER)
- The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
- October 25, 2020
-
- CONFERENCE (INTERNATIONAL)
- Sparseness-Aware DOA Estimation with Majorization Minimization
- Masahito Togami, Robin Scheibler
- The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
- October 25, 2020
-
- CONFERENCE (DOMESTIC)
- 単語の表記と素性を同時出力するend-to-end音声認識
- 大町 基, 藤田 悠哉, 渡部 晋治 (Johns Hopkins University), Xuankai Chang (Johns Hopkins University)
- 日本音響学会2020年秋季研究発表会 (音響学会)
- September 11, 2020
-
- CONFERENCE (DOMESTIC)
- A Generalized Minimal Distortion Principle to Solve the Scale Ambiguity in Blind Source Separation
- シャイブラー ロビン
- 日本音響学会 2020年秋季研究発表会 (ASJ 2020 autumn)
- September 09, 2020
-
- CONFERENCE (DOMESTIC)
- Mentoring-Reverse Mentoring:多チャンネル音源分離における教師なし学習のための知識伝搬フレームワーク
- 中込 優 (早稲田大学), 戸上 真人, 小川 哲司 (早稲田大学), 小林 哲則 (早稲田大学)
- 日本音響学会 2020年秋季研究発表会 (ASJ 2020 autumn)
- September 09, 2020
-
- CONFERENCE (DOMESTIC)
- 挿入操作に基づく End-to-End 音声認識
- 藤田 悠哉, 渡部 晋治 (Johns Hopkins Univ.), 大町 基, Xuankai Chang (Johns Hopkins Univ.)
- 日本音響学会2020年秋季研究発表会 (音響学会)
- September 09, 2020
-
- OTHERS (INTERNATIONAL)
- Insertion-Based Modeling for End-to-End Automatic Speech Recognition
- Yuya Fujita, Shinji Watanabe (Johns Hopkins University), Motoi Omachi, Xuankai Chang (Johns Hopkins University)
- arXiv.org
- May 27, 2020
-
- CONFERENCE (INTERNATIONAL)
- Attention-based ASR with Lightweight and Dynamic Convolutions
- Yuya Fujita, Aswin Shanmugam Subramanian (Johns Hopkins University), Motoi Omachi, Shinji Watanabe (Johns Hopkins University)
- 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020)
- May 08, 2020
-
- CONFERENCE (INTERNATIONAL)
- Consistency-Aware Multi-Channel Speech Enhancement Using Deep Neural Networks
- Yoshiki Masuyama (Waseda University), Masahito Togami, Tatsuya Komatsu
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020