音声処理
-
- カンファレンス (国際)
- Deep Multi-channel Speech Source Separation with Time-frequency Masking for Spatially Filtered Microphone Input Signal
- Masahito Togami
- 28th European Signal Processing Conference (EUSIPCO 2020)
- 2021.1.18
-
- カンファレンス (国際)
- Robust Acoustic Scene Classification to Multiple Devices Using Maximum Classifier Discrepancy and Knowledge Distillation
- Saori Takeyama (Tokyo Institute of Technology), Tatsuya Komatsu, Koichi Miyazaki (Nagoya University), Masahito Togami, Shunsuke Ono (Tokyo Institute of Technology)
- 28th European Signal Processing Conference (EUSIPCO 2020)
- 2021.1.18
-
- カンファレンス (国際)
- Sound Event Localization and Detection using a Recurrent Convolutional Neural Network and Gated Linear Unit
- Tatsuya Komatsu, Masahito Togami, Tsubasa Takahashi
- 28th European Signal Processing Conference (EUSIPCO 2020)
- 2021.1.18
-
- ワークショップ (国際)
- Disentangling Clustered Representations of Variational Autoencoders for Generating Diverse Samples
- Tsubasa Takahashi, Tatsuya Komatsu, Koki Yamada (Tokyo University of Agriculture and Technology)
- Learning Data Representation for Clustering (LDRC at IJCAI 2020)
- 2021.1.7
-
- その他 (国際)
- End-to-End ASR and Audio Segmentation with Non-autoregressive Insertion-based model
- Yuya Fujita, Shinji Watanabe (Johns Hopkins Univ.), Motoi Omachi
- arXiv.org
- 2020.12.18
-
- カンファレンス (国際)
- A Study on More Realistic Room Simulation for Far-Field Keyword Spotting
- Eric Bezzam (EPFL/Sonos), Robin Scheibler, Cyril Cadoux (EPFL), Thibault Gisselbrecht (Sonos)
- Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020 (APSIPA 2020)
- 2020.12.7
-
- カンファレンス (国際)
- Computer-Resource-Aware Deep Speech Separation with a Run-Time-Specified Number of BLSTM Layers
- Masahito Togami, Yoshiki Masuyama (Waseda University), Tatsuya Komatsu, Kazuyoshi Yoshii (Kyoto University), Tatsuya Kawahara (Kyoto University)
- Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020 (APSIPA ASC 2020)
- 2020.12.7
-
- カンファレンス (国際)
- Integration of Semi-Blind Speech Source Separation and Voice Activity Detection for Flexible Spoken Dialogue
- Masaya Wake (Kyoto University), Masahito Togami, Kazuyoshi Yoshii (Kyoto University), Tatsuya Kawahara (Kyoto University)
- Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020 (APSIPA ASC 2020)
- 2020.12.7
-
- カンファレンス (国際)
- Over-determined Speech Source Separation and Dereverberation
- Masahito Togami, Robin Scheibler
- Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020 (APSIPA ASC 2020)
- 2020.12.7
-
- 論文誌 (国際)
- Innovation, Standardization, and Business Success in Media Signal Processing
- Akihiko Sugiyama and Masahiro Serizawa (NEC Corporation)
- Institute of Electrical and Electronics Engineers, Consumer Electronics Magazine (MCE)
- 2020.11.3
-
- ワークショップ (国際)
- Conformer-based sound event detection with semi-supervised learning and data augmentation
- Koichi Miyazaki (Nagoya University), Tatsuya Komatsu, Tomoki Hayashi (Human Dataware Lab), Shinji Watanabe (Johns Hopkins University), Tomoki Toda (Nagoya University), Kazuya Takeda (Nagoya University)
- Detection and Classification of Acoustic Scenes and Events (DCASE 2020)
- 2020.11.2
-
- カンファレンス (国際)
- End-to-End ASR with Adaptive Span Self-Attention
- Xuankai Chang (Johns Hopkins University), Aswin Shanmugam Subramanian (Johns Hopkins University), Pengcheng Guo (Northwestern Polytechnical University, Johns Hopkins University), Shinji Watanabe (Johns Hopkins University), Yuya Fujita, Motoi Omachi
- INTERSPEECH 2020
- 2020.10.25
-
- カンファレンス (国際)
- Generalized Minimal Distortion Principle for Blind Source Separation
- Robin Scheibler
- The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
- 2020.10.25
-
- カンファレンス (国際)
- Insertion-Based Modeling for End-to-End Automatic Speech Recognition
- Yuya Fujita, Shinji Watanabe (Johns Hopkins University), Motoi Omachi, Xuankai Chang (Johns Hopkins University)
- INTERSPEECH 2020
- 2020.10.25
-
- カンファレンス (国際)
- Mentoring-Reverse Mentoring for Unsupervised Multi-channel Speech Source Separation
- Yu Nakagome (Waseda University), Masahito Togami, Tetsuji Ogawa (Waseda University), Tetsunori Kobayashi (Waseda University)
- The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
- 2020.10.25
-
- カンファレンス (国際)
- Neural Text-to-Speech with a Modeling-by-Generation Excitation Vocoder
- Eunwoo Song (NAVER), Min-Jae Hwang (Search Solutions Inc.), Ryuichi Yamamoto, Jin-Seob Kim (NAVER), Ohsung Kwon (NAVER), Jae-Min Kim (NAVER)
- The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
- 2020.10.25
-
- カンファレンス (国際)
- Sparseness-Aware DOA Estimation with Majorization Minimization
- Masahito Togami, Robin Scheibler
- The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
- 2020.10.25
-
- カンファレンス (国内)
- 単語の表記と素性を同時出力するend-to-end音声認識
- 大町 基, 藤田 悠哉, 渡部 晋治 (Johns Hopkins University), Xuankai Chang (Johns Hopkins University)
- 日本音響学会2020年秋季研究発表会 (音響学会)
- 2020.9.11
-
- カンファレンス (国内)
- A Generalized Minimal Distortion Principle to Solve the Scale Ambiguity in Blind Source Separation
- シャイブラー ロビン
- 日本音響学会 2020年秋季研究発表会 (ASJ 2020 autumn)
- 2020.9.9
-
- カンファレンス (国内)
- Mentoring-Reverse Mentoring:多チャンネル音源分離における教師なし学習のための知識伝搬フレームワーク
- 中込 優 (早稲田大学), 戸上 真人, 小川 哲司 (早稲田大学), 小林 哲則 (早稲田大学)
- 日本音響学会 2020年秋季研究発表会 (ASJ 2020 autumn)
- 2020.9.9