音声処理
-
- カンファレンス (国際)
- Surrogate Source Model Learning for Determined Source Separation
- Robin Scheilbler, Masahito Togami
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- 2021.6.6
-
- カンファレンス (国際)
- TTS-by-TTS: TTS-Driven Data Augmentation for Fast and High-Quality Speech Synthesis
- Min-Jae Hwang (Search Solutions Inc), Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER)
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- 2021.6.6
-
- 論文誌 (国際)
- Independent Vector Analysis via Log-Quadratically Penalized Quadratic Minimization
- Robin Scheibler
- IEEE Transactions on Signal Processing (IEEE TSP)
- 2021.4.9
-
- カンファレンス (国内)
- Attention モデルのTeacher-Forcing を用いた長時間音声とテキストの自動アライメント
- 木田 祐介, 小松 達也, 戸上 真人
- 日本音響学会 2021年春季研究発表会 (ASJ 2021 spring)
- 2021.3.10
-
- カンファレンス (国内)
- ドメイン適応と相互情報量最小化によるdisentangled な話者・言語表現に基づいたクロスリンガル音声合成
- 辛 徳泰 (東京都大学), 小松 達也, 高道 慎之介 (東京都大学), 猿渡 洋 (東京都大学)
- 日本音響学会 2021年春季研究発表会 (ASJ 2021 spring)
- 2021.3.10
-
- カンファレンス (国内)
- ペアデータを必要としない敵対的学習に基づく多チャンネル音源分離
- 中込 優 (早稲田大学), 戸上 真人, 小川 哲司 (早稲田大学), 小林 哲則 (早稲田大学)
- 日本音響学会 2021年春季研究発表会 (ASJ 2021 spring)
- 2021.3.10
-
- カンファレンス (国内)
- 挿入操作に基づく End-to-End モデルによる音声認識と音声区間検出
- 藤田 悠哉, 渡部 晋治 (Johns Hopkins Univ.), 大町 基
- 日本音響学会2021年春季研究発表会
- 2021.3.10
-
- カンファレンス (国際)
- Deep Multi-channel Speech Source Separation with Time-frequency Masking for Spatially Filtered Microphone Input Signal
- Masahito Togami
- 28th European Signal Processing Conference (EUSIPCO 2020)
- 2021.1.18
-
- カンファレンス (国際)
- Robust Acoustic Scene Classification to Multiple Devices Using Maximum Classifier Discrepancy and Knowledge Distillation
- Saori Takeyama (Tokyo Institute of Technology), Tatsuya Komatsu, Koichi Miyazaki (Nagoya University), Masahito Togami, Shunsuke Ono (Tokyo Institute of Technology)
- 28th European Signal Processing Conference (EUSIPCO 2020)
- 2021.1.18
-
- カンファレンス (国際)
- Sound Event Localization and Detection using a Recurrent Convolutional Neural Network and Gated Linear Unit
- Tatsuya Komatsu, Masahito Togami, Tsubasa Takahashi
- 28th European Signal Processing Conference (EUSIPCO 2020)
- 2021.1.18
-
- ワークショップ (国際)
- Disentangling Clustered Representations of Variational Autoencoders for Generating Diverse Samples
- Tsubasa Takahashi, Tatsuya Komatsu, Koki Yamada (Tokyo University of Agriculture and Technology)
- Learning Data Representation for Clustering (LDRC at IJCAI 2020)
- 2021.1.7
-
- その他 (国際)
- End-to-End ASR and Audio Segmentation with Non-autoregressive Insertion-based model
- Yuya Fujita, Shinji Watanabe (Johns Hopkins Univ.), Motoi Omachi
- arXiv.org
- 2020.12.18
-
- カンファレンス (国際)
- A Study on More Realistic Room Simulation for Far-Field Keyword Spotting
- Eric Bezzam (EPFL/Sonos), Robin Scheibler, Cyril Cadoux (EPFL), Thibault Gisselbrecht (Sonos)
- Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020 (APSIPA 2020)
- 2020.12.7
-
- カンファレンス (国際)
- Computer-Resource-Aware Deep Speech Separation with a Run-Time-Specified Number of BLSTM Layers
- Masahito Togami, Yoshiki Masuyama (Waseda University), Tatsuya Komatsu, Kazuyoshi Yoshii (Kyoto University), Tatsuya Kawahara (Kyoto University)
- Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020 (APSIPA ASC 2020)
- 2020.12.7
-
- カンファレンス (国際)
- Integration of Semi-Blind Speech Source Separation and Voice Activity Detection for Flexible Spoken Dialogue
- Masaya Wake (Kyoto University), Masahito Togami, Kazuyoshi Yoshii (Kyoto University), Tatsuya Kawahara (Kyoto University)
- Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020 (APSIPA ASC 2020)
- 2020.12.7
-
- カンファレンス (国際)
- Over-determined Speech Source Separation and Dereverberation
- Masahito Togami, Robin Scheibler
- Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020 (APSIPA ASC 2020)
- 2020.12.7
-
- 論文誌 (国際)
- Innovation, Standardization, and Business Success in Media Signal Processing
- Akihiko Sugiyama and Masahiro Serizawa (NEC Corporation)
- Institute of Electrical and Electronics Engineers, Consumer Electronics Magazine (MCE)
- 2020.11.3
-
- ワークショップ (国際)
- Conformer-based sound event detection with semi-supervised learning and data augmentation
- Koichi Miyazaki (Nagoya University), Tatsuya Komatsu, Tomoki Hayashi (Human Dataware Lab), Shinji Watanabe (Johns Hopkins University), Tomoki Toda (Nagoya University), Kazuya Takeda (Nagoya University)
- Detection and Classification of Acoustic Scenes and Events (DCASE 2020)
- 2020.11.2
-
- カンファレンス (国際)
- End-to-End ASR with Adaptive Span Self-Attention
- Xuankai Chang (Johns Hopkins University), Aswin Shanmugam Subramanian (Johns Hopkins University), Pengcheng Guo (Northwestern Polytechnical University, Johns Hopkins University), Shinji Watanabe (Johns Hopkins University), Yuya Fujita, Motoi Omachi
- INTERSPEECH 2020
- 2020.10.25
-
- カンファレンス (国際)
- Generalized Minimal Distortion Principle for Blind Source Separation
- Robin Scheibler
- The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
- 2020.10.25