音声処理
-
- カンファレンス (国際)
- Sound Source Localization with Majorization Minimization
- Masahito Togami, Robin Scheilbler
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- 2021.8.30
-
- カンファレンス (国際)
- Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021
- Takashi Maekaku, Xuankai Chang (Carnegie Mellon University), Yuya Fujita, Li, Shinji Watanabe (Carnegie Mellon University), Alexander Rudnicky (Carnegie Mellon University)
- INTERSPEECH 2021
- 2021.8.30
-
- カンファレンス (国際)
- Multi-Source Domain Adaptation with Sinkhorn Barycenter
- Tatsuya Komatsu, Tomoko Matsui (The Institute of Statistical Mathematics), Junbin Gao (The University of Sydney)
- 29th European Signal Processing Conference (EUSIPCO 2021)
- 2021.8.23
-
- カンファレンス (国際)
- Multichannel Separation and Classification of Sound Events
- Robin Scheilbler, Tatsuya Komatsu, Masahito Togami
- 29th European Signal Processing Conference (EUSIPCO 2021)
- 2021.8.23
-
- その他 (国際)
- Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models
- Tianzi Wang (Johns Hopkins Univ.), Yuya Fujita, Xuankai Chang (Carnegie Mellon Univ.), Shinji Watanabe (Carnegie Mellon Univ.)
- arXiv.org
- 2021.7.20
-
- その他 (国際)
- Toward Streaming ASR with Non-Autoregressive Insertion-based Model
- Yuya Fujita, Tianzi Wang (Johns Hopkins Univ.), Shinji Watanabe (Carnegie Mellon Univ.), Motoi Omachi
- arXiv.org
- 2021.7.16
-
- ワークショップ (国際)
- Improved Parallel WaveGAN with perceptually weighted spectrogram loss
- Eunwoo Song (NAVER), Ryuichi Yamamoto, Min-Jae Hwang (NAVER), Jin-Seob Kim (NAVER), Ohsung Kwon (NAVER), Jae-Min Kim (NAVER)
- 2021 IEEE Spoken Language Technology Workshop (SLT) (SLT 2021)
- 2021.6.19
-
- カンファレンス (国際)
- Disentangled Speaker and Language Representations Using Mutual Information Minimization and Domain Adaptation for Cross-Lingual TTS
- Detai Xin (The University of Tokyo), Tatsuya Komatsu, Shinnosuke Takamichi (The University of Tokyo), Hiroshi Saruwatari (The University of Tokyo)
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- 2021.6.6
-
- カンファレンス (国際)
- End To End Learning For Convolutive Multi-Channel Wiener Filtering
- Masahito Togami
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- 2021.6.6
-
- カンファレンス (国際)
- End-to-end ASR to jointly predict transcriptions and linguistic annotations
- Motoi Omachi, Yuya Fujita, Shinji Watanabe (Johns Hopkins University), Matthew Wiesner (Johns Hopkins University)
- The 2021 North American Chapter of the Association for Computational Linguistics : Human Language Technologies (NAACL-HLT2021)
- 2021.6.6
-
- カンファレンス (国際)
- Joint Dereverberation and Separation With Iterative Source Steering
- Taishi Nakashima (Tokyo Metropolitan University), Robin Scheilbler, Masahito Togami, Nobutaka Ono (Tokyo Metropolitan University)
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- 2021.6.6
-
- カンファレンス (国際)
- Parallel Waveform Synthesis Based on Generative Adversarial Networks with Voicing-Aware Conditional Discriminators
- Ryuichi Yamamoto, Eunwoo Song (NAVER), Min-Jae Hwang (Search Solutions Inc), Jae-Min Kim (NAVER)
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- 2021.6.6
-
- カンファレンス (国際)
- Refinement of Direction of Arrival Estimators by Majorization-Minimization Optimization on the Array Manifold
- Robin Scheilbler, Masahito Togami
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- 2021.6.6
-
- カンファレンス (国際)
- Surrogate Source Model Learning for Determined Source Separation
- Robin Scheilbler, Masahito Togami
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- 2021.6.6
-
- カンファレンス (国際)
- TTS-by-TTS: TTS-Driven Data Augmentation for Fast and High-Quality Speech Synthesis
- Min-Jae Hwang (Search Solutions Inc), Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER)
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- 2021.6.6
-
- 論文誌 (国際)
- Independent Vector Analysis via Log-Quadratically Penalized Quadratic Minimization
- Robin Scheibler
- IEEE Transactions on Signal Processing (IEEE TSP)
- 2021.4.9
-
- カンファレンス (国内)
- Attention モデルのTeacher-Forcing を用いた長時間音声とテキストの自動アライメント
- 木田 祐介, 小松 達也, 戸上 真人
- 日本音響学会 2021年春季研究発表会 (ASJ 2021 spring)
- 2021.3.10
-
- カンファレンス (国内)
- ドメイン適応と相互情報量最小化によるdisentangled な話者・言語表現に基づいたクロスリンガル音声合成
- 辛 徳泰 (東京都大学), 小松 達也, 高道 慎之介 (東京都大学), 猿渡 洋 (東京都大学)
- 日本音響学会 2021年春季研究発表会 (ASJ 2021 spring)
- 2021.3.10
-
- カンファレンス (国内)
- ペアデータを必要としない敵対的学習に基づく多チャンネル音源分離
- 中込 優 (早稲田大学), 戸上 真人, 小川 哲司 (早稲田大学), 小林 哲則 (早稲田大学)
- 日本音響学会 2021年春季研究発表会 (ASJ 2021 spring)
- 2021.3.10
-
- カンファレンス (国内)
- 挿入操作に基づく End-to-End モデルによる音声認識と音声区間検出
- 藤田 悠哉, 渡部 晋治 (Johns Hopkins Univ.), 大町 基
- 日本音響学会2021年春季研究発表会
- 2021.3.10