音声処理
-
- カンファレンス (国内)
- Conformer CPCとDeep Cluster を用いたゼロリソース言語のための表現学習
- 前角 高史, Xuankai Chang (カーネギーメロン大学), 藤田 悠哉, Li-Wei Chen (カーネギーメロン大学), 渡部 晋治(カーネギーメロン大学), Alexander Rudnicky (カーネギーメロン大学)
- 日本音響学会2021年秋季研究発表会
- 2021.9.8
-
- カンファレンス (国内)
- CTC を用いた音声認識のための中間層予測による条件づけ
- 野崎 樹文 (京都大学), 小松 達也
- 日本音響学会 2021年秋季研究発表会 (ASJ 2021 autumn)
- 2021.9.7
-
- カンファレンス (国内)
- テキスト音声合成のための CycleGAN 声質変換を用いたデータ拡張の検討
- 寺島 涼, 山本 龍一, 橘 健太郎
- 日本音響学会 2021年秋季研究発表会 (ASJ 2021 autumn)
- 2021.9.7
-
- カンファレンス (国内)
- 音声意味理解への応用を指向した非自己回帰型End-to-end音声認識
- 大町 基, 藤田 悠哉, 渡部 晋治 (Carnegie Mellon University), Tianzi Wang (Johns Hopkins University)
- 日本音響学会 2021年秋季研究発表会 (音響学会)
- 2021.9.7
-
- カンファレンス (国際)
- Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models
- Tianzi Wang (Johns Hopkins University), Yuya Fujita, Xuankai Chang (Carnegie Mellon Univ.), Shinji Watanabe (Carnegie Mellon University)
- INTERSPEECH 2021
- 2021.9.2
-
- カンファレンス (国際)
- Toward Streaming ASR with Non-Autoregressive Insertion-based Model
- Yuya Fujita, Tianzi Wang (Johns Hopkins University), Shinji Watanabe (Carnegie Mellon University), Motoi Omach
- INTERSPEECH 2021
- 2021.9.2
-
- カンファレンス (国際)
- Acoustic Event Detection with Classifier Chains
- Tatsuya Komatsu, Shinji Watanabe (Carnegie Mellon University), Koichi Miyazaki (Nagoya University), Tomoki Hayashi (Human Dataware Lab.)
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- 2021.8.30
-
- カンファレンス (国際)
- Efficient and Stable Adversarial Learning Using Unpaired Data for Unsupervised Multichannel Speech Separation
- Yu Nakagome (Waseda University), Masahito Togami, Tetsuji Ogawa (Waseda University), Tetsunori Kobayashi (Waseda University)
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- 2021.8.30
-
- カンファレンス (国際)
- High-fidelity Parallel WaveGAN with Multi-band Harmonic-plus-Noise Model
- Min-Jae Hwang (Search Solutions Inc), Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER)
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- 2021.8.30
-
- カンファレンス (国際)
- Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis
- Kosuke Futamata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- 2021.8.30
-
- カンファレンス (国際)
- Relaxing the Conditional Independence Assumption of CTC-based ASR by Conditioning on Intermediate Predictions
- Jumon Nozaki (Kyoto University), Tatsuya Komatsu
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- 2021.8.30
-
- カンファレンス (国際)
- Sound Source Localization with Majorization Minimization
- Masahito Togami, Robin Scheilbler
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- 2021.8.30
-
- カンファレンス (国際)
- Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021
- Takashi Maekaku, Xuankai Chang (Carnegie Mellon University), Yuya Fujita, Li, Shinji Watanabe (Carnegie Mellon University), Alexander Rudnicky (Carnegie Mellon University)
- INTERSPEECH 2021
- 2021.8.30
-
- カンファレンス (国際)
- Multi-Source Domain Adaptation with Sinkhorn Barycenter
- Tatsuya Komatsu, Tomoko Matsui (The Institute of Statistical Mathematics), Junbin Gao (The University of Sydney)
- 29th European Signal Processing Conference (EUSIPCO 2021)
- 2021.8.23
-
- カンファレンス (国際)
- Multichannel Separation and Classification of Sound Events
- Robin Scheilbler, Tatsuya Komatsu, Masahito Togami
- 29th European Signal Processing Conference (EUSIPCO 2021)
- 2021.8.23
-
- その他 (国際)
- Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models
- Tianzi Wang (Johns Hopkins Univ.), Yuya Fujita, Xuankai Chang (Carnegie Mellon Univ.), Shinji Watanabe (Carnegie Mellon Univ.)
- arXiv.org
- 2021.7.20
-
- その他 (国際)
- Toward Streaming ASR with Non-Autoregressive Insertion-based Model
- Yuya Fujita, Tianzi Wang (Johns Hopkins Univ.), Shinji Watanabe (Carnegie Mellon Univ.), Motoi Omachi
- arXiv.org
- 2021.7.16
-
- ワークショップ (国際)
- Improved Parallel WaveGAN with perceptually weighted spectrogram loss
- Eunwoo Song (NAVER), Ryuichi Yamamoto, Min-Jae Hwang (NAVER), Jin-Seob Kim (NAVER), Ohsung Kwon (NAVER), Jae-Min Kim (NAVER)
- 2021 IEEE Spoken Language Technology Workshop (SLT) (SLT 2021)
- 2021.6.19
-
- カンファレンス (国際)
- Disentangled Speaker and Language Representations Using Mutual Information Minimization and Domain Adaptation for Cross-Lingual TTS
- Detai Xin (The University of Tokyo), Tatsuya Komatsu, Shinnosuke Takamichi (The University of Tokyo), Hiroshi Saruwatari (The University of Tokyo)
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- 2021.6.6
-
- カンファレンス (国際)
- End To End Learning For Convolutive Multi-Channel Wiener Filtering
- Masahito Togami
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- 2021.6.6