Speech Processing
-
- Journal (International)
- Computationally-Efficient Overdetermined Blind Source Separation Based on Iterative Source Steering
- Yicheng Du (Kyoto University), Robin Scheibler, Masahito Togami, Kazuyoshi Yoshii (Kyoto University), Tatsuya Kawahara (Kyoto University)
- IEEE Signal Processing Letters (IEEE SPL)
- 2021.12.13
-
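- Other (Domestic)
- Report on Participation in the International Conference Interspeech 2021
- Tomohiro Tanaka (NTT), Ryuichi Yamamoto
- 251st Natural Language Processing and 139th Spoken Language Processing Joint Research Meeting (SLP/NL 2021)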
- 2021.11.24
-
- Other (International)
- A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
- Yosuke Higuchi (Waseda University), Nanxin Chen (Johns Hopkins University), Yuya Fujita, Hirofumi Inaguma (Kyoto University), Tatsuya Komatsu (LINE Corporation), Jaesong Lee (Naver Corporation), Jumon Nozaki (Kyoto University, LINE Corporation), Tianzi Wang (Johns Hopkins University), Shinji Watanabe (Carnegie Mellon University)
- arXiv
- 2021.10.11
-
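- Conference (Domestic)
- Representation Learning for Zero-Resource Languages Using Conformer CPC and Deep Cluster
- Takashi Maekaku, Xuankai Chang (Carnegie Mellon University), Yuya Fujita, Li-Wei Chen (Carnegie Mellon University), Shinji Watanabe (Carnegie Mellon University), Alexander Rudnicky (Carnegie Mellon University)
- Acoustical Society of Japan 2021 Autumn Meeting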
- 2021.9.8
-
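- Conference (Domestic)
- Conditioning on Intermediate-Layer Predictions for CTC-Based Speech Recognition
- Jumon Nozaki (Kyoto University), Tatsuya Komatsu
- Acoustical Society of Japan 2021 Autumn Meeting (ASJ 2021 autumn)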
- 2021.9.7
-
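- Conference (Domestic)
- A Study of Data Augmentation Using CycleGAN Voice Conversion for Text-to-Speech Synthesis
- Ryo Terashima, Ryuichi Yamamoto, Kentaro Tachibana
- Acoustical Society of Japan 2021 Autumn Meeting (ASJ 2021 autumn)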
- 2021.9.7
-
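- Conference (Domestic)
- Non-Autoregressive End-to-End Speech Recognition Oriented Toward Spoken Language Understanding Applications
- Motoi Omachi, Yuya Fujita, Shinji Watanabe (Carnegie Mellon University), Tianzi Wang (Johns Hopkins University)
- Acoustical Society of Japan 2021 Autumn Meeting (ASJ)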
- 2021.9.7
-
- Conference (International)
- Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models
- Tianzi Wang (Johns Hopkins University), Yuya Fujita, Xuankai Chang (Carnegie Mellon University), Shinji Watanabe (Carnegie Mellon University)
- INTERSPEECH 2021
- 2021.9.2
-
- Conference (International)
- Toward Streaming ASR with Non-Autoregressive Insertion-based Model
- Yuya Fujita, Tianzi Wang (Johns Hopkins University), Shinji Watanabe (Carnegie Mellon University), Motoi Omachi
- INTERSPEECH 2021
- 2021.9.2
-
- Conference (International)
- Acoustic Event Detection with Classifier Chains
- Tatsuya Komatsu, Shinji Watanabe (Carnegie Mellon University), Koichi Miyazaki (Nagoya University), Tomoki Hayashi (Human Dataware Lab.)
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- 2021.8.30
-
- Conference (International)
- Efficient and Stable Adversarial Learning Using Unpaired Data for Unsupervised Multichannel Speech Separation
- Yu Nakagome (Waseda University), Masahito Togami, Tetsuji Ogawa (Waseda University), Tetsunori Kobayashi (Waseda University)
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- 2021.8.30
-
- Conference (International)
- High-fidelity Parallel WaveGAN with Multi-band Harmonic-plus-Noise Model
- Min-Jae Hwang (Search Solutions Inc), Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER)
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- 2021.8.30
-
- Conference (International)
- Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis
- Kosuke Futamata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- 2021.8.30
-
- Conference (International)
- Relaxing the Conditional Independence Assumption of CTC-based ASR by Conditioning on Intermediate Predictions
- Jumon Nozaki (Kyoto University), Tatsuya Komatsu
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- 2021.8.30
-
- Conference (International)
- Sound Source Localization with Majorization Minimization
- Masahito Togami, Robin Scheibler
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- 2021.8.30
-
- Conference (International)
- Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021
- Takashi Maekaku, Xuankai Chang (Carnegie Mellon University), Yuya Fujita, Li-Wei Chen (Carnegie Mellon University), Shinji Watanabe (Carnegie Mellon University), Alexander Rudnicky (Carnegie Mellon University)
- INTERSPEECH 2021
- 2021.8.30
-
- Conference (International)
- Multi-Source Domain Adaptation with Sinkhorn Barycenter
- Tatsuya Komatsu, Tomoko Matsui (The Institute of Statistical Mathematics), Junbin Gao (The University of Sydney)
- 29th European Signal Processing Conference (EUSIPCO 2021)
- 2021.8.23
-
- Conference (International)
- Multichannel Separation and Classification of Sound Events
- Robin Scheibler, Tatsuya Komatsu, Masahito Togami
- 29th European Signal Processing Conference (EUSIPCO 2021)
- 2021.8.23
-
- Other (International)
- Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models
- Tianzi Wang (Johns Hopkins Univ.), Yuya Fujita, Xuankai Chang (Carnegie Mellon Univ.), Shinji Watanabe (Carnegie Mellon Univ.)
- arXiv.org
- 2021.7.20
-
- Other (International)
- Toward Streaming ASR with Non-Autoregressive Insertion-based Model
- Yuya Fujita, Tianzi Wang (Johns Hopkins Univ.), Shinji Watanabe (Carnegie Mellon Univ.), Motoi Omachi
- arXiv.org
- 2021.7.16