音声処理
-
- カンファレンス (国際)
- Insertion-Based Modeling for End-to-End Automatic Speech Recognition
- Yuya Fujita, Shinji Watanabe (Johns Hopkins University), Motoi Omachi, Xuankai Chang (Johns Hopkins University)
- INTERSPEECH 2020
- 2020.10.25
-
- カンファレンス (国際)
- Mentoring-Reverse Mentoring for Unsupervised Multi-channel Speech Source Separation
- Yu Nakagome (Waseda University), Masahito Togami, Tetsuji Ogawa (Waseda University), Tetsunori Kobayashi (Waseda University)
- The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
- 2020.10.25
-
- カンファレンス (国際)
- Neural Text-to-Speech with a Modeling-by-Generation Excitation Vocoder
- Eunwoo Song (NAVER), Min-Jae Hwang (Search Solutions Inc.), Ryuichi Yamamoto, Jin-Seob Kim (NAVER), Ohsung Kwon (NAVER), Jae-Min Kim (NAVER)
- The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
- 2020.10.25
-
- カンファレンス (国際)
- Sparseness-Aware DOA Estimation with Majorization Minimization
- Masahito Togami, Robin Scheibler
- The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
- 2020.10.25
-
- カンファレンス (国内)
- 単語の表記と素性を同時出力するend-to-end音声認識
- 大町 基, 藤田 悠哉, 渡部 晋治 (Johns Hopkins University), Xuankai Chang (Johns Hopkins University)
- 日本音響学会2020年秋季研究発表会 (音響学会)
- 2020.9.11
-
- カンファレンス (国内)
- A Generalized Minimal Distortion Principle to Solve the Scale Ambiguity in Blind Source Separation
- シャイブラー ロビン
- 日本音響学会 2020年秋季研究発表会 (ASJ 2020 autumn)
- 2020.9.9
-
- カンファレンス (国内)
- Mentoring-Reverse Mentoring:多チャンネル音源分離における教師なし学習のための知識伝搬フレームワーク
- 中込 優 (早稲田大学), 戸上 真人, 小川 哲司 (早稲田大学), 小林 哲則 (早稲田大学)
- 日本音響学会 2020年秋季研究発表会 (ASJ 2020 autumn)
- 2020.9.9
-
- カンファレンス (国内)
- 挿入操作に基づく End-to-End 音声認識
- 藤田 悠哉, 渡部 晋治 (Johns Hopkins Univ.), 大町 基, Xuankai Chang (Johns Hopkins Univ.)
- 日本音響学会2020年秋季研究発表会 (音響学会)
- 2020.9.9
-
- その他 (国際)
- Insertion-Based Modeling for End-to-End Automatic Speech Recognition
- Yuya Fujita, Shinji Watanabe (Johns Hopkins University), Motoi Omachi, Xuankai Chang (Johns Hopkins University)
- arXiv.org
- 2020.5.27
-
- カンファレンス (国際)
- Attention-based ASR with Lightweight and Dynamic Convolutions
- Yuya Fujita, Aswin Shanmugam Subramanian (Johns Hopkins University), Motoi Omachi, Shinji Watanabe (Johns Hopkins University)
- 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020)
- 2020.5.8
-
- カンファレンス (国際)
- Consistency-Aware Multi-Channel Speech Enhancement Using Deep Neural Networks
- Yoshiki Masuyama (Waseda University), Masahito Togami, Tatsuya Komatsu
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- 2020.5.4
-
- カンファレンス (国際)
- Deep Speech Extraction with Time-Varying Spatial Filtering Guided By Desired Direction Attractor
- Yu Nakagome (Waseda University), Masahito Togami, Tetsuji Ogawa (Waseda University), Tetsunori Kobayashi (Waseda University)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- 2020.5.4
-
- カンファレンス (国際)
- ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit
- Tomoki Hayashi (Nagoya University), Ryuichi Yamamoto, Katsuki Inoue (Okayama University), Takenori Yoshimura (Nagoya University), Shinji Watanabe (Johns Hopkins University), Tomoki Toda (Nagoya University), Kazuya Takeda (Nagoya University), Yu Zhang (Google AI), Xu Tan (Microsoft Research)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- 2020.5.4
-
- カンファレンス (国際)
- Fast Start-Up Algorithm for Adaptive Noise Cancellers with Novel SNR Estimation and Stepsize Control
- Akihiko Sugiyama
- International Conference on Acoustics, Speech, and Signal Processing 2020 (ICASSP2020)
- 2020.5.4
-
- カンファレンス (国際)
- Improving LPCNet-Based Text-to-Speech with Linear Prediction-Structured Mixture Density Network
- Min-Jae Hwang (Search Solutions Inc), Eunwoo Song (NAVER), Ryuichi Yamamoto, Frank Soong (Microsoft Research Asia), Hong-Goo Kang (Yonsei University)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- 2020.5.4
-
- カンファレンス (国際)
- Joint Training of Deep Neural Networks for Multi-Channel Dereverberation and Speech Source Separation
- Masahito Togami
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- 2020.5.4
-
- カンファレンス (国際)
- Multi-Channel Speech Source Separation and Dereverberation With Sequential Integration of Determined and Underdetermined Models
- Masahito Togami
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- 2020.5.4
-
- カンファレンス (国際)
- Parallel WaveGAN: A Fast Waveform Generation Model Based on Generative Adversarial Networks with Multi-Resolution Spectrogram
- Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- 2020.5.4
-
- カンファレンス (国際)
- Scene-Dependent Acoustic Event Detection with Scene Conditioning and Fake-Scene-Conditioned Loss
- Tatsuya Komatsu, Keisuke Imoto (Ritsumeikan University), Masahito Togami
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- 2020.5.4
-
- カンファレンス (国際)
- Semi-Supervised Speaker Adaptation for End-to-End Speech Synthesis with Pretrained Models
- Katsuki Inoue (Okayama University), Sunao Hara (Okayama University), Masanobu Abe (Okayama University), Tomoki Hayashi (Nagoya University), Ryuichi Yamamoto, Shinji Watanabe (Johns Hopkins University)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- 2020.5.4