Speech Processing
-
- CONFERENCE (DOMESTIC)
- 挿入操作に基づく End-to-End 音声認識
- 藤田 悠哉, 渡部 晋治 (Johns Hopkins Univ.), 大町 基, Xuankai Chang (Johns Hopkins Univ.)
- 日本音響学会2020年秋季研究発表会 (音響学会)
- September 09, 2020
-
- OTHERS (INTERNATIONAL)
- Insertion-Based Modeling for End-to-End Automatic Speech Recognition
- Yuya Fujita, Shinji Watanabe (Johns Hopkins University), Motoi Omachi, Xuankai Chang (Johns Hopkins University)
- arXiv.org
- May 27, 2020
-
- CONFERENCE (INTERNATIONAL)
- Attention-based ASR with Lightweight and Dynamic Convolutions
- Yuya Fujita, Aswin Shanmugam Subramanian (Johns Hopkins University), Motoi Omachi, Shinji Watanabe (Johns Hopkins University)
- 45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020)
- May 08, 2020
-
- CONFERENCE (INTERNATIONAL)
- Consistency-Aware Multi-Channel Speech Enhancement Using Deep Neural Networks
- Yoshiki Masuyama (Waseda University), Masahito Togami, Tatsuya Komatsu
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Deep Speech Extraction with Time-Varying Spatial Filtering Guided By Desired Direction Attractor
- Yu Nakagome (Waseda University), Masahito Togami, Tetsuji Ogawa (Waseda University), Tetsunori Kobayashi (Waseda University)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit
- Tomoki Hayashi (Nagoya University), Ryuichi Yamamoto, Katsuki Inoue (Okayama University), Takenori Yoshimura (Nagoya University), Shinji Watanabe (Johns Hopkins University), Tomoki Toda (Nagoya University), Kazuya Takeda (Nagoya University), Yu Zhang (Google AI), Xu Tan (Microsoft Research)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Fast Start-Up Algorithm for Adaptive Noise Cancellers with Novel SNR Estimation and Stepsize Control
- Akihiko Sugiyama
- International Conference on Acoustics, Speech, and Signal Processing 2020 (ICASSP2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Improving LPCNet-Based Text-to-Speech with Linear Prediction-Structured Mixture Density Network
- Min-Jae Hwang (Search Solutions Inc), Eunwoo Song (NAVER), Ryuichi Yamamoto, Frank Soong (Microsoft Research Asia), Hong-Goo Kang (Yonsei University)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Joint Training of Deep Neural Networks for Multi-Channel Dereverberation and Speech Source Separation
- Masahito Togami
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Multi-Channel Speech Source Separation and Dereverberation With Sequential Integration of Determined and Underdetermined Models
- Masahito Togami
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Parallel WaveGAN: A Fast Waveform Generation Model Based on Generative Adversarial Networks with Multi-Resolution Spectrogram
- Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Scene-Dependent Acoustic Event Detection with Scene Conditioning and Fake-Scene-Conditioned Loss
- Tatsuya Komatsu, Keisuke Imoto (Ritsumeikan University), Masahito Togami
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Semi-Supervised Speaker Adaptation for End-to-End Speech Synthesis with Pretrained Models
- Katsuki Inoue (Okayama University), Sunao Hara (Okayama University), Masanobu Abe (Okayama University), Tomoki Hayashi (Nagoya University), Ryuichi Yamamoto, Shinji Watanabe (Johns Hopkins University)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Unsupervised Training for Deep Speech Source Separation with Kullback-Leibler Divergence Based Probabilistic Loss Function
- Masahito Togami, Yoshiki Masuyama (Waseda University), Tatsuya Komatsu, Yu Nakagome (Waseda University)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Weakly-Supervised Sound Event Detection with Self-Attention
- Koichi Miyazaki, Tatsuya Komatsu, Tomoki Hayashi (Nagoya University), Shinji Watanabe (Johns Hopkins University), Tomoki Toda (Nagoya University), Kazuya Takeda (Nagoya University)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (DOMESTIC)
- (招待講演)End-to-end 音声合成の研究を加速させるオープンソースツールキット ESPnet-TTS
- 林 知樹 (名古屋大学), 山本 龍一, 井上 勝喜 (岡山大学), 吉村 建慶 (岡山大学), 武田 一哉 (名古屋大学), 戸田 智基 (名古屋大学), 渡部 晋治 (Johns Hopkins University)
- 日本音響学会 2020年春季研究発表会 (ASJ 2020 spring)
- March 16, 2020
-
- CONFERENCE (DOMESTIC)
- Self-attention を用いた弱教師あり音響イベント検出
- 宮崎 晃一 (名古屋大学), 小松 達也, 林 知樹 (名古屋大学), 渡部 晋治 (Johns Hopkins University), 戸田 智基 (名古屋大学), 武田 一哉 (名古屋大学)
- 日本音響学会 2020年春季研究発表会 (ASJ 2020 spring)
- March 16, 2020
-
- CONFERENCE (DOMESTIC)
- 所望音源の方向アトラクターに基づく時変の空間フィルタを用いた DNN 音声抽出
- 中込 優 (早稲田大学), 戸上 真人, 小川 哲司 (早稲田大学), 小林 哲則 (早稲田大学)
- 日本音響学会 2020年春季研究発表会 (ASJ 2020 spring)
- March 16, 2020
-
- CONFERENCE (DOMESTIC)
- End-to-End 音声認識を用いた音声合成の半教師あり話者適応
- 井上 勝喜 (岡山大学), 原 直 (岡山大学), 阿部 匡伸 (岡山大学), 林 知樹 (名古屋大学), 山本 龍一, 渡部 晋治 (Johns Hopkins University)
- 日本音響学会 2020年春季研究発表会 (ASJ 2020 spring)
- March 16, 2020
-
- CONFERENCE (DOMESTIC)
- 軽量・動的畳み込みを用いたend-to-end音声認識
- 藤田 悠哉, Aswin Shanmugam Subramanian*, 大町 基, 渡部晋治* (* Johns Hopkins University)
- 日本音響学会2020年春季研究発表会 (音響学会)
- March 09, 2020