Speech Processing
-
- CONFERENCE (INTERNATIONAL)
- Diffusion-based Generative Speech Source Separation
- Robin Scheibler, Youna Ji (NAVER Cloud), Soo-Whan Chung (NAVER Cloud), Jaeuk Byun (NAVER Cloud), Soyeon Choe (NAVER Cloud), Min-Seok Choi (NAVER Cloud)
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- June 04, 2023
-
- CONFERENCE (INTERNATIONAL)
- Effectiveness of Inter- and Intra-subarray Spatial Features for Acoustic Scene Classification
- Takao Kawamura (Tokyo Metropolitan University), Yuma Kinoshita (Tokyo Metropolitan University), Nobutaka Ono (Tokyo Metropolitan University), Robin Scheibler
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- June 04, 2023
-
- CONFERENCE (INTERNATIONAL)
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
- Masaya Kawamura (The University of Tokyo), Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- June 04, 2023
-
- CONFERENCE (INTERNATIONAL)
- Neural Diarization with Non-Autoregressive Intermediate Attractors
- Yusuke Fujita, Tatsuya Komatsu, Robin Scheibler, Yusuke Kida, Tetsuji Ogawa (Waseda University)
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- June 04, 2023
-
- CONFERENCE (INTERNATIONAL)
- NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit
- Ryuichi Yamamoto (LINE/Nagoya University), Reo Yoneyama (Nagoya University), Tomoki Toda (Nagoya University)
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- June 04, 2023
-
- CONFERENCE (INTERNATIONAL)
- Non-parallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs
- Reo Yoneyama (Nagoya University), Ryuichi Yamamoto, Kentaro Tachibana
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- June 04, 2023
-
- CONFERENCE (INTERNATIONAL)
- Period VITS: Variational Inference With Explicit Pitch Modeling For End-to-End Emotional Speech
- Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song (NAVER), Ryo Terashima, Jae-Min Kim (NAVER), Kentaro Tachibana
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- June 04, 2023
-
- OTHERS (INTERNATIONAL)
- Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
- Xuankai Chang (Carnegie Mellon University), Brian Yan (Carnegie Mellon University), Yuya Fujita, Takashi Maekaku, Shinji Watanabe (Carnegie Mellon University)
- arXiv
- May 29, 2023
-
- CONFERENCE (INTERNATIONAL)
- Align, Write, Re-order: Explainable End-to-End Speech Translation via Operation Sequence Generation
- Motoi Omachi, Brian Yan (Carnegie Mellon University), Siddharth Dalmia (Carnegie Mellon University), Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2023)
- May 08, 2023
-
- CONFERENCE (DOMESTIC)
- 訳語対の推定と順序入れ替え操作による説明可能なEnd-to-end音声翻訳
- 大町 基, Brian Yan (Carnegie Mellon University), Siddharth Dalmia (Carnegie Mellon University), 藤田 悠哉, 渡部 晋治 (Carnegie Mellon University)
- 日本音響学会2023年春季研究発表会 (音響学会)
- March 22, 2023
-
- CONFERENCE (DOMESTIC)
- Transformerを用いた音声認識モデルにおける事前分布を用いた注意重みの平滑化の検討
- 前角 高史, 藤田 悠哉, Yifang Peng (Carnegie Mellon University), 渡部 晋治 (Carnegie Mellon University)
- 日本音響学会2023年春季研究発表会
- March 16, 2023
-
- CONFERENCE (DOMESTIC)
- Diffusion-Mixing Process for Speech Source Separation
- シャイブラー ロビン, Ji Youna (NAVER), Chung Soo-Whan (NAVER), Byun Jaeuk (NAVER), Choe Soyeon (NAVER), Choi Min-Seok (NAVER)
- 日本音響学会 2023年春季研究発表会 (ASJ 2023 spring)
- March 15, 2023
-
- CONFERENCE (DOMESTIC)
- ヘビーテイル生成モデルに基づく独立低ランク行列分析における iterative source steering を用いた分離行列の更新
- 蓮実 拓也, シャイブラー ロビン
- 日本音響学会 2023年春季研究発表会 (ASJ 2023 spring)
- March 15, 2023
-
- CONFERENCE (DOMESTIC)
- 中間層予測を用いたEnd-to-end ダイアライゼーション
- 藤田 雄介, 小松 達也, Scheibler Robin, 木田 祐介, 小川 哲司 (早稲田大学)
- 日本音響学会 2023年春季研究発表会 (ASJ 2023 spring)
- March 15, 2023
-
- CONFERENCE (DOMESTIC)
- ストリーミング End-to-End 音声認識のための RNN Transducer の最小遅延学習
- 篠原 雄介, 渡部 晋治 (Carnegie Mellon University)
- 日本音響学会2023年春季研究発表会
- March 15, 2023
-
- OTHERS (DOMESTIC)
- 日本語音声認識における語彙集合分割とマルチタスク学習による 目的語彙抽出
- 伊藤 葵 (LINE/法政大学), 小松 達也, 藤田 雄介
- 電子情報通信学会/日本音響学会 音声研究会 (SP研究会)
- February 28, 2023
-
- CONFERENCE (INTERNATIONAL)
- Alternate Intermediate Conditioning with Syllable-level and Character-level Targets for Japanese ASR
- Yusuke Fujita, Tatsuya Komatsu, Yusuke Kida
- The 2022 IEEE Spoken Language Technology Workshop (SLT 2022)
- January 09, 2023
-
- CONFERENCE (INTERNATIONAL)
- End-to-End Multi-speaker ASR with Independent Vector Analysis
- Robin Scheibler, Wangyou Zhang (Shanghai Jiao Tong University), Xuankai Chang (Carnegie Mellon University), Shinji Watanabe (Carnegie Mellon University), Yanmin Qian (Shanghai Jiao Tong University)
- The 2022 IEEE Spoken Language Technology Workshop (SLT 2022)
- January 09, 2023
-
- CONFERENCE (INTERNATIONAL)
- Inter-Decoder: Using Attention-Decoder losses as Intermediate Regularization for CTC-based Speech Recognition
- Tatsuya Komatsu, Yusuke Fujita
- The 2022 IEEE Spoken Language Technology Workshop (SLT 2022)
- January 09, 2023
-
- CONFERENCE (INTERNATIONAL)
- Adaptive Noise Canceller Algorithm with an SNR-Based Stepsize and Controlled Averaging
- Akihiko Sugiyama
- IEEE International Conference on Consumer Electronics (ICCE)
- January 06, 2023