音声処理
-
- カンファレンス (国際)
- Multi-channel separation of dynamic speech and sound events
- Takuya Fujimura (Nagoya University), Robin Scheibler
- The 24th Annual Conference of the International Speech Communication Association (INTERSPEECH 2023)
- 2023.8.20
-
- カンファレンス (国際)
- Target Vocabulary Recognition Based on Multi-Task Learning with Decomposed Teacher Sequences
- Aoi Ito (Hosei University), Tatsuya Komatsu, Yusuke Fujita, Yusuke Kida
- The 24th Annual Conference of the International Speech Communication Association (INTERSPEECH 2023)
- 2023.8.20
-
- 論文誌 (国際)
- Audio Signal Processing in the 21st Century
- Gaël Richard (Telecom-Paris), Paris Smaragdis (University of Illinois Urbana-Champaign), Sharon Gannot (Bar-Ilan University), Patrick A. Naylor (Imperial College London), Shoji Makino (Waseda University), Walter Kellermann (University of Erlangen-N ̈urnberg), Akihiko Sugiyama
- IEEE Signal Processing Magazine (Signal Processing Magazine)
- 2023.7.19
-
- ワークショップ (国内)
- ChatGPT-EDSS: ChatGPT由来のContext Word Embeddingから学習される共感的対話音声合成モデル
- 齋藤 佑樹 (東京大学), 高道 慎之介 (東京大学), 飯森 英治 (東京大学), 橘 健太郎, 猿渡 洋 (東京大学)
- 第137回MUS・第147回SLP合同研究発表会 (音学シンポジウム 2023)
- 2023.6.23
-
- カンファレンス (国際)
- Fully Unsupervised Topic Clustering of Unlabelled Spoken Audio Using Self-Supervised Representation Learning and Topic Model
- Takashi Maekaku, Yuya Fujita, Xuankai Chang (Carnegie Mellon University), Shinji Watanabe (Carnegie Mellon University)
- The International Conference on Acoustics, Speech, & Signal Processing 2023 (ICASSP 2023)
- 2023.6.7
-
- カンファレンス (国際)
- Adaptive Noise Canceller Algorithm with SNR-Based Stepsize and Data-Dependent Averaging
- Akihiko Sugiyama
- 2023 International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- 2023.6.5
-
- カンファレンス (国際)
- Linear Microphone Array Parallel to the Driving Direction for In-Car Speech Enhancement
- Masanori Tsujikawa (NEC), Akihiko Sugiyama, Ken Hanazawa (NEC America), Yoshinobu Kajikawa (Kansai University)
- 2023 International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- 2023.6.5
-
- カンファレンス (国際)
- Conversation-oriented ASR with multi-look-ahead CBS architecture
- Huaibo Zhao (Waseda University), Shinya Fujie (Waseda University), Tetsuji Ogawa (Waseda University), Jin Sakuma (Waseda University), Yusuke Kida, Tetsunori Kobayashi (Waseda University)
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- 2023.6.4
-
- カンファレンス (国際)
- Diffusion-based Generative Speech Source Separation
- Robin Scheibler, Youna Ji (NAVER Cloud), Soo-Whan Chung (NAVER Cloud), Jaeuk Byun (NAVER Cloud), Soyeon Choe (NAVER Cloud), Min-Seok Choi (NAVER Cloud)
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- 2023.6.4
-
- カンファレンス (国際)
- Effectiveness of Inter- and Intra-subarray Spatial Features for Acoustic Scene Classification
- Takao Kawamura (Tokyo Metropolitan University), Yuma Kinoshita (Tokyo Metropolitan University), Nobutaka Ono (Tokyo Metropolitan University), Robin Scheibler
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- 2023.6.4
-
- カンファレンス (国際)
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
- Masaya Kawamura (The University of Tokyo), Yuma Shirahata, Ryuichi Yamamoto, Kentaro Tachibana
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- 2023.6.4
-
- カンファレンス (国際)
- Neural Diarization with Non-Autoregressive Intermediate Attractors
- Yusuke Fujita, Tatsuya Komatsu, Robin Scheibler, Yusuke Kida, Tetsuji Ogawa (Waseda University)
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- 2023.6.4
-
- カンファレンス (国際)
- NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit
- Ryuichi Yamamoto (LINE/Nagoya University), Reo Yoneyama (Nagoya University), Tomoki Toda (Nagoya University)
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- 2023.6.4
-
- カンファレンス (国際)
- Non-parallel High-Quality Audio Super Resolution with Domain Adaptation and Resampling CycleGANs
- Reo Yoneyama (Nagoya University), Ryuichi Yamamoto, Kentaro Tachibana
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- 2023.6.4
-
- カンファレンス (国際)
- Period VITS: Variational Inference With Explicit Pitch Modeling For End-to-End Emotional Speech
- Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song (NAVER), Ryo Terashima, Jae-Min Kim (NAVER), Kentaro Tachibana
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)
- 2023.6.4
-
- その他 (国際)
- Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
- Xuankai Chang (Carnegie Mellon University), Brian Yan (Carnegie Mellon University), Yuya Fujita, Takashi Maekaku, Shinji Watanabe (Carnegie Mellon University)
- arXiv
- 2023.5.29
-
- カンファレンス (国際)
- Align, Write, Re-order: Explainable End-to-End Speech Translation via Operation Sequence Generation
- Motoi Omachi, Brian Yan (Carnegie Mellon University), Siddharth Dalmia (Carnegie Mellon University), Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2023)
- 2023.5.8
-
- カンファレンス (国内)
- 訳語対の推定と順序入れ替え操作による説明可能なEnd-to-end音声翻訳
- 大町 基, Brian Yan (Carnegie Mellon University), Siddharth Dalmia (Carnegie Mellon University), 藤田 悠哉, 渡部 晋治 (Carnegie Mellon University)
- 日本音響学会2023年春季研究発表会 (音響学会)
- 2023.3.22
-
- カンファレンス (国内)
- Transformerを用いた音声認識モデルにおける事前分布を用いた注意重みの平滑化の検討
- 前角 高史, 藤田 悠哉, Yifang Peng (Carnegie Mellon University), 渡部 晋治 (Carnegie Mellon University)
- 日本音響学会2023年春季研究発表会
- 2023.3.16
-
- カンファレンス (国内)
- Diffusion-Mixing Process for Speech Source Separation
- シャイブラー ロビン, Ji Youna (NAVER), Chung Soo-Whan (NAVER), Byun Jaeuk (NAVER), Choe Soyeon (NAVER), Choi Min-Seok (NAVER)
- 日本音響学会 2023年春季研究発表会 (ASJ 2023 spring)
- 2023.3.15