Speech Processing
-
- CONFERENCE (INTERNATIONAL)
- Enhancing Multilingual TTS with Voice Conversion Based Data Augmentation and Posterior Embedding
- Hyun-Wook Yoon (NAVER Cloud), Jin-Seob Kim (NAVER Cloud), Ryuichi Yamamoto, Ryo Terashima, Chan-Ho Song (NAVER Cloud), Jae-Min Kim (NAVER Cloud), Eunwoo Song (NAVER Cloud)
- 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024)
- April 14, 2024
-
- CONFERENCE (INTERNATIONAL)
- Keep Decoding Parallel With Effective Knowledge Distillation From Language Models To End-To-End Speech Recognisers
- Michael Hentschel (LINE WORKS Corporation), Yuta Nishikawa (Nara Institute of Science and Technology), Tatsuya Komatsu, Yusuke Fujita
- 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024)
- April 14, 2024
-
- CONFERENCE (INTERNATIONAL)
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions
- Reo Shimizu (Tohoku University), Ryuichi Yamamoto, Masaya Kawamura, Yuma Shirahata, Hironori Doi, Tatsuya Komatsu, Kentaro Tachibana
- 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024)
- April 14, 2024
-
- CONFERENCE (DOMESTIC)
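- Experimental Evaluation of Contrastive Learning Between Japanese Text and Music
- Takuya Hasumi, Tatsuya Komatsu, Yusuke Fujita, Kosuke Futamata, Kentaro Tachibana
- Acoustical Society of Japan 2024 Spring Meeting (ASJ 2024 spring)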
- March 07, 2024
-
- CONFERENCE (DOMESTIC)
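- Universal Speech Enhancement by Combining Diffusion Processes and Adversarial Training
- Robin Scheibler, Yusuke Fujita, Kentaro Tachibana
- Acoustical Society of Japan 2024 Spring Meeting (ASJ 2024 spring)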
- March 06, 2024
-
- CONFERENCE (DOMESTIC)
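- Noise-Robust Voice Conversion via Denoising Training Conditioned on Latent Variables of Speech Quality and Acoustic Environment
- Takuto Igarashi (The University of Tokyo), Yuki Saito (The University of Tokyo), Kentaro Seki (The University of Tokyo), Shinnosuke Takamichi (The University of Tokyo), Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)
- IEICE/ASJ Technical Committee on Speech (IEICE/ASJ-SP)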
- February 22, 2024
-
- WORKSHOP (INTERNATIONAL)
- A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023
- Ryuichi Yamamoto (Nagoya University / LINE Corp.), Reo Yoneyama (Nagoya University), Lester Phillip Violeta (Nagoya University), Wen-Chin Huang (Nagoya University), Tomoki Toda (Nagoya University)
- The 2023 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2023)
- December 19, 2023
-
- CONFERENCE (INTERNATIONAL)
- Domain Adaptation by Data Distribution Matching via Submodularity for Speech Recognition
- Yusuke Shinohara, Shinji Watanabe (Carnegie Mellon University)
- The 2023 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2023)
- December 16, 2023
-
- CONFERENCE (INTERNATIONAL)
- LV-CTC: Non-autoregressive ASR with CTC and Latent Variable Models
- Yuya Fujita, Shinji Watanabe (Carnegie Mellon University), Xuankai Chang (Carnegie Mellon University), Takashi Maekaku
- The 2023 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2023)
- December 16, 2023
-
- JOURNAL (INTERNATIONAL)
- Self-conditioning via Intermediate Predictions for End-to-end Neural Speaker Diarization
- Yusuke Fujita, Tetsuji Ogawa (Waseda University), Tetsunori Kobayashi (Waseda University)
- IEEE Access
- December 07, 2023
-
- OTHERS (INTERNATIONAL)
- HuBERTopic: Enhancing Semantic Representation of HuBERT through Self-supervision Utilizing Topic Model
- Takashi Maekaku, Jiatong Shi (Carnegie Mellon University), Xuankai Chang (Carnegie Mellon University), Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- arXiv
- October 09, 2023
-
- CONFERENCE (DOMESTIC)
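- Transferring Linguistic Knowledge to the Intermediate Layers of End-to-End Speech Recognizers
- Michael Hentschel (WORKS MOBILE JAPAN), Yuta Nishikawa (Nara Institute of Science and Technology), Tatsuya Komatsu, Yusuke Fujita
- Acoustical Society of Japan 2023 Autumn Meeting (ASJ 2023 autumn)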
- September 26, 2023
-
- CONFERENCE (DOMESTIC)
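- NNSVS: Open-Source Software for Neural Network-Based Singing Voice Synthesis
- Ryuichi Yamamoto (Nagoya University / LINE), Reo Yoneyama (Nagoya University), Tomoki Toda (Nagoya University)
- Acoustical Society of Japan 2023 Autumn Meeting (ASJ 2023 autumn)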
- September 26, 2023
-
- CONFERENCE (DOMESTIC)
- Updating the Demixing Matrix Using Iterative Projection with Adjustment in Independent Low-Rank Matrix Analysis
- Takuya Hasumi, Robin Scheibler
- Acoustical Society of Japan 2023 Autumn Meeting (ASJ 2023 autumn)
- September 26, 2023
-
- CONFERENCE (DOMESTIC)
- Foley Sound Synthesis with a Class-Conditioned Latent Diffusion Model and FAD-Based Post-filtering
- Robin Scheibler, Takuya Hasumi, Yusuke Fujita, Tatsuya Komatsu, Ryuichi Yamamoto, Kentaro Tachibana
- Acoustical Society of Japan 2023 Autumn Meeting (ASJ 2023 autumn)
- September 26, 2023
-
- CONFERENCE (DOMESTIC)
- Domain Adaptation of End-to-End Speech Recognition Models by Data Distribution Matching
- Yusuke Shinohara, Shinji Watanabe (CMU)
- Acoustical Society of Japan 150th Meeting (2023 Autumn)
- September 26, 2023
-
- CONFERENCE (DOMESTIC)
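- Non-Autoregressive Speech Recognition with CTC Using Latent Variable Models
- Yuya Fujita, Shinji Watanabe (Carnegie Mellon University)
- Acoustical Society of Japan 150th Meeting (2023 Autumn)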
- September 26, 2023
-
- WORKSHOP (INTERNATIONAL)
- Foley Sound Synthesis with a Class-conditioned Latent Diffusion Model
- Robin Scheibler, Takuya Hasumi, Yusuke Fujita, Tatsuya Komatsu, Ryuichi Yamamoto, Kentaro Tachibana
- Detection and Classification of Acoustic Scenes and Events (DCASE 2023)
- September 20, 2023
-
- CONFERENCE (INTERNATIONAL)
- CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center
- Yuki Saito (The University of Tokyo), Eiji Iimori (The University of Tokyo), Shinnosuke Takamichi (The University of Tokyo), Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)
- The 24th Annual Conference of the International Speech Communication Association (INTERSPEECH 2023)
- August 20, 2023
-
- CONFERENCE (INTERNATIONAL)
- ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings
- Yuki Saito (The University of Tokyo), Shinnosuke Takamichi (The University of Tokyo), Eiji Iimori (The University of Tokyo), Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)
- The 24th Annual Conference of the International Speech Communication Association (INTERSPEECH 2023)
- August 20, 2023