Generative AI
-
- CONFERENCE (INTERNATIONAL)
- Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation
- Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song (NAVER), Yuma Shirahata, Hyun-Wook Yoon (NAVER), Jae-Min Kim (NAVER), Kentaro Tachibana
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- September 18, 2022
-
- CONFERENCE (INTERNATIONAL)
- DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning
- Takaaki Saeki (The University of Tokyo), Kentaro Tachibana, Ryuichi Yamamoto
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- September 18, 2022
-
- CONFERENCE (INTERNATIONAL)
- TTS-by-TTS 2: Data-selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder
- Eunwoo Song (NAVER), Ryuichi Yamamoto, Ohsung Kwon (NAVER), Chan-Ho Song, Min-Jae Hwang (NAVER), Suhyeon Oh (NAVER), Hyun-Wook Yoon (NAVER), Jin-Seob Kim (NAVER), Jae-Min Kim (NAVER)
- The 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022)
- September 18, 2022
-
- CONFERENCE (DOMESTIC)
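- Persona-Aware Response Generation Using a Large-Scale General-Purpose Language Model (大規模汎用言語モデルによるペルソナを考慮した応答生成)
- 川本 稔己 (LINE/Tokyo Institute of Technology), 山崎 天, 佐藤 敏紀, 奥村 学 (Tokyo Institute of Technology)
- The 28th Annual Meeting of the Association for Natural Language Processing (NLP 2022)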
- March 14, 2022
-
- CONFERENCE (DOMESTIC)
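- A Study of Utterance Control Based on Interpersonal Relationships in a Chit-Chat Dialogue System Using a Large-Scale General-Purpose Language Model (大規模汎用言語モデルを用いた雑談対話システムの対人関係性に基づく発話制御の検討)
- 山崎 天, 川本 稔己 (LINE/Tokyo Institute of Technology), 吉川 克正, 佐藤 敏紀
- The 28th Annual Meeting of the Association for Natural Language Processing (NLP 2022)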
- March 14, 2022
-
- CONFERENCE (DOMESTIC)
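- An Investigation of the Effect of Tokenizers Using Japanese GPT (日本語GPTを用いたトークナイザの影響の調査)
- 井上 誠一 (LINE/Tokyo Metropolitan University), Nguyen Tung, 中町 礼文, 李 聖哲, 佐藤 敏紀
- The 28th Annual Meeting of the Association for Natural Language Processing (NLP 2022)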
- March 14, 2022
-
- WORKSHOP (DOMESTIC)
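- Empathetic Dialogue Speech Synthesis Considering Prosodic Information in the Dialogue History (対話履歴の韻律情報を考慮した共感的対話音声合成)
- 西邑 勇人 (The University of Tokyo), 齋藤 佑樹 (The University of Tokyo), 高道 慎之介 (The University of Tokyo), 橘 健太郎, 猿渡 洋 (The University of Tokyo)
- The 140th IPSJ SIG Technical Meeting on Spoken Language Processing (SLP 2022)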
- March 01, 2022
-
- WORKSHOP (DOMESTIC)
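- Situation-Appropriate Response Generation via Prompt Programming with HyperCLOVA (HyperCLOVA を利用したプロンプトプログラミングによるシチュエーションに適した応答生成)
- 川本 稔己 (LINE/Tokyo Institute of Technology), 山崎 天, 坂田 亘, 佐藤 敏紀
- The 12th Dialogue System Symposium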
- November 29, 2021
-
- WORKSHOP (DOMESTIC)
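- A Chit-Chat Dialogue System Using HyperCLOVA That Integrates Persona Consistency and a Knowledge Base (ペルソナ一貫性の考慮と知識ベースを統合した HyperCLOVA を用いた雑談対話システム)
- 山崎 天, 坂田 亘, 川本 稔己 (LINE/Tokyo Institute of Technology), 小林 滉河, Nguyen Tung, 上村 卓史, 中町 礼文, 李 聖哲, 佐藤 敏紀
- The 12th Dialogue System Symposium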
- November 29, 2021
-
- CONFERENCE (INTERNATIONAL)
- High-fidelity Parallel WaveGAN with Multi-band Harmonic-plus-Noise Model
- Min-Jae Hwang (Search Solutions Inc), Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER)
- The 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021)
- August 30, 2021
-
- WORKSHOP (INTERNATIONAL)
- Improved Parallel WaveGAN with Perceptually Weighted Spectrogram Loss
- Eunwoo Song (NAVER), Ryuichi Yamamoto, Min-Jae Hwang (NAVER), Jin-Seob Kim (NAVER), Ohsung Kwon (NAVER), Jae-Min Kim (NAVER)
- 2021 IEEE Spoken Language Technology Workshop (SLT) (SLT 2021)
- June 19, 2021
-
- CONFERENCE (INTERNATIONAL)
- Parallel Waveform Synthesis Based on Generative Adversarial Networks with Voicing-Aware Conditional Discriminators
- Ryuichi Yamamoto, Eunwoo Song (NAVER), Min-Jae Hwang (Search Solutions Inc), Jae-Min Kim (NAVER)
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- June 06, 2021
-
- CONFERENCE (INTERNATIONAL)
- TTS-by-TTS: TTS-Driven Data Augmentation for Fast and High-Quality Speech Synthesis
- Min-Jae Hwang (Search Solutions Inc), Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER)
- 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2021)
- June 06, 2021
-
- CONFERENCE (INTERNATIONAL)
- Neural Text-to-Speech with a Modeling-by-Generation Excitation Vocoder
- Eunwoo Song (NAVER), Min-Jae Hwang (Search Solutions Inc), Ryuichi Yamamoto, Jin-Seob Kim (NAVER), Ohsung Kwon (NAVER), Jae-Min Kim (NAVER)
- The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)
- October 25, 2020
-
- CONFERENCE (INTERNATIONAL)
- ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit
- Tomoki Hayashi (Nagoya University), Ryuichi Yamamoto, Katsuki Inoue (Okayama University), Takenori Yoshimura (Nagoya University), Shinji Watanabe (Johns Hopkins University), Tomoki Toda (Nagoya University), Kazuya Takeda (Nagoya University), Yu Zhang (Google AI), Xu Tan (Microsoft Research)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Improving LPCNet-Based Text-to-Speech with Linear Prediction-Structured Mixture Density Network
- Min-Jae Hwang (Search Solutions Inc), Eunwoo Song (NAVER), Ryuichi Yamamoto, Frank Soong (Microsoft Research Asia), Hong-Goo Kang (Yonsei University)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Parallel WaveGAN: A Fast Waveform Generation Model Based on Generative Adversarial Networks with Multi-Resolution Spectrogram
- Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Semi-Supervised Speaker Adaptation for End-to-End Speech Synthesis with Pretrained Models
- Katsuki Inoue (Okayama University), Sunao Hara (Okayama University), Masanobu Abe (Okayama University), Tomoki Hayashi (Nagoya University), Ryuichi Yamamoto, Shinji Watanabe (Johns Hopkins University)
- 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)
- May 04, 2020
-
- CONFERENCE (INTERNATIONAL)
- Probability Density Distillation with Generative Adversarial Networks for High-Quality Parallel Waveform Generation
- Ryuichi Yamamoto, Eunwoo Song (NAVER), Jae-Min Kim (NAVER)
- The 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019)
- September 15, 2019