-
- CONFERENCE (INTERNATIONAL)
- Data Collection-free Masked Video Modeling
- Yuchi Ishikawa, Masayoshi Kondo, Yoshimitsu Aoki (Keio University)
- The 18th European Conference on Computer Vision 2024 (ECCV 2024)
- October 03, 2024
-
- CONFERENCE (INTERNATIONAL)
- Verifying Finger-Fitts Models for Normalizing Subjective Speed-Accuracy Biases
- Shota Yamanaka, Hiroki Usuba, Yosuke Oba (Meiji University), Taiki Kinoshita (Meiji University), Ryuto Tomihari (Meiji University), Nobuhito Kasahara (Meiji University), Homei Miyashita (Meiji University)
- The ACM International Conference on Mobile Human-Computer Interaction (MobileHCI 2024)
- October 01, 2024
-
- CONFERENCE (INTERNATIONAL)
- Chronologically Accurate Retrieval for Temporal Grounding of Motion-Language Models
- Kent Fujiwara, Mikihiro Tanaka, Qing Yu
- The 18th European Conference on Computer Vision 2024 (ECCV 2024)
- September 29, 2024
-
- OTHERS (INTERNATIONAL)
- Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control
- Ryuichi Yamamoto, Yuma Shirahata, Masaya Kawamura, Kentaro Tachibana
- arXiv.org (arXiv)
- September 27, 2024
-
- OTHERS (DOMESTIC)
- スマートフォンを用いた手の疲労度推定
- 田島 孔明 (慶應義塾大学), 池松 香, 礒本 俊弥, 加藤 邦拓 (東京工科大学), 杉浦 裕太 (慶應義塾大学)
- 第213回ヒューマンインタフェース学会研究会 (SIG-DeMO-18)
- September 26, 2024
-
- OTHERS (INTERNATIONAL)
- DisasterNeedFinder: Understanding the Information Needs in the 2024 Noto Earthquake (Comprehensive Explanation)
- Kota Tsubouchi, Shuji Yamaguchi, Keijirou Saitou (Japan Broadcasting Corporation), Akihisa Soemori (NHK Global Media Servises), Masato Morita (NHK Global Media Servises), Shigeki Asou (Japan Broadcasting Corporation)
- arXiv.org (arXiv)
- September 11, 2024
-
- CONFERENCE (DOMESTIC)
- Japanese MT-bench++: より自然なマルチターン対話設定における大規模日本語ベンチマーク
- 植松 拓也 (早稲田大学), 福田 創 (早稲田大学), 河原 大輔 (早稲田大学), 柴田 知秀
- NLP若手の会 第19回シンポジウム (YANS2024)
- September 06, 2024
-
- CONFERENCE (INTERNATIONAL)
- Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data
- Yuma Shirahata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana
- The 25th Annual Conference of the International Speech Communication Association (INTERSPEECH 2024)
- September 04, 2024
-
- CONFERENCE (DOMESTIC)
- トピックモデルを用いた教師なし学習によるHuBERTの意味表現向上
- 前角 高史, Jiatong Shi (カーネギーメロン大学), Xuankai Chang (カーネギーメロン大学), 藤田 悠哉, 渡部 晋治 (カーネギーメロン大学)
- 日本音響学会 2024年秋季研究発表会 (ASJ 2024 autumn)
- September 04, 2024
-
- CONFERENCE (DOMESTIC)
- 感情音声合成のためのアラインメント手法の比較
- 蓮実 拓也, 白旗 悠真, Welly Naptali, 山本 龍一, Eunwoo Song (NAVER Cloud), 橘 健太郎, Jae-Min Kim (NAVER Cloud)
- 日本音響学会 2024年秋季研究発表会 (ASJ 2024 autumn)
- September 04, 2024
-
- CONFERENCE (DOMESTIC)
- 離散トークン音声認識におけるドメイン適応の検討
- 石井 敬章, 小松 達也, 藤田 雄介, 藤田 悠哉
- 日本音響学会 2024年秋季研究発表会 (ASJ 2024 autumn)
- September 04, 2024
-
- CONFERENCE (INTERNATIONAL)
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning
- Masaya Kawamura, Ryuichi Yamamoto, Yuma Shirahata, Takuya Hasumi, Kentaro Tachibana
- The 25th Annual Conference of the International Speech Communication Association (INTERSPEECH 2024)
- September 03, 2024
-
- CONFERENCE (INTERNATIONAL)
- Song Data Cleansing for End-to-End Neural Singer Diarization Using Neural Analysis and Synthesis Framework
- Hokuto Munakata, Ryo Terashima, Yusuke Fujita
- The 25th Annual Conference of the International Speech Communication Association (INTERSPEECH 2024)
- September 03, 2024
-
- CONFERENCE (INTERNATIONAL)
- Audio Fingerprinting with Holographic Reduced Representations
- Yusuke Fujita, Tatsuya Komatsu
- The 25th Annual Conference of the International Speech Communication Association (INTERSPEECH 2024)
- September 01, 2024
-
- CONFERENCE (INTERNATIONAL)
- Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment
- Takuto Igarashi (The University of Tokyo), Yuki Saito (The University of Tokyo), Kentaro Seki (The University of Tokyo), Shinnosuke Takamichi (The University of Tokyo), Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)
- The 25th Annual Conference of the International Speech Communication Association (INTERSPEECH 2024)
- September 01, 2024
-
- CONFERENCE (INTERNATIONAL)
- SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark
- Yuki Saito (The University of Tokyo), Takuto Igarashi (The University of Tokyo), Kentaro Seki (The University of Tokyo), Shinnosuke Takamichi (The University of Tokyo), Ryuichi Yamamoto, Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)
- The 25th Annual Conference of the International Speech Communication Association (INTERSPEECH 2024)
- September 01, 2024
-
- CONFERENCE (INTERNATIONAL)
- Universal Score-based Speech Enhancement with High Content Preservation
- Robin Scheibler, Yusuke Fujita, Yuma Shirahata, Tatsuya Komatsu
- The 25th Annual Conference of the International Speech Communication Association (INTERSPEECH 2024)
- September 01, 2024
-
- JOURNAL (INTERNATIONAL)
- MC-Whisper: Extending Speech Foundation Models to Multichannel Distant Speech Recognition
- Xuankai Chang (Carnegie Mellon University), Pengcheng Guo (Northwestern Polytechnical University), Yuya Fujita, Takashi Maekaku, Shinji Watanabe (Carnegie Mellon University)
- IEEE Signal Processing Letters (IEEE SPL)
- August 26, 2024
-
- WORKSHOP (INTERNATIONAL)
- Leveraging Instrumental Variables in Online Advertising Auctions : Robust Click-Through-Rate Prediction
- Ryohei Emori (Keio University), Shinya Suzumura, Takahiro Hoshino (Keio University), Nobuyuki Shimizu
- ADKDD 2024: the 17th International Workshop on Data Mining and Audience Intelligence for Advertising (AdKDD 2024)
- August 25, 2024
-
- CONFERENCE (DOMESTIC)
- Real-SRGD: 分類器無しガイダンスによる実世界超解像向け拡散モデルの画像品質改善
- 土井 賢治, 岡田 俊太郎, 吉橋 亮太, 片岡 裕雄
- 第27回 画像の認識・理解シンポジウム MIRU2024 (MIRU2024)
- August 09, 2024