-
- カンファレンス (国内)
- Foley Sound Synthesis with a Class-Conditioned Latent Diffusion Model and FAD-Based Post-filtering
- シャイブラー ロビン, 蓮実 拓也, 藤田 雄介, 小松 達也, 山本 龍一, 橘 健太郎
- 日本音響学会 2023年秋季研究発表会 (ASJ 2023 autumn)
- 2023.9.26
-
- カンファレンス (国内)
- データの分布マッチングによる End-to-End 音声認識モデルのドメイン適応
- 篠原 雄介, 渡部 晋治 (CMU)
- 日本音響学会 第150回(2023年秋季)研究発表会
- 2023.9.26
-
- カンファレンス (国内)
- 潜在変数モデルを用いたCTCによる非自己回帰型音声認識
- 藤田 悠哉, 渡部 晋治 (カーネギーメロン大学)
- 日本音響学会第150回(2023年秋季)研究発表会
- 2023.9.26
-
- 論文誌 (国際)
- Mechanisms to Address Different Privacy Requirements for Users and Locations
- Ryota HIRAISHI (Kyoto univ.), Masatoshi YOSHIKAWA (Kyoto univ.), Yang CAO (Hokkaido univ.), Sumio FUJITA, Hidehito GOMI
- The IEICE Transactions on Information and Systems (IEICE Transactions)
- 2023.9.25
-
- ワークショップ (国際)
- Foley Sound Synthesis with a Class-conditioned Latent Diffusion Model
- Robin Scheibler, Takuya Hasumi, Yusuke Fujita, Tatsuya Komatsu, Ryuichi Yamamoto, Kentaro Tachibana
- Detection and Classification of Acoustic Scenes and Events (DCASE 2023)
- 2023.9.20
-
- カンファレンス (国際)
- Single-tap Latency Reduction with Single- or Double- tap Prediction
- Naoto Nishida* (The University of Tokyo) , Kaori Ikematsu*, Junichi Sato, Shota Yamanaka, Kota Tsubouchi, *co-first authors
- The ACM International Conference on Mobile Human-Computer Interaction (MobileHCI2023)
- 2023.9.13
-
- カンファレンス (国際)
- An Open-Domain Avatar Chatbot by Exploiting a Large Language Model
- Takato Yamazaki, Tomoya Mizumoto, Katsumasa Yoshikawa, Masaya Ohagi, Toshiki Kawamoto (LINE/Tokyo Institute of Technology), Toshinori Sato
- 24th Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2023)
- 2023.9.11
-
- カンファレンス (国際)
- Bridging the Gap between Subword and Character Segmentation in Pretrained Language Models
- Shun Kiyono, Sho Takase, Shengzhe Li, Toshinori Sato
- Recent Advances in Natural Language Processing (RANLP 2023)
- 2023.9.4
-
- カンファレンス (国際)
- On Text Localization in End-to-End OCR-Free Document Understanding Transformer without Text Localization Supervision
- Geewook Kim (NAVER Cloud), Shuhei Yokoo, Sukmin Seo (NAVER Cloud), Atsuki Osanai, Yamato Okamoto, Youngmin Baek (NAVER Cloud)
- 10th International Workshop on Camera-Based Document Analysis and Recognition (CBDAR2023)
- 2023.8.25
-
- 論文誌 (国際)
- Building a hospitable and reliable dialogue system for android robots: a scenario-based approach with large language models
- Takato Yamazaki, Katsumasa Yoshikawa, Toshiki Kawamoto (LINE/Tokyo Institute of Technology), Tomoya Mizumoto, Masaya Ohagi, Toshinori Sato
- Advanced Robotics (Advanced Robotics)
- 2023.8.22
-
- カンファレンス (国際)
- CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center
- Yuki Saito (The University of Tokyo), Eiji Iimori (The University of Tokyo), Shinnosuke Takamichi (The University of Tokyo), Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)
- The 24th Annual Conference of the International Speech Communication Association (INTERSPEECH 2023)
- 2023.8.20
-
- カンファレンス (国際)
- ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings
- Yuki Saito (The University of Tokyo), Shinnosuke Takamichi (The University of Tokyo), Eiji Iimori (The University of Tokyo), Kentaro Tachibana, Hiroshi Saruwatari (The University of Tokyo)
- The 24th Annual Conference of the International Speech Communication Association (INTERSPEECH 2023)
- 2023.8.20
-
- カンファレンス (国際)
- Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
- Xuankai Chang (Carnegie Mellon University), Brian Yan (Carnegie Mellon University), Yuya Fujita, Takashi Maekaku, Shinji Watanabe (Carnegie Mellon University)
- The 24th Annual Conference of the International Speech Communication Association (INTERSPEECH 2023)
- 2023.8.20
-
- カンファレンス (国際)
- Multi-channel separation of dynamic speech and sound events
- Takuya Fujimura (Nagoya University), Robin Scheibler
- The 24th Annual Conference of the International Speech Communication Association (INTERSPEECH 2023)
- 2023.8.20
-
- カンファレンス (国際)
- Target Vocabulary Recognition Based on Multi-Task Learning with Decomposed Teacher Sequences
- Aoi Ito (Hosei University), Tatsuya Komatsu, Yusuke Fujita, Yusuke Kida
- The 24th Annual Conference of the International Speech Communication Association (INTERSPEECH 2023)
- 2023.8.20
-
- カンファレンス (国際)
- Off-Policy Evaluation of Ranking Policies under Diverse User Behavior
- Haruka Kiyohara (Tokyo Institute of Technology), Masatoshi Uehara (Cornell University), Yusuke Narita (Yale University), Nobuyuki Shimizu (Yahoo Japan Corporation), Yasuo Yamamoto (Yahoo Japan Corporation), Yuta Saito (Cornell University)
- 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD 2023)
- 2023.8.4
-
- その他 (国内)
- Webブラウザに表示されるプライベートな情報を視覚的に遮蔽するシステムについての予備調査
- 石田 瑞季 (お茶の水女子大学), 池松 香, 五十嵐 悠紀 (お茶の水女子大学)
- 情報処理学会 第204回ヒューマンコンピュータインタラクション研究会
- 2023.8.1
-
- ワークショップ (国際)
- Towards Consistency Filtering-Free Unsupervised Learning for Dense Retrieval
- Haoxiang Shi (Waseda University), Sumio Fujita, Tetsuya Sakai (Waseda University)
- Workshop on Reaching Efficiency in Neural Information Retrieval (ReNeuIR '23)
- 2023.7.27
-
- カンファレンス (国際)
- Maintenance-Free Smart Hand Dynamometer
- Sarii Yamamoto (Keio University), Fei Gu (Keio University), Kaori Ikematsu, Kunihiro Kato (Tokyo University of Technology), Yuta Sugiura (Keio University)
- Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC2023)
- 2023.7.26
-
- カンファレンス (国内)
- アテンションはアノテーションの代わりになるか?:テキスト−画像生成モデルの注視機構を利用した領域分割の弱教師あり学習
- 吉橋 亮太, 大塚 雄也, 土井 賢治, 田中 智大
- 第26回 画像の認識・理解シンポジウム (MIRU2023)
- 2023.7.26