Generative AI
-
- CONFERENCE (INTERNATIONAL)
- BitTTS: Highly Compact Text-to-Speech Using 1.58-bit Quantization and Weight Indexing
- Masaya Kawamura, Takuya Hasumi, Yuma Shirahata, Ryuichi Yamamoto
- The 26th Annual Conference of the International Speech Communication Association (INTERSPEECH 2025)
- August 21, 2025
-
- CONFERENCE (INTERNATIONAL)
- Comparative Analysis of Fast and High-Fidelity Neural Vocoders for Low-Latency Streaming Synthesis in Resource-Constrained Environments
- Reo Yoneyama (Nagoya University), Masaya Kawamura, Ryo Terashima, Ryuichi Yamamoto (Nagoya University/LY Corporation), Tomoki Toda (Nagoya University)
- The 26th Annual Conference of the International Speech Communication Association (INTERSPEECH 2025)
- August 21, 2025
-
- CONFERENCE (INTERNATIONAL)
- Grapheme-Coherent Phonemic and Prosodic Annotation of Speech by Implicit and Explicit Grapheme Conditioning
- Hien Ohnaka (Nara Institute of Science and Technology), Yuma Shirahata, Byeongseon Park, Ryuichi Yamamoto
- The 26th Annual Conference of the International Speech Communication Association (INTERSPEECH 2025)
- August 17, 2025
-
- CONFERENCE (DOMESTIC)
- SCAdapter: A Content-Style Disentanglement Approach for Diffusion-Based Style Transfer
- Luan Thanh Trinh, 土井 賢治, 長内 淳樹
- 第28回 画像の認識・理解シンポジウム MIRU2025 (MIRU2025)
- July 30, 2025
-
- OTHERS (INTERNATIONAL)
- A Provable Approach for End-to-End Safe Reinforcement Learning
- Akifumi Wachi, Kohei Miyaguchi, Takumi Tanabe, Rei Sato, Youhei Akimoto (University of Tsukuba)
- arXiv.org (arXiv)
- May 28, 2025
-
- WORKSHOP (DOMESTIC)
- 夜間景観画像の雰囲気評価定量予測
- 早川 季寿 (東京科学大学), 井手 海翔 (東京科学大学), 安納 爽響 (東京科学大学), 坪内 孝太, 下坂 正倫 (東京科学大学)
- 情報処理学会ユビキタスコンピューティングシステム研究会 (IPSJ SIGUBI)
- May 15, 2025
-
- WORKSHOP (INTERNATIONAL)
- Do Interpersonal Skills Affect Human-AI Collaboration Performance? A Study with ChatGPT
- Ryuki Nishioka (NAIST), Shoko Wakamiya (NAIST), Nobuyuki Shimizu, Sumio Fujita, Eiji Aramaki (NAIST)
- AutomationXP25: Hybrid Automation Experiences (AutomationXP25)
- April 27, 2025
-
- CONFERENCE (INTERNATIONAL)
- ReMoGPT: Part-Level Retrieval-Augmented Motion-Language Models
- Qing Yu, Mikihiro Tanaka, Kent Fujiwara
- The 39th Annual AAAI Conference on Artificial Intelligence (AAAI-25)
- April 11, 2025
-
- CONFERENCE (DOMESTIC)
- レビュー情報を用いた LLM による観光地比較表生成
- 辻本 陵 (奈良先端科学技術大学院大学), 坪内 孝太, 山下 達雄, 松田 裕貴 (岡山大学), 諏訪 博彦 (奈良先端科学技術大学院大学), 大内 啓樹 (奈良先端科学技術大学院大学)
- 言語処理学会第31回年次大会 (NLP2025)
- March 11, 2025
-
- CONFERENCE (DOMESTIC)
- LLMを用いたクロールデータからの人物略歴文抽出
- 中野 佑哉, 猪野 麻巳子, 二葉 知泰, 丸山 翼, 岸本 耀平, 永井 隆広
- 言語処理学会第31回年次大会 (NLP2025)
- March 03, 2025
-
- CONFERENCE (DOMESTIC)
- アドホック検索タスクにおけるモデルマージの効果検証
- 佐々木 泰河 (兵庫県立大), 山本 岳洋 (兵庫県立大), 大島 裕明 (兵庫県立大), 藤田 澄男
- 第17回データ工学と情報マネジメントに関するフォーラム(第23回日本データベース学会年次大会) (DEIM 2025)
- February 27, 2025
-
- OTHERS (INTERNATIONAL)
- Vulnerability Mitigation for Safety-Aligned Language Models via Debiasing
- Thien Q. Tran, Akifumi Wachi, Rei Sato, Takumi Tanabe, Youhei AKimoto (University of Tsukuba, RIKEN AIP)
- arXiv.org (arXiv)
- February 04, 2025
-
- CONFERENCE (INTERNATIONAL)
- Real-SRGD: Enhancing Real-World Image Super-Resolution with Classifier-Free Guided Diffusion
- Kenji Doi, Shuntaro Okada, Ryota Yoshihashi, Hirokatsu Kataoka
- 17th Asian Conference on Computer Vision (ACCV 2024)
- December 12, 2024
-
- CONFERENCE (INTERNATIONAL)
- Stepwise Alignment for Constrained Language Model Policy Optimization
- Akifumi Wachi, Thien Q. Tran, Rei Sato, Takumi Tanabe, Youhei Akimoto (University of Tsukuba)
- The 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024)
- December 11, 2024
-
- CONFERENCE (INTERNATIONAL)
- Local Curvature Smoothing with Stein's Identity for Efficient Score Matching
- Genki Osada, Makoto Shing (Sakana AI), Takashi Nishide (University of Tsukuba)
- The 38th Annual Conference on Neural Information Processing Systems (NeurIPS 2024)
- December 09, 2024
-
- OTHERS (DOMESTIC)
- LLMによるシーン中の物体の形容記述を用いた景観画像の印象予測
- 井手 海翔 (東京科学大学), 安納 爽響 (東京科学大学), 坪内 孝太, 下坂 正倫 (東京科学大学)
- 情報処理学会ユビキタスコンピューティングシステム研究会 (IPSJ SIGUBI)
- November 11, 2024
-
- CONFERENCE (INTERNATIONAL)
- Chronologically Accurate Retrieval for Temporal Grounding of Motion-Language Models
- Kent Fujiwara, Mikihiro Tanaka, Qing Yu
- The 18th European Conference on Computer Vision 2024 (ECCV 2024)
- September 29, 2024
-
- OTHERS (INTERNATIONAL)
- Description-based Controllable Text-to-Speech with Cross-Lingual Voice Control
- Ryuichi Yamamoto, Yuma Shirahata, Masaya Kawamura, Kentaro Tachibana
- arXiv.org (arXiv)
- September 27, 2024
-
- CONFERENCE (INTERNATIONAL)
- Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data
- Yuma Shirahata, Byeongseon Park, Ryuichi Yamamoto, Kentaro Tachibana
- The 25th Annual Conference of the International Speech Communication Association (INTERSPEECH 2024)
- September 04, 2024
-
- CONFERENCE (INTERNATIONAL)
- Universal Score-based Speech Enhancement with High Content Preservation
- Robin Scheibler, Yusuke Fujita, Yuma Shirahata, Tatsuya Komatsu
- The 25th Annual Conference of the International Speech Communication Association (INTERSPEECH 2024)
- September 01, 2024