LY Corporation R&D

JP
EN

Publications

Speech Processing

OTHERS (INTERNATIONAL)

End-to-End ASR and Audio Segmentation with Non-autoregressive Insertion-based model

Yuya Fujita, Shinji Watanabe (Johns Hopkins Univ.), Motoi Omachi

arXiv.org

December 18, 2020
CONFERENCE (INTERNATIONAL)

A Study on More Realistic Room Simulation for Far-Field Keyword Spotting

Eric Bezzam (EPFL/Sonos), Robin Scheibler, Cyril Cadoux (EPFL), Thibault Gisselbrecht (Sonos)

Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020 (APSIPA 2020)

December 07, 2020
CONFERENCE (INTERNATIONAL)

Computer-Resource-Aware Deep Speech Separation with a Run-Time-Specified Number of BLSTM Layers

Masahito Togami, Yoshiki Masuyama (Waseda University), Tatsuya Komatsu, Kazuyoshi Yoshii (Kyoto University), Tatsuya Kawahara (Kyoto University)

Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020 (APSIPA ASC 2020)

December 07, 2020
CONFERENCE (INTERNATIONAL)

Integration of Semi-Blind Speech Source Separation and Voice Activity Detection for Flexible Spoken Dialogue

Masaya Wake (Kyoto University), Masahito Togami, Kazuyoshi Yoshii (Kyoto University), Tatsuya Kawahara (Kyoto University)

Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020 (APSIPA ASC 2020)

December 07, 2020
CONFERENCE (INTERNATIONAL)

Over-determined Speech Source Separation and Dereverberation

Masahito Togami, Robin Scheibler

Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2020 (APSIPA ASC 2020)

December 07, 2020
JOURNAL (INTERNATIONAL)

Innovation, Standardization, and Business Success in Media Signal Processing

Akihiko Sugiyama and Masahiro Serizawa (NEC Corporation)

Institute of Electrical and Electronics Engineers, Consumer Electronics Magazine (MCE)

November 03, 2020
WORKSHOP (INTERNATIONAL)

Conformer-based sound event detection with semi-supervised learning and data augmentation

Koichi Miyazaki (Nagoya University), Tatsuya Komatsu, Tomoki Hayashi (Human Dataware Lab), Shinji Watanabe (Johns Hopkins University), Tomoki Toda (Nagoya University), Kazuya Takeda (Nagoya University)

Detection and Classification of Acoustic Scenes and Events (DCASE 2020)

November 02, 2020
CONFERENCE (INTERNATIONAL)

End-to-End ASR with Adaptive Span Self-Attention

Xuankai Chang (Johns Hopkins University), Aswin Shanmugam Subramanian (Johns Hopkins University), Pengcheng Guo (Northwestern Polytechnical University, Johns Hopkins University), Shinji Watanabe (Johns Hopkins University), Yuya Fujita, Motoi Omachi

INTERSPEECH 2020

October 25, 2020
CONFERENCE (INTERNATIONAL)

Generalized Minimal Distortion Principle for Blind Source Separation

Robin Scheibler

The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)

October 25, 2020
CONFERENCE (INTERNATIONAL)

Insertion-Based Modeling for End-to-End Automatic Speech Recognition

Yuya Fujita, Shinji Watanabe (Johns Hopkins University), Motoi Omachi, Xuankai Chang (Johns Hopkins University)

INTERSPEECH 2020

October 25, 2020
CONFERENCE (INTERNATIONAL)

Mentoring-Reverse Mentoring for Unsupervised Multi-channel Speech Source Separation

Yu Nakagome (Waseda University), Masahito Togami, Tetsuji Ogawa (Waseda University), Tetsunori Kobayashi (Waseda University)

The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)

October 25, 2020
CONFERENCE (INTERNATIONAL)

Neural Text-to-Speech with a Modeling-by-Generation Excitation Vocoder

Eunwoo Song (NAVER), Min-Jae Hwang (Search Solutions Inc.), Ryuichi Yamamoto, Jin-Seob Kim (NAVER), Ohsung Kwon (NAVER), Jae-Min Kim (NAVER)

The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)

October 25, 2020
CONFERENCE (INTERNATIONAL)

Sparseness-Aware DOA Estimation with Majorization Minimization

Masahito Togami, Robin Scheibler

The 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020)

October 25, 2020
CONFERENCE (DOMESTIC)

単語の表記と素性を同時出力するend-to-end音声認識

大町基, 藤田悠哉, 渡部晋治 (Johns Hopkins University), Xuankai Chang (Johns Hopkins University)

日本音響学会2020年秋季研究発表会 (音響学会)

September 11, 2020
CONFERENCE (DOMESTIC)

A Generalized Minimal Distortion Principle to Solve the Scale Ambiguity in Blind Source Separation

シャイブラーロビン

日本音響学会 2020年秋季研究発表会 (ASJ 2020 autumn)

September 09, 2020
CONFERENCE (DOMESTIC)

Mentoring-Reverse Mentoring：多チャンネル音源分離における教師なし学習のための知識伝搬フレームワーク

中込優 (早稲田大学), 戸上真人, 小川哲司 (早稲田大学), 小林哲則 (早稲田大学)

日本音響学会 2020年秋季研究発表会 (ASJ 2020 autumn)

September 09, 2020
CONFERENCE (DOMESTIC)

挿入操作に基づく End-to-End 音声認識

藤田悠哉, 渡部晋治 (Johns Hopkins Univ.), 大町基, Xuankai Chang (Johns Hopkins Univ.)

日本音響学会2020年秋季研究発表会 (音響学会)

September 09, 2020
OTHERS (INTERNATIONAL)

Insertion-Based Modeling for End-to-End Automatic Speech Recognition

Yuya Fujita, Shinji Watanabe (Johns Hopkins University), Motoi Omachi, Xuankai Chang (Johns Hopkins University)

arXiv.org

May 27, 2020
CONFERENCE (INTERNATIONAL)

Attention-based ASR with Lightweight and Dynamic Convolutions

Yuya Fujita, Aswin Shanmugam Subramanian (Johns Hopkins University), Motoi Omachi, Shinji Watanabe (Johns Hopkins University)

45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020)

May 08, 2020
CONFERENCE (INTERNATIONAL)

Consistency-Aware Multi-Channel Speech Enhancement Using Deep Neural Networks

Yoshiki Masuyama (Waseda University), Masahito Togami, Tatsuya Komatsu

2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2020)

May 04, 2020

prev

prev

1
…
9
10
11
…
13

next

next