Follow
Tatsuya Komatsu
Tatsuya Komatsu
LINE Corporation
Verified email at linecorp.com
Title
Cited by
Cited by
Year
Weakly-supervised sound event detection with self-attention
K Miyazaki, T Komatsu, T Hayashi, S Watanabe, T Toda, K Takeda
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
902020
Convolution-augmented transformer for semi-supervised sound event detection
K Miyazaki, T Komatsu, T Hayashi, S Watanabe, T Toda, K Takeda
Proc. workshop detection classification Acoust. Scenes events (DCASE), 100-104, 2020
862020
Acoustic Event Detection Method Using Semi-Supervised Non-Negative Matrix Factorization with Mixtures of Local Dictionaries.
T Komatsu, T Toizumi, R Kondo, Y Senda
DCASE, 45-49, 2016
852016
Relaxing the conditional independence assumption of CTC-based ASR by conditioning on intermediate predictions
J Nozaki, T Komatsu
arXiv preprint arXiv:2104.02724, 2021
732021
Conformer-based sound event detection with semi-supervised learning and data augmentation
K Miyazaki, T Komatsu, T Hayashi, S Watanabe, T Toda, K Takeda
dim 1 (4), 2020
592020
A comparative study on non-autoregressive modelings for speech-to-text generation
Y Higuchi, N Chen, Y Fujita, H Inaguma, T Komatsu, J Lee, J Nozaki, ...
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 47-54, 2021
482021
Anomalous sound event detection based on wavenet
T Hayashi, T Komatsu, R Kondo, T Toda, K Takeda
2018 26th European Signal Processing Conference (EUSIPCO), 2494-2498, 2018
482018
Acoustic event detection based on non-negative matrix factorization with mixtures of local dictionaries and activation aggregation
T Komatsu, Y Senda, R Kondo
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
482016
Disentangled speaker and language representations using mutual information minimization and domain adaptation for cross-lingual TTS
D Xin, T Komatsu, S Takamichi, H Saruwatari
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
342021
Unsupervised training for deep speech source separation with Kullback-Leibler divergence based probabilistic loss function
M Togami, Y Masuyama, T Komatsu, Y Nakagome
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
302020
PromptTTS++: Controlling speaker identity in prompt-based text-to-speech using natural language descriptions
R Shimizu, R Yamamoto, M Kawamura, Y Shirahata, H Doi, T Komatsu, ...
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
152024
Neural diarization with non-autoregressive intermediate attractors
Y Fujita, T Komatsu, R Scheibler, Y Kida, T Ogawa
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
142023
Sound event localization and detection using convolutional recurrent neural networks and gated linear units
T Komatsu, M Togami, T Takahashi
2020 28th European Signal Processing Conference (EUSIPCO), 41-45, 2021
132021
Scene-dependent acoustic event detection with scene conditioning and fake-scene-conditioned loss
T Komatsu, K Imoto, M Togami
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
132020
Detection of anomaly acoustic scenes based on a temporal dissimilarity model
T Komatsu, R Kondo
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
132017
Initial stage of Pd adsorption on Si (111) 7× 7 surface studied by AES and EELS
S Nishigaki, T Komatsu, M Arimoto, M Sugihara
Surface science 167 (1), 27-38, 1986
131986
Robust acoustic scene classification to multiple devices using maximum classifier discrepancy and knowledge distillation
S Takeyama, T Komatsu, K Miyazaki, M Togami, S Ono
2020 28th European Signal Processing Conference (EUSIPCO), 36-40, 2021
112021
Multichannel loss function for supervised speech source separation by mask-based beamforming
Y Masuyama, M Togami, T Komatsu
arXiv preprint arXiv:1907.04984, 2019
112019
Acoustic event detection with classifier chains
T Komatsu, S Watanabe, K Miyazaki, T Hayashi
arXiv preprint arXiv:2202.08470, 2022
102022
MLP-based architecture with variable length input for automatic speech recognition
J Sakuma, T Komatsu, R Scheibler
102022
The system can't perform the operation now. Try again later.
Articles 1–20