Multi-accdoa: Localizing and detecting overlapping sounds from the same class with auxiliary duplicating permutation invariant training K Shimada, Y Koyama, S Takahashi, N Takahashi, E Tsunoo, Y Mitsufuji ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 90 | 2022 |
Transformer ASR with contextual block processing E Tsunoo, Y Kashiwagi, T Kumakura, S Watanabe 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 75 | 2019 |
Streaming transformer asr with blockwise synchronous beam search E Tsunoo, Y Kashiwagi, S Watanabe 2021 IEEE Spoken Language Technology Workshop (SLT), 22-29, 2021 | 55 | 2021 |
Beyond timbral statistics: Improving music classification using percussive patterns and bass lines E Tsunoo, G Tzanetakis, N Ono, S Sagayama IEEE Transactions on Audio, Speech, and Language Processing 19 (4), 1003-1014, 2010 | 50* | 2010 |
Harmonic and percussive sound separation and its application to MIR-related tasks N Ono, K Miyamoto, H Kameoka, J Le Roux, Y Uchiyama, E Tsunoo, ... Advances in music information retrieval, 213-236, 2010 | 45 | 2010 |
Audio genre classification using percussive pattern clustering combined with timbral features E Tsunoo, G Tzanetakis, N Ono, S Sagayama 2009 IEEE International Conference on Multimedia and Expo, 382-385, 2009 | 40 | 2009 |
Towards online end-to-end transformer automatic speech recognition E Tsunoo, Y Kashiwagi, T Kumakura, S Watanabe arXiv preprint arXiv:1910.11871, 2019 | 38 | 2019 |
Autoregressive MFCC Models for Genre Classification Improved by Harmonic-percussion Separation. H Rump, S Miyabe, E Tsunoo, N Ono, S Sagayama ISMIR, 87-92, 2010 | 36 | 2010 |
Information processing device, method of information processing, and program Y Taki, S Kawano, T Shibuya, E Tsunoo US Patent 10,546,582, 2020 | 35 | 2020 |
Rhythm map: Extraction of unit rhythmic patterns and analysis of rhythmic structure from music acoustic signals E Tsunoo, N Ono, S Sagayama 2009 IEEE International Conference on Acoustics, Speech and Signal …, 2009 | 33 | 2009 |
Ensemble of ACCDOA-and EINV2-based systems with D3Nets and impulse response simulation for sound event localization and detection K Shimada, N Takahashi, Y Koyama, S Takahashi, E Tsunoo, ... arXiv preprint arXiv:2106.10806, 2021 | 29 | 2021 |
Hierarchical Recurrent Neural Network for Story Segmentation. E Tsunoo, P Bell, S Renals INTERSPEECH, 2919-2923, 2017 | 25 | 2017 |
Music mood classification by rhythm and bass-line unit pattern analysis E Tsunoo, T Akase, N Ono, S Sagayama 2010 IEEE International Conference on Acoustics, Speech and Signal …, 2010 | 24 | 2010 |
Musical Bass-Line Pattern Clustering and Its Application to Audio Genre Classification. E Tsunoo, N Ono, S Sagayama ISMIR, 219-224, 2009 | 21 | 2009 |
Spatial data augmentation with simulated room impulse responses for sound event localization and detection Y Koyama, K Shigemi, M Takahashi, K Shimada, N Takahashi, E Tsunoo, ... ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 15 | 2022 |
Making punctuation restoration robust and fast with multi-task learning and knowledge distillation M Hentschel, E Tsunoo, T Okuda ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 15 | 2021 |
Residual language model for end-to-end speech recognition E Tsunoo, Y Kashiwagi, C Narisetty, S Watanabe arXiv preprint arXiv:2206.07430, 2022 | 14 | 2022 |
Joint speech recognition and audio captioning C Narisetty, E Tsunoo, X Chang, Y Kashiwagi, M Hentschel, S Watanabe ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 11 | 2022 |
Streaming transformer ASR with blockwise synchronous inference E Tsunoo, Y Kashiwagi, S Watanabe arXiv preprint arXiv:2006.14941, 2020 | 11 | 2020 |
Polyphone disambiguation and accent prediction using pre-trained language models in Japanese TTS front-end R Hida, M Hamada, C Kamada, E Tsunoo, T Sekiya, T Kumakura ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 10 | 2022 |