Conv-TasNet: Surpassing ideal time–frequency magnitude masking for speech separation Y Luo, N Mesgarani IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (8), 1256 …, 2019 | 365 | 2019 |
Deep attractor network for single-microphone speaker separation Z Chen, Y Luo, N Mesgarani https://arxiv.org/abs/1611.08930, 2016 | 285 | 2016 |
TasNet: time-domain audio separation network for real-time, single-channel speech separation Y Luo, N Mesgarani 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 207 | 2018 |
Speaker-independent speech separation with deep attractor network Y Luo, Z Chen, N Mesgarani IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (4), 787-796, 2018 | 145 | 2018 |
Deep Clustering and Conventional Networks for Music Separation: Stronger Together Y Luo, Z Chen, JR Hershey, JL Roux, N Mesgarani https://arxiv.org/abs/1611.06265, 2016 | 123 | 2016 |
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation Y Luo, Z Chen, T Yoshioka ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 83 | 2020 |
Continuous speech separation: dataset and analysis Z Chen, T Yoshioka, L Lu, T Zhou, Z Meng, Y Luo, J Wu, X Xiao, J Li ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 27 | 2020 |
Real-time single-channel dereverberation and separation with time-domain audio separation network. Y Luo, N Mesgarani Interspeech, 342-346, 2018 | 27 | 2018 |
Speaker-independent auditory attention decoding without access to clean speech sources C Han, J O’Sullivan, Y Luo, J Herrero, AD Mehta, N Mesgarani Science Advances 5 (5), eaav6134, 2019 | 26 | 2019 |
FaSNet: Low-latency adaptive beamforming for multi-microphone audio processing Y Luo, C Han, N Mesgarani, E Ceolini, SC Liu 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 23 | 2019 |
End-to-end microphone permutation and number invariant multi-channel speech separation Y Luo, Z Chen, N Mesgarani, T Yoshioka ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 22 | 2020 |
Music Source Activity Detection and Separation Using Deep Attractor Network. R Kumar, Y Luo, N Mesgarani Interspeech, 347-351, 2018 | 12 | 2018 |
Systems and methods for speech separation and neural decoding of attentional selection in multi-speaker environments N Mesgarani, Y Luo, J O'Sullivan, Z Chen US Patent App. 16/169,194, 2019 | 9 | 2019 |
Online deep attractor network for real-time single-channel speech separation C Han, Y Luo, N Mesgarani ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 8 | 2019 |
Separating varying numbers of sources with auxiliary autoencoding loss Y Luo, N Mesgarani arXiv preprint arXiv:2003.12326, 2020 | 7 | 2020 |
Augmented time-frequency mask estimation in cluster-based source separation algorithms Y Luo, N Mesgarani ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 5 | 2019 |
Real-time binaural speech separation with preserved spatial cues C Han, Y Luo, N Mesgarani ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 4 | 2020 |
Distortion-controlled training for end-to-end reverberant speech separation with auxiliary autoencoding loss Y Luo, C Han, N Mesgarani arXiv preprint arXiv:2011.07338, 2020 | 3 | 2020 |
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis D Raj, P Denisov, Z Chen, H Erdogan, Z Huang, M He, S Watanabe, J Du, ... arXiv preprint arXiv:2011.02014, 2020 | 3 | 2020 |
Deep clustering for singing voice separation Y Luo, Z Chen, DPW Ellis MIREX, Task of Singing Voice Separation, 1-2, 2016 | 3 | 2016 |