Huaming Wang

Geciteerd door

	Alles	Sinds 2019
Citaties	1761	1389
h-index	17	15
i10-index	18	16

620

310

155

465

201520162017201820192020202120222023202424 100 113 129 79 75 80 114 420 610

Medeauteurs

Zhuo ChenBytedance (formerly Microsoft, Columbia University)Geverifieerd e-mailadres voor columbia.edu
Takuya YoshiokaAssemblyAIGeverifieerd e-mailadres voor assemblyai.com
Sefik Emre EskimezMicrosoftGeverifieerd e-mailadres voor microsoft.com

Volgen

Huaming Wang

Partner Group Engineering Manager, Microsoft

Geverifieerd e-mailadres voor microsoft.com

Artificial Intelligence Audio and Speech Processing Signal Processing


Titel Sorteren op citaties Sorteren op jaar Sorteren op titel	Geciteerd door Geciteerd door	Jaar
An introduction to computational networks and the computational network toolkit D Yu, A Eversole, M Seltzer, K Yao, Z Huang, B Guenter, O Kuchaiev, ... Microsoft Technical Report MSR-TR-2014–112, 2014	472	2014
Neural codec language models are zero-shot text to speech synthesizers C Wang, S Chen, Y Wu, Z Zhang, L Zhou, S Liu, Z Chen, Y Liu, H Wang, ... arXiv preprint arXiv:2301.02111, 2023	379	2023
Clap learning audio concepts from natural language supervision B Elizalde, S Deshmukh, M Al Ismail, H Wang ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	246	2023
Speak foreign languages with your own voice: Cross-lingual neural codec language modeling Z Zhang, L Zhou, C Wang, S Chen, Y Wu, S Liu, Z Chen, Y Liu, H Wang, ... arXiv preprint arXiv:2303.03926, 2023	90	2023
Advances in online audio-visual meeting transcription T Yoshioka, I Abramovski, C Aksoylar, Z Chen, M David, D Dimitriadis, ... 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	85	2019
Multi-channel speech separation Z Chen, J Li, X Xiao, T Yoshioka, H Wang, Z Wang, Y Gong US Patent 10,839,822, 2020	73	2020
Personalized speech enhancement: New models and comprehensive evaluation SE Eskimez, T Yoshioka, H Wang, X Wang, Z Chen, X Huang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	58	2022
Pengi: An audio language model for audio tasks S Deshmukh, B Elizalde, R Singh, H Wang Advances in Neural Information Processing Systems 36, 18090-18108, 2023	56	2023
Cracking the cocktail party problem by multi-beam deep attractor network Z Chen, J Li, X Xiao, T Yoshioka, H Wang, Z Wang, Y Gong 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017	52	2017
Audio retrieval with wavtext5k and clap training S Deshmukh, B Elizalde, H Wang arXiv preprint arXiv:2209.14275, 2022	41	2022
Fast real-time personalized speech enhancement: End-to-end enhancement network (E3Net) and knowledge distillation M Thakker, SE Eskimez, T Yoshioka, H Wang arXiv preprint arXiv:2204.00771, 2022	30	2022
Human listening and live captioning: Multi-task training for speech enhancement SE Eskimez, X Wang, M Tang, H Yang, Z Zhu, Z Chen, H Wang, ... arXiv preprint arXiv:2106.02896, 2021	26	2021
An introduction to computational networks and the computational network toolkit Y Dong, E Adam, S Mike, Y Kaisheng, H Zhi-Heng, G Brian, K Oleksii, ... Tech. Rep. MSR-TR-2014-112, 2014	24	2014
One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement H Taherian, SE Eskimez, T Yoshioka, H Wang, Z Chen, X Huang ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	22	2022
An overview of microsoft deep qa system on stanford webquestions benchmark Z Wang, S Yan, H Wang, X Huang 2018-09-15]. https://www. microsoft. com/en-us/research/publication/an …, 2014	19	2014
Natural language supervision for general-purpose audio representations B Elizalde, S Deshmukh, H Wang ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	18	2024
Artificial intelligence system utilizing microphone array and fisheye camera Z Wang, X Huang, L Qin, K Wu, H Wang US Patent App. 15/885,518, 2019	17	2019
Online verification of custom wake word K Shahid, K Kumar, T Yi, V Miljanic, H Wang, Y Gong, HA Khalil US Patent 11,158,305, 2021	11	2021
Describing emotions with acoustic property prompts for speech emotion recognition H Dhamyal, B Elizalde, S Deshmukh, H Wang, B Raj, R Singh arXiv preprint arXiv:2211.07737, 2022	9	2022
An introduction to computational networks and the computational network toolkit (invited talk). D Yu, A Eversole, ML Seltzer, K Yao, B Guenter, O Kuchaiev, F Seide, ... INTERSPEECH, 2014	8	2014

Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.

Artikelen 1–20

Citaties per jaar

Dubbele citaties

Samengevoegde citaties

Medeauteurs toevoegenMedeauteurs

Volgen

Geciteerd door

Medeauteurs