Yu Wu (吴俣)
Other names: Yu Wu, Y. Wu
DeepSeek AI
Verified email at deepseek.com
Title
Cited by
Year
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
S Chen, C Wang, Z Chen, Y Wu, S Liu, Z Chen, J Li, N Kanda, T Yoshioka, ...
IEEE Journal of Selected Topics in Signal Processing, 2021
Cited by: 1962; Year: 2021
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers
S Chen, C Wang, Y Wu, Z Zhang, L Zhou, S Liu, Z Chen, Y Liu, H Wang, ...
IEEE Transactions on Audio, Speech and Language Processing, 2025
Cited by: 683*; Year: 2025
Sequential Matching Network: A New Architecture for Multi-turn Response Selection in Retrieval-Based Chatbots
Y Wu, W Wu, C Xing, M Zhou, Z Li
Proc. ACL, 496-505, 2017
Cited by: 644; Year: 2017
DeepSeek-Coder: When the Large Language Model Meets Programming--The Rise of Code Intelligence
D Guo, Q Zhu, D Yang, Z Xie, K Dong, W Zhang, G Chen, X Bi, Y Wu, ...
arXiv preprint arXiv:2401.14196, 2024
Cited by: 591; Year: 2024
Topic Aware Neural Response Generation
C Xing, W Wu, Y Wu, J Liu, Y Huang, M Zhou, WY Ma
Proc. AAAI, 3351-3357, 2017
Cited by: 589; Year: 2017
Template-Based Named Entity Recognition Using BART
L Cui, Y Wu, J Liu, S Yang, Y Zhang
ACL 2021, 2021
Cited by: 399; Year: 2021
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
D Guo, D Yang, H Zhang, J Song, R Zhang, R Xu, Q Zhu, S Ma, P Wang, ...
arXiv preprint arXiv:2501.12948, 2025
Cited by: 324; Year: 2025
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Z Shao, P Wang, Q Zhu, R Xu, J Song, M Zhang, YK Li, Y Wu, D Guo
arXiv preprint arXiv:2402.03300, 2024
Cited by: 317; Year: 2024
BEATs: Audio Pre-Training with Acoustic Tokenizers
S Chen, Y Wu, C Wang, S Liu, D Tompkins, Z Chen, F Wei
Proc. ICML 2023, 2022
Cited by: 313; Year: 2022
SpeechT5: Unified-Modal Encoder-Decoder Pre-training for Spoken Language Processing
J Ao, R Wang, L Zhou, S Liu, S Ren, Y Wu, T Ko, Q Li, Y Zhang, Z Wei, ...
ACL 2022, 2021
Cited by: 262; Year: 2021
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
X Bi, D Chen, G Chen, S Chen, D Dai, C Deng, H Ding, K Dong, Q Du, ...
arXiv preprint arXiv:2401.02954, 2024
Cited by: 252; Year: 2024
DeepSeek-V3 Technical Report
A Liu, B Feng, B Xue, B Wang, B Wu, C Lu, C Zhao, C Deng, C Zhang, ...
arXiv preprint arXiv:2412.19437, 2024
Cited by: 236; Year: 2024
Math-Shepherd: Verify and Reinforce LLMs Step-by-Step without Human Annotations
P Wang, L Li, Z Shao, RX Xu, D Dai, Y Li, D Chen, Y Wu, Z Sui
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
Cited by: 235*; Year: 2024
Hierarchical Recurrent Attention Network for Response Generation
C Xing, W Wu, Y Wu, M Zhou, Y Huang, WY Ma
Proc. AAAI, 5610-5617, 2018
Cited by: 235; Year: 2018
Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset
X Chen, Y Wu, Z Wang, S Liu, J Li
ICASSP 2021, 2020
Cited by: 224; Year: 2020
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
D Dai, C Deng, C Zhao, RX Xu, H Gao, D Chen, J Li, W Zeng, X Yu, Y Wu, ...
Proceedings of the 62nd Annual Meeting of the Association for Computational …, 2024
Cited by: 192; Year: 2024
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Q Zhu, D Guo, Z Shao, D Yang, P Wang, R Xu, Y Wu, Y Li, H Gao, S Ma, ...
arXiv preprint arXiv:2406.11931, 2024
Cited by: 179*; Year: 2024
Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling
Z Zhang, L Zhou, C Wang, S Chen, Y Wu, S Liu, Z Chen, Y Liu, H Wang, ...
arXiv preprint arXiv:2303.03926, 2023
Cited by: 178; Year: 2023
Keyphrase Generation with Correlation Constraints
J Chen, X Zhang, Y Wu, Z Yan, Z Li
Proc. EMNLP, 4057-4066, 2018
Cited by: 173; Year: 2018
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
A Liu, B Feng, B Wang, B Wang, B Liu, C Zhao, C Deng, C Ruan, D Dai, ...
arXiv preprint arXiv:2405.04434, 2024
Cited by: 166; Year: 2024