Zhilin Yang

Cited by

	All	Since 2019
Citations	25007	23572
h-index	28	27
i10-index	37	36

7000

3500

1750

5250

20162017201820192020202120222023202469 335 811 1741 3726 4501 4934 6398 2262

Public access

View all

14 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Ruslan SalakhutdinovUPMC Professor, Machine Learning Department, CMUVerified email at cs.cmu.edu
William W. CohenCMU and Google DeepMindVerified email at google.com
Quoc V. LeResearch Scientist, GoogleVerified email at stanford.edu
Tang JieProfessor, Tsinghua UniversityVerified email at tsinghua.edu.cn
Yutao ZhangMoonshot AIVerified email at mails.tsinghua.edu.cn
Peng QiScientist, Amazon AWS AIVerified email at amazon.com
Yoshua BengioProfessor of computer science, University of Montreal, Mila, IVADO, CIFARVerified email at umontreal.ca
Christopher D ManningProfessor of Computer Science and Linguistics, Stanford UniversityVerified email at stanford.edu
Kaiming HeAssociate Professor, EECS, MITVerified email at mit.edu
Yann LeCunChief AI Scientist at Facebook & Silver Professor at the Courant Institute, New York UniversityVerified email at cs.nyu.edu
Junbo ZhaoZhejiang University, ZJU100 Young ProfessorVerified email at zju.edu.cn
Jason WestonMetaVerified email at fb.com
Saizheng Zhang
Zihang Dai

Zhilin Yang

Carnegie Mellon University

Verified email at cs.cmu.edu - Homepage

Deep Learning Machine Learning Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Xlnet: Generalized autoregressive pretraining for language understanding Z Yang, Z Dai, Y Yang, J Carbonell, RR Salakhutdinov, QV Le Advances in neural information processing systems 32, 2019	10061*	2019
Transformer-xl: Attentive language models beyond a fixed-length context Z Dai, Z Yang, Y Yang, J Carbonell, QV Le, R Salakhutdinov arXiv preprint arXiv:1901.02860, 2019	3935	2019
Revisiting semi-supervised learning with graph embeddings Z Yang, W Cohen, R Salakhudinov International conference on machine learning, 40-48, 2016	2059	2016
HotpotQA: A dataset for diverse, explainable multi-hop question answering Z Yang, P Qi, S Zhang, Y Bengio, WW Cohen, R Salakhutdinov, ... arXiv preprint arXiv:1809.09600, 2018	1699	2018
GPT understands, too X Liu, Y Zheng, Z Du, M Ding, Y Qian, Z Yang, J Tang AI Open, 2023	1051*	2023
Glm: General language model pretraining with autoregressive blank infilling Z Du, Y Qian, X Liu, M Ding, J Qiu, Z Yang, J Tang arXiv preprint arXiv:2103.10360, 2021	735	2021
P-tuning v2: Prompt tuning can be comparable to fine-tuning universally across scales and tasks X Liu, K Ji, Y Fu, WL Tam, Z Du, Z Yang, J Tang arXiv preprint arXiv:2110.07602, 2021	699	2021
Differentiable learning of logical rules for knowledge base reasoning F Yang, Z Yang, WW Cohen Advances in neural information processing systems 30, 2017	637	2017
Multi-task cross-lingual sequence tagging from scratch Z Yang, R Salakhutdinov, W Cohen arXiv preprint arXiv:1603.06270, 2016	632*	2016
Good semi-supervised learning that requires a bad gan Z Dai, Z Yang, F Yang, WW Cohen, RR Salakhutdinov Advances in neural information processing systems 30, 2017	553	2017
Gated-Attention Readers for Text Comprehension B Dhingra, H Liu, Z Yang, WW Cohen, R Salakhutdinov arXiv preprint arXiv:1606.01549, 2016	452	2016
Review networks for caption generation Z Yang, Y Yuan, Y Wu, WW Cohen, RR Salakhutdinov Advances in neural information processing systems 29, 2016	394*	2016
Breaking the softmax bottleneck: A high-rank RNN language model Z Yang, Z Dai, R Salakhutdinov, WW Cohen arXiv preprint arXiv:1711.03953, 2017	393	2017
Cosnet: Connecting heterogeneous social networks with local and global consistency Y Zhang, J Tang, Z Yang, J Pei, PS Yu Proceedings of the 21th ACM SIGKDD international conference on knowledge …, 2015	346	2015
Neural cross-lingual named entity recognition with minimal resources J Xie, Z Yang, G Neubig, NA Smith, J Carbonell arXiv preprint arXiv:1808.09861, 2018	200	2018
Semi-supervised QA with generative domain-adaptive nets Z Yang, J Hu, R Salakhutdinov, WW Cohen arXiv preprint arXiv:1702.02206, 2017	184	2017
Linguistic knowledge as memory for recurrent neural networks B Dhingra, Z Yang, WW Cohen, R Salakhutdinov arXiv preprint arXiv:1703.02620, 2017	149*	2017
Codegeex: A pre-trained model for code generation with multilingual evaluations on humaneval-x Q Zheng, X Xia, X Zou, Y Dong, S Wang, Y Xue, Z Wang, L Shen, A Wang, ... arXiv preprint arXiv:2303.17568, 2023	122	2023
Words or characters? fine-grained gating for reading comprehension Z Yang, B Dhingra, Y Yuan, J Hu, WW Cohen, R Salakhutdinov arXiv preprint arXiv:1611.01724, 2016	97	2016
Flipda: Effective and robust data augmentation for few-shot learning J Zhou, Y Zheng, J Tang, J Li, Z Yang arXiv preprint arXiv:2108.06332, 2021	65	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors