Follow
Yikang Shen
Yikang Shen
MIT-IBM Watson Lab
Verified email at ibm.com
Title
Cited by
Cited by
Year
Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks
Y Shen, S Tan, A Sordoni, A Courville
ICLR 2019, 2019
3192019
Long range arena: A benchmark for efficient transformers
Y Tay, M Dehghani, S Abnar, Y Shen, D Bahri, P Pham, J Rao, L Yang, ...
ICLR 2021, 2020
2752020
Banditsum: Extractive summarization as a contextual bandit
Y Dong, Y Shen, E Crawford, H van Hoof, JCK Cheung
EMNLP 2018, 2018
1822018
Neural language modeling by jointly learning syntax and lexicon
Y Shen, Z Lin, CW Huang, A Courville
ICLR 2018, 2017
1652017
Straight to the tree: Constituency parsing with neural syntactic distance
Y Shen, Z Lin, AP Jacob, A Sordoni, A Courville, Y Bengio
ACL 2018, 2018
832018
Question/answer matching for CQA system via combining lexical and sequential information
Y Shen, W Rong, Z Sun, Y Ouyang, Z Xiong
AAAI 2015, 2015
672015
Convolutional neural network based sentiment analysis using Adaboost combination
Y Gao, W Rong, Y Shen, Z Xiong
2016 International Joint Conference on Neural Networks (IJCNN), 1333-1338, 2016
532016
Word embedding based correlation model for question/answer matching
Y Shen, W Rong, N Jiang, B Peng, J Tang, Z Xiong
AAAI 2017 31 (1), 2017
522017
Structformer: Joint unsupervised induction of dependency and constituency structure from masked language modeling
Y Shen, Y Tay, C Zheng, D Bahri, D Metzler, A Courville
ACL 2021, 2020
262020
Prompting decision transformer for few-shot policy generalization
M Xu, Y Shen, S Zhang, Y Lu, D Zhao, J Tenenbaum, C Gan
International Conference on Machine Learning, 24631-24645, 2022
252022
Ordered memory
Y Shen, S Tan, A Hosseini, Z Lin, A Sordoni, A Courville
NeurIPS 2019, 2019
172019
Exploiting syntactic structure for better language modeling: A syntactic distance approach
W Du, Z Lin, Y Shen, TJ O'Donnell, Y Bengio, Y Zhang
ACL 2020, 2020
152020
Learning Task Decomposition with Ordered Memory Policy Network
Y Lu, Y Shen, S Zhou, A Courville, JB Tenenbaum, C Gan
International Conference on Learning Representations, 2021
142021
Self-organized hierarchical softmax
Y Shen, S Tan, C Pal, A Courville
arXiv preprint arXiv:1707.08588, 2017
82017
Investigating biases in textual entailment datasets
S Tan, Y Shen, C Huang, A Courville
arXiv preprint arXiv:1906.09635, 2019
72019
Planning with large language models for code generation
S Zhang, Z Chen, Y Shen, M Ding, JB Tenenbaum, C Gan
arXiv preprint arXiv:2303.05510, 2023
52023
Explicitly Modeling Syntax in Language Models with Incremental Parsing and a Dynamic Oracle
Y Shen, S Tan, A Sordoni, S Reddy, A Courville
NAACL 2021, 1660-1672, 2021
52021
Exploration of Tree-based Hierarchical Softmax for Recurrent Language Models.
N Jiang, W Rong, M Gao, Y Shen, Z Xiong, N Jiang, W Rong, M Gao, ...
IJCAI, 1951-1957, 2017
52017
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
Z Sun, Y Shen, Q Zhou, H Zhang, Z Chen, D Cox, Y Yang, C Gan
arXiv preprint arXiv:2305.03047, 2023
42023
Unsupervised Dependency Graph Network
Y Shen, S Tan, A Sordoni, P Li, J Zhou, A Courville
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
42022
The system can't perform the operation now. Try again later.
Articles 1–20