Du Tran
Du Tran
Facebook AI
Verified email at fb.com - Homepage
Title
Cited by
Cited by
Year
Learning spatiotemporal features with 3d convolutional networks
D Tran, L Bourdev, R Fergus, L Torresani, M Paluri
2015 IEEE International Conference on Computer Vision (ICCV), 4489-4497, 2015
38002015
A closer look at spatiotemporal convolutions for action recognition
D Tran, H Wang, L Torresani, J Ray, Y LeCun, M Paluri
Proceedings of the IEEE conference on Computer Vision and Pattern …, 2018
5402018
Human activity recognition with metric learning
D Tran, A Sorokin, D Forsyth
Computer Vision–ECCV 2008, 548-561, 2008
3842008
Building an automatic vehicle license plate recognition system
TD Duan, TLH Du, TV Phuoc, NV Hoang
Proc. Int. Conf. Comput. Sci. RIVF, 59-63, 2005
3002005
C3D: generic features for video analysis
D Tran, LD Bourdev, R Fergus, L Torresani, M Paluri
CoRR, abs/1412.0767 2 (7), 8, 2014
2982014
Convnet architecture search for spatiotemporal feature learning
D Tran, J Ray, Z Shou, SF Chang, M Paluri
arXiv preprint arXiv:1708.05038, 2017
1922017
Combining Hough transform and contour algorithm for detecting vehicles' license-plates
TD Duan, DA Duc, TLH Du
Proceedings of 2004 International Symposium on Intelligent Multimedia, Video …, 2004
1662004
Detect-and-track: Efficient pose estimation in videos
R Girdhar, G Gkioxari, L Torresani, M Paluri, D Tran
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
1122018
Cooperative learning of audio and video models from self-supervised synchronization
B Korbar, D Tran, L Torresani
Advances in Neural Information Processing Systems, 7763-7774, 2018
109*2018
Deep end2end voxel2voxel prediction
D Tran, L Bourdev, R Fergus, L Torresani, M Paluri
Proceedings of the IEEE conference on computer vision and pattern …, 2016
1042016
Video event detection: From subvolume localization to spatiotemporal path search
D Tran, J Yuan, D Forsyth
IEEE transactions on pattern analysis and machine intelligence 36 (2), 404-416, 2013
972013
Video classification with channel-separated convolutional networks
D Tran, H Wang, L Torresani, M Feiszli
Proceedings of the IEEE International Conference on Computer Vision, 5552-5561, 2019
622019
Max-margin structured output regression for spatio-temporal action localization
D Tran, J Yuan
Advances in neural information processing systems, 350-358, 2012
602012
Optimal spatio-temporal path discovery for video event detection
D Tran, J Yuan
CVPR 2011, 3321-3328, 2011
562011
Large-scale weakly-supervised pre-training for video action recognition
D Ghadiyaram, D Tran, D Mahajan
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019
542019
Transformation-based models of video sequences
J Van Amersfoort, A Kannan, MA Ranzato, A Szlam, D Tran, S Chintala
arXiv preprint arXiv:1701.08435, 2017
442017
Self-supervised learning by cross-modal audio-video clustering
H Alwassel, D Mahajan, L Torresani, B Ghanem, D Tran
arXiv preprint arXiv:1911.12667, 2019
212019
What Makes Training Multi-Modal Classification Networks Hard?
W Wang, D Tran, M Feiszli
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
17*2020
Scsampler: Sampling salient clips from video for efficient action recognition
B Korbar, D Tran, L Torresani
Proceedings of the IEEE International Conference on Computer Vision, 6232-6242, 2019
172019
Distinit: Learning video representations without a single labeled video
R Girdhar, D Tran, L Torresani, D Ramanan
Proceedings of the IEEE International Conference on Computer Vision, 852-861, 2019
172019
The system can't perform the operation now. Try again later.
Articles 1–20