Shuang Ma
Microsoft AI & Research
Verified email at microsoft.com
Title
Cited by
Year
A-lamp: Adaptive layout-aware multi-patch deep convolutional neural network for photo aesthetic assessment
S Ma, J Liu, C Wen Chen
Proceedings of the IEEE conference on computer vision and pattern …, 2017
Cited by: 174; Year: 2017
Da-gan: Instance-level image translation by deep attention generative adversarial networks
S Ma, J Fu, CW Chen, T Mei
Proceedings of the IEEE conference on computer vision and pattern …, 2018
Cited by: 151; Year: 2018
Active contrastive learning of audio-visual video representations
S Ma, Z Zeng, D McDuff, Y Song
arXiv preprint arXiv:2009.09805, 2020
Cited by: 45; Year: 2020
A generative adversarial network for style modeling in a text-to-speech system
S Ma, D McDuff, Y Song
International Conference on Learning Representations 2, 2019
Cited by: 43*; Year: 2019
Characterizing bias in classifiers using generative models
D McDuff, S Ma, Y Song, A Kapoor
Advances in neural information processing systems 32, 2019
Cited by: 30; Year: 2019
Multi-reference neural TTS stylization with adversarial cycle consistency
M Whitehill, S Ma, D McDuff, Y Song
arXiv preprint arXiv:1910.11958, 2019
Cited by: 29; Year: 2019
Contrastive learning of global and local video representations
S Ma, Z Zeng, D McDuff, Y Song
Advances in Neural Information Processing Systems 34, 7025-7040, 2021
Cited by: 23; Year: 2021
Pose maker: A pose recommendation system for person in the landscape photographing
S Ma, Y Fan, CW Chen
Proceedings of the 22nd ACM international conference on Multimedia, 1053-1056, 2014
Cited by: 20; Year: 2014
Unpaired image-to-speech synthesis with multimodal information bottleneck
S Ma, D McDuff, Y Song
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
Cited by: 17; Year: 2019
Finding your spot: A photography suggestion system for placing human in the scene
S Ma, Y Fan, CW Chen
2014 IEEE International Conference on Image Processing (ICIP), 556-560, 2014
Cited by: 13; Year: 2014
Causalcity: Complex simulations with agency for causal discovery and reasoning
D McDuff, Y Song, J Lee, V Vineet, S Vemprala, NA Gyde, H Salman, ...
Conference on Causal Learning and Reasoning, 559-575, 2022
Cited by: 12; Year: 2022
Reshaping robot trajectories using natural language commands: A study of multi-modal data alignment using transformers
A Bucker, L Figueredo, S Haddadin, A Kapoor, S Ma, R Bonatti
2022 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2022
Cited by: 9; Year: 2022
Compass: Contrastive multimodal pretraining for autonomous systems
S Ma, S Vemprala, W Wang, JK Gupta, Y Song, D McDuff, A Kapoor
2022 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2022
Cited by: 4; Year: 2022
Learning audio-visual representations with active contrastive coding
S Ma, Z Zeng, D McDuff, Y Song
arXiv preprint arXiv:2009.09805 2, 2020
Cited by: 4; Year: 2020
M3D-GAN: Multi-modal multi-domain translation with universal attention
S Ma, D McDuff, Y Song
arXiv preprint arXiv:1907.04378, 2019
Cited by: 3; Year: 2019
D-Sempre: Learning deep semantic-preserving embeddings for user interests-social contents modeling
S Ma, CW Chen
arXiv preprint arXiv:1802.06451, 2018
Cited by: 3; Year: 2018
Approach for license plate location using edge features filter and multi-decision mechanism
MA Shuang, C Jiangning, LU Hu
Computer Engineering and Applications 50 (9), 145-149, 2014
Cited by: 3; Year: 2014
LaTTe: Language Trajectory TransformEr
A Bucker, L Figueredo, S Haddadin, A Kapoor, S Ma, R Bonatti
arXiv preprint arXiv:2208.02918, 2022
Cited by: 2; Year: 2022
Automatic creation of magazine-page-like social media visual summary for mobile browsing
S Ma, CW Chen
2016 IEEE International Conference on Image Processing (ICIP), 469-473, 2016
Cited by: 2; Year: 2016
Pact: Perception-action causal transformer for autoregressive robotics pre-training
R Bonatti, S Vemprala, S Ma, F Frujeri, S Chen, A Kapoor
arXiv preprint arXiv:2209.11133, 2022
Cited by: 1; Year: 2022