Shuang Ma
Microsoft AI & Research
Verified email at buffalo.edu
Title
Cited by
Year
A-lamp: Adaptive layout-aware multi-patch deep convolutional neural network for photo aesthetic assessment
S Ma, J Liu, C Wen Chen
Proceedings of the IEEE conference on computer vision and pattern …, 2017
152 2017
Da-gan: Instance-level image translation by deep attention generative adversarial networks
S Ma, J Fu, CW Chen, T Mei
Proceedings of the IEEE conference on computer vision and pattern …, 2018
143 2018
A generative adversarial network for style modeling in a text-to-speech system
S Ma, D Mcduff, Y Song
International Conference on Learning Representations 2, 2019
33* 2019
Active contrastive learning of audio-visual video representations
S Ma, Z Zeng, D McDuff, Y Song
arXiv preprint arXiv:2009.09805, 2020
27 2020
Characterizing bias in classifiers using generative models
D McDuff, S Ma, Y Song, A Kapoor
Advances in Neural Information Processing Systems 32, 2019
25 2019
Multi-reference neural TTS stylization with adversarial cycle consistency
M Whitehill, S Ma, D McDuff, Y Song
arXiv preprint arXiv:1910.11958, 2019
22 2019
Pose maker: A pose recommendation system for person in the landscape photographing
S Ma, Y Fan, CW Chen
Proceedings of the 22nd ACM international conference on Multimedia, 1053-1056, 2014
19 2014
Unpaired image-to-speech synthesis with multimodal information bottleneck
S Ma, D McDuff, Y Song
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
12 2019
Finding your spot: A photography suggestion system for placing human in the scene
S Ma, Y Fan, CW Chen
2014 IEEE International Conference on Image Processing (ICIP), 556-560, 2014
11 2014
Causalcity: Complex simulations with agency for causal discovery and reasoning
D McDuff, Y Song, J Lee, V Vineet, S Vemprala, NA Gyde, H Salman, ...
Conference on Causal Learning and Reasoning, 559-575, 2022
8 2022
Contrastive learning of global and local audio-visual representations
S Ma, Z Zeng, D McDuff, Y Song
arXiv preprint arXiv:2104.05418, 2021
6 2021
Learning audio-visual representations with active contrastive coding
S Ma, Z Zeng, D McDuff, Y Song
arXiv preprint arXiv:2009.09805 2, 2020
4 2020
M3D-GAN: Multi-modal multi-domain translation with universal attention
S Ma, D McDuff, Y Song
arXiv preprint arXiv:1907.04378, 2019
3 2019
D-Sempre: Learning deep semantic-preserving embeddings for user interests-social contents modeling
S Ma, CW Chen
arXiv preprint arXiv:1802.06451, 2018
3 2018
Approach for license plate location using edge features filter and multi-decision mechanism
MA Shuang, C Jiangning, LU Hu
Computer Engineering and Applications 50 (9), 145-149, 2014
3 2014
COMPASS: Contrastive Multimodal Pretraining for Autonomous Systems
S Ma, S Vemprala, W Wang, JK Gupta, Y Song, D McDuff, A Kapoor
arXiv preprint arXiv:2203.15788, 2022
1 2022
Automatic creation of magazine-page-like social media visual summary for mobile browsing
S Ma, CW Chen
2016 IEEE International Conference on Image Processing (ICIP), 469-473, 2016
1 2016
Reshaping Robot Trajectories Using Natural Language Commands: A Study of Multi-Modal Data Alignment Using Transformers
A Bucker, L Figueredo, S Haddadin, A Kapoor, S Ma, R Bonatti
arXiv preprint arXiv:2203.13411, 2022
2022
Learning to Build Multimodal Intelligence across Vision, Language and Speech
S Ma
State University of New York at Buffalo, 2019
2019
A video-based license plate recognition method
2014