Title | Authors | Venue | Cited by | Year |
Achieving fairness in the stochastic multi-armed bandit problem | V Patil, G Ghalme, V Nair, Y Narahari | Journal of Machine Learning Research 22 (174), 1-31, 2021 | 117 | 2021 |
Budgeted and non-budgeted causal bandits | V Nair, V Patil, G Sinha | International Conference on Artificial Intelligence and Statistics, 2017-2025, 2021 | 33 | 2021 |
Multi-armed bandits with bounded arm-memory: Near-optimal guarantees for best-arm identification and regret minimization | A Maiti, V Patil, A Khan | Advances in Neural Information Processing Systems 34, 19553-19565, 2021 | 12* | 2021 |
Long-term resource allocation fairness in average Markov decision process (AMDP) environment | G Ghalme, V Nair, V Patil, Y Zhou | arXiv preprint arXiv:2102.07120, 2021 | 7* | 2021 |
Mitigating disparity while maximizing reward: tight anytime guarantee for improving bandits | V Patil, V Nair, G Ghalme, A Khan | arXiv preprint arXiv:2208.09254, 2022 | 1 | 2022 |