Valerii Likhosherstov
Verified email at cam.ac.uk
Title
Cited by
Year
Rethinking attention with performers
K Choromanski, V Likhosherstov, D Dohan, X Song, A Gane, T Sarlos, ...
arXiv preprint arXiv:2009.14794, 2020
Cited by: 661; Year: 2020
Masked language modeling for proteins via linearly scalable long-context transformers
K Choromanski, V Likhosherstov, D Dohan, X Song, A Gane, T Sarlos, ...
arXiv preprint arXiv:2006.03555, 2020
Cited by: 54; Year: 2020
PolyViT: Co-training vision transformers on images, videos and audio
V Likhosherstov, A Arnab, K Choromanski, M Lucic, Y Tay, A Weller, ...
arXiv preprint arXiv:2111.12993, 2021
Cited by: 24; Year: 2021
Ode to an ODE
KM Choromanski, JQ Davis, V Likhosherstov, X Song, JJ Slotine, J Varley, ...
Advances in Neural Information Processing Systems 33, 3338-3350, 2020
Cited by: 16; Year: 2020
Rethinking attention with Performers. arXiv 2020
K Choromanski, V Likhosherstov, D Dohan, X Song, A Gane, T Sarlos, ...
arXiv preprint arXiv:2009.14794
Cited by: 16
Large‐scale log analysis of digital reading
P Braslavski, V Likhosherstov, V Petras, M Gäde
Proceedings of the Association for Information Science and Technology 53 (1 …, 2016
Cited by: 14; Year: 2016
Rethinking attention with performers. arXiv
K Choromanski, V Likhosherstov, D Dohan, X Song, A Gane, T Sarlos, ...
preprint, 2020
Cited by: 13; Year: 2020
Sub-linear memory: How to make performers slim
V Likhosherstov, KM Choromanski, JQ Davis, X Song, A Weller
Advances in Neural Information Processing Systems 34, 6707-6719, 2021
Cited by: 11; Year: 2021
Hybrid random features
K Choromanski, H Chen, H Lin, Y Ma, A Sehanobish, D Jain, MS Ryoo, ...
arXiv preprint arXiv:2110.04367, 2021
Cited by: 8; Year: 2021
UFO-BLO: Unbiased first-order bilevel optimization
V Likhosherstov, X Song, K Choromanski, J Davis, A Weller
arXiv preprint arXiv:2006.03631, 2020
Cited by: 6; Year: 2020
Stochastic flows and geometric optimization on the orthogonal group
K Choromanski, D Cheikhi, J Davis, V Likhosherstov, A Nazaret, ...
International Conference on Machine Learning, 1918-1928, 2020
Cited by: 5; Year: 2020
On the expressive power of self-attention matrices
V Likhosherstov, K Choromanski, A Weller
arXiv preprint arXiv:2106.03764, 2021
Cited by: 4; Year: 2021
Ten months of digital reading: An exploratory log study
P Braslavski, V Petras, V Likhosherstov, M Gäde
Research and Advanced Technology for Digital Libraries: 20th International …, 2016
Cited by: 4; Year: 2016
Debiasing a first-order heuristic for approximate bi-level optimization
V Likhosherstov, X Song, K Choromanski, JQ Davis, A Weller
International Conference on Machine Learning, 6621-6630, 2021
Cited by: 3; Year: 2021
CWY parametrization for scalable learning of orthogonal and Stiefel matrices
V Likhosherstov, J Davis, K Choromanski, A Weller
CoRR, abs/2004.08675, 2020
Cited by: 3; Year: 2020
Inference and Sampling of K33-free Ising Models
V Likhosherstov, Y Maximov, M Chertkov
International Conference on Machine Learning, 3963-3972, 2019
Cited by: 3; Year: 2019
From block-Toeplitz matrices to differential equations on graphs: towards a general theory for scalable masked Transformers
K Choromanski, H Lin, H Chen, T Zhang, A Sehanobish, V Likhosherstov, ...
International Conference on Machine Learning, 3962-3983, 2022
Cited by: 2; Year: 2022
Unlocking pixels for reinforcement learning via implicit attention
KM Choromanski, D Jain, W Yu, X Song, J Parker-Holder, T Zhang, ...
arXiv preprint arXiv:2102.04353, 2021
Cited by: 2; Year: 2021
Tractable minor-free generalization of planar zero-field Ising models
V Likhosherstov, Y Maximov, M Chertkov
Journal of Statistical Mechanics: Theory and Experiment 2020 (12), 124007, 2020
Cited by: 2; Year: 2020
A new family of tractable Ising models
V Likhosherstov, Y Maximov, M Chertkov
arXiv preprint arXiv:1906.06431, 2019
Cited by: 2; Year: 2019