Thomas Wolf
Chief Science Officer at HuggingFace
Transformers: State-of-the-art natural language processing
T Wolf, J Chaumond, L Debut, V Sanh, C Delangue, A Moi, P Cistac, ...
Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
V Sanh, L Debut, J Chaumond, T Wolf
arXiv preprint arXiv:1910.01108, 2019
Two-dimensional superconductivity at a Mott insulator/band insulator interface LaTiO3/SrTiO3
J Biscaras, N Bergeal, A Kushwaha, T Wolf, A Rastogi, RC Budhani, ...
Nature communications 1 (1), 1-5, 2010
TransferTransfo: A transfer learning approach for neural network based conversational agents
T Wolf, V Sanh, J Chaumond, C Delangue
arXiv preprint arXiv:1901.08149, 2019
Transfer learning in natural language processing
S Ruder, ME Peters, S Swayamdipta, T Wolf
Proceedings of the 2019 Conference of the North American Chapter of the …, 2019
A hierarchical multi-task approach for learning embeddings from semantic tasks
V Sanh, T Wolf, S Ruder
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 6949-6956, 2019
Strong field-matching effects in superconducting YBaCuO films with vortex energy landscapes engineered via masked ion irradiation
I Swiecicki, C Ulysse, T Wolf, R Bernard, N Bergeal, J Briatico, G Faini, ...
Physical Review B 85 (22), 224502, 2012
Large-scale transfer learning for natural language generation
S Golovanov, R Kurbanov, S Nikolenko, K Truskovskyi, A Tselousov, ...
Proceedings of the 57th Annual Meeting of the Association for Computational …, 2019
Lysandre Debut, Victor Sanh, Clement Delangue, Anthony Moi, Pierric Cistac, Morgan Funtowicz, Joe Davison, Sam Shleifer, et al. 2020. Transformers: State-of-the-art natural …
T Wolf, J Chaumond
Proceedings of the 2020 Conference on Empirical Methods in Natural Language …
Movement pruning: Adaptive sparsity by fine-tuning
V Sanh, T Wolf, AM Rush
arXiv preprint arXiv:2005.07683, 2020
YBCO Josephson junctions and striplines for RSFQ circuits made by ion irradiation
T Wolf, N Bergeal, J Lesueur, CJ Fourie, G Faini, C Ulysse, P Febvre
IEEE Transactions on applied superconductivity 23 (2), 1101205-1101205, 2013
Some additional experiments extending the tech report “Assessing BERT's syntactic abilities” by Yoav Goldberg
T Wolf
Technical report, 2019
Toward terahertz heterodyne detection with superconducting Josephson junctions
M Malnou, A Luo, T Wolf, Y Wang, C Feuillet-Palma, C Ulysse, G Faini, ...
Applied physics letters 101 (23), 233505, 2012
Influence of ion implantation on the magnetic and transport properties of manganite films
M Sirena, A Zimmers, N Haberkorn, EE Kaul, LB Steren, J Lesueur, T Wolf, ...
Physical Review B 81 (13), 134439, 2010
Imprinting nanoporous alumina patterns into the magneto-transport of oxide superconductors
JE Villegas, I Swiecicki, R Bernard, A Crassous, J Briatico, T Wolf, ...
Nanotechnology 22 (7), 075302, 2011
HTS Josephson junctions arrays for high-frequency mixing
A Sharafiev, M Malnou, C Feuillet-Palma, C Ulysse, T Wolf, F Couëdo, ...
Superconductor Science and Technology 31 (3), 035003, 2018
TLDR: token loss dynamic reweighting for reducing repetitive utterance generation
S Jiang, T Wolf, C Monz, M de Rijke
arXiv preprint arXiv:2003.11963, 2020
Direct observation of electronic inhomogeneities induced by point defect disorder in manganite films
M Sirena, A Zimmers, N Haberkorn, E Kaul, LB Steren, J Lesueur, T Wolf, ...
Journal of Applied Physics 107 (11), 113903, 2010
Learning from others' mistakes: Avoiding dataset biases without modeling them
V Sanh, T Wolf, Y Belinkov, AM Rush
arXiv preprint arXiv:2012.01300, 2020
Continuous learning in a hierarchical multiscale neural network
T Wolf, J Chaumond, C Delangue
arXiv preprint arXiv:1805.05758, 2018