Author

Alexander M

Associate Professor, Cornell University - Cited by 25,994 - Natural Language Processing - Machine Learning

Biography

Alexander M belongs to the Department of Physics, University of Winnipeg, Canada. Research interests include image registration software and multi-modality PET-MR imaging.
Title
Cited by
Year
Transformers: State-of-the-art natural language processing
T Wolf, L Debut, V Sanh, J Chaumond, C Delangue, A Moi, P Cistac, ... Proceedings of the 2020 conference on empirical methods in natural language …, 2020
4781
2020
Multitask prompted training enables zero-shot task generalization
V Sanh, A Webson, C Raffel, SH Bach, L Sutawika, Z Alyafeai, A Chaffin, ... arXiv preprint arXiv:2110.08207, 2021
577
2021
Bloom: A 176b-parameter open-access multilingual language model
TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... arXiv preprint arXiv:2211.05100, 2022
328
2022
How many data points is a prompt worth?
TL Scao, AM Rush. arXiv preprint arXiv:2103.08493, 2021
184
2021
Parameter-efficient transfer learning with diff pruning
D Guo, AM Rush, Y Kim. arXiv preprint arXiv:2012.07463, 2020
152
2020
Promptsource: An integrated development environment and repository for natural language prompts
SH Bach, V Sanh, ZX Yong, A Webson, C Raffel, NV Nayak, A Sharma, ... arXiv preprint arXiv:2202.01279, 2022
112
2022
Datasets: A community library for natural language processing
Q Lhoest, AV del Moral, Y Jernite, A Thakur, P von Platen, S Patil, ... arXiv preprint arXiv:2109.02846, 2021
104
2021
Block pruning for faster transformers
F Lagunas, E Charlaix, V Sanh, AM Rush. arXiv preprint arXiv:2109.04838, 2021
85
2021
Pre-trained summarization distillation
S Shleifer, AM Rush. arXiv preprint arXiv:2010.13002, 2020
68
2020
Sequence-level mixed sample data augmentation
D Guo, Y Kim, AM Rush. arXiv preprint arXiv:2011.09039, 2020
67
2020
Edgebert: Sentence-level energy optimizations for latency-aware multi-task nlp inference
T Tambe, C Hooper, L Pentecost, T Jia, EY Yang, M Donato, V Sanh, ... MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture …, 2021
59
2021
Algorithm-hardware co-design of adaptive floating-point encodings for resilient deep learning inference
T Tambe, EY Yang, Z Wan, Y Deng, VJ Reddi, A Rush, D Brooks, GY Wei. 2020 57th ACM/IEEE Design Automation Conference (DAC), 1-6, 2020
45
2020
Conference demographics and footprint changed by virtual platforms
M Skiles, E Yang, O Reshef, DR Muñoz, D Cintron, ML Lind, A Rush, ... Nature Sustainability 5 (2), 149-156, 2022
45
2022
GRIT: Generative role-filler transformers for document-level event entity extraction
X Du, AM Rush, C Cardie. arXiv preprint arXiv:2008.09249, 2020
43
2020
Interactive and visual prompt engineering for ad-hoc task adaptation with large language models
H Strobelt, A Webson, V Sanh, B Hoover, J Beyer, H Pfister, AM Rush. IEEE transactions on visualization and computer graphics (1), 1146-1156, 2022
29
2022
9.8 A 25mm² SoC for IoT Devices with 18ms Noise-Robust Speech-to-Text Latency via Bayesian Speech Denoising and Attention-Based Sequence-to-Sequence …
T Tambe, EY Yang, GG Ko, Y Chai, C Hooper, M Donato, PN Whatmough, ... 2021 IEEE International Solid-State Circuits Conference (ISSCC) 64, 158-160, 2021
29
2021
Scaling hidden Markov language models
JT Chiu, AM Rush. arXiv preprint arXiv:11.04640,
20
2020
Template filling with generative transformers
X Du, AM Rush, C Cardie. Proceedings of the 2021 Conference of the North American Chapter of the …, 2021
19
2021
Low-complexity probing via finding subnetworks
S Cao, V Sanh, AM Rush. arXiv preprint arXiv:2104.03514, 2021
18
2021
Rationales for sequential predictions
K Vafa, Y Deng, DM Blei, AM Rush. arXiv preprint arXiv:2109.06387, 2021
17
2021