Stella Biderman

Cited by

	All	Since 2019
Citations	8765	8747
h-index	30	30
i10-index	36	36

Citations per year
Year	2020	2021	2022	2023	2024
Citations	29	163	1210	4936	2359

Public access

5 articles available
0 articles not available

Based on funding mandates

Co-authors

Edward Raff, Booz Allen Hamilton, UMBC (verified email at bah.com)
Quentin Anthony, PhD Student, Ohio State University (verified email at osu.edu)
Hailey Schoelkopf, Researcher, EleutherAI (verified email at eleuther.ai)
Lintang Sutawika, EleutherAI (verified email at sutawika.com)

Stella Biderman

Other names: Stella Rose Biderman

Booz Allen Hamilton, EleutherAI

Verified email at bah.com

Natural Language Processing, Artificial Intelligence, Language Modeling, Deep Learning


Title	Authors	Venue	Cited by	Year
Multitask prompted training enables zero-shot task generalization	V Sanh, A Webson, C Raffel, SH Bach, L Sutawika, Z Alyafeai, A Chaffin, ...	The Tenth International Conference on Learning Representations (ICLR), 2022	1249	2022
The Pile: An 800GB Dataset of Diverse Text for Language Modeling	L Gao, S Biderman, S Black, L Golding, T Hoppe, C Foster, J Phang, H He, ...	arXiv preprint arXiv:2101.00027, 2020	1192*	2020
Bloom: A 176b-parameter open-access multilingual language model	TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...	arXiv preprint arXiv:2211.05100, 2022	1190	2022
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models	A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...	Transactions of Machine Learning Research (TMLR), 2022	811*	2022
GPT-NeoX-20B: An Open-Source Autoregressive Language Model	S Black, S Biderman, E Hallahan, Q Anthony, L Gao, L Golding, H He, ...	ACL Workshop on Challenges & Perspectives in Creating Large Language Models, 2022	535	2022
GPT-Neo: Large scale autoregressive language modeling with Mesh-TensorFlow	S Black, L Gao, P Wang, C Leahy, S Biderman	GitHub Repository, 2021	534*	2021
The Language Model Evaluation Harness	L Gao, J Tow, S Biderman, S Black, A DiPofi, C Foster, L Golding, J Hsu, ...	GitHub Repository, 2021	440*	2021
Pythia: A suite for analyzing large language models across training and scaling	S Biderman, H Schoelkopf, Q Anthony, H Bradley, K O'Brien, E Hallahan, ...	International conference on machine learning (ICML), 2023	414	2023
Crosslingual generalization through multitask finetuning	N Muennighoff, T Wang, L Sutawika, A Roberts, S Biderman, TL Scao, ...	61st Annual Meeting of the Association for Computational Linguistics, 2023	375	2023
VQGAN-CLIP: Open domain image generation and editing with natural language guidance	K Crowson, S Biderman, D Kornis, D Stander, E Hallahan, L Castricato, ...	European Conference on Computer Vision (ECCV), 2022	338*	2022
Quality at a glance: An audit of web-crawled multilingual datasets	J Kreutzer, I Caswell, L Wang, A Wahab, D van Esch, N Ulzii-Orshikh, ...	Transactions of the Association for Computational Linguistics 10, 50-72, 2022	215*	2022
RWKV: Reinventing RNNs for the Transformer Era	B Peng, E Alcaide, Q Anthony, A Albalak, S Arcadinho, H Cao, X Cheng, ...	Findings of the Association for Computational Linguistics: EMNLP, 2023	192*	2023
OpenFold: Retraining AlphaFold2 yields new insights into its learning mechanisms and capacity for generalization	G Ahdritz, N Bouatta, S Kadyan, Q Xia, W Gerecke, TJ O’Donnell, ...	Biorxiv, 2022.11.20.517210, 2022	114*	2022
The bigscience roots corpus: A 1.6 tb composite multilingual dataset	H Laurençon, L Saulnier, T Wang, C Akiki, A Villanova del Moral, ...	Advances in Neural Information Processing Systems 35, 31809-31826, 2022	111	2022
The Annotated Transformer	S Rush, A Huang, S Subramanian, J Sum, K Almubarak, S Biderman	Workshop for NLP open source software (NLP-OSS), 2022	95*	2022
trlX: A framework for large scale reinforcement learning from human feedback	A Havrilla, M Zhuravinskyi, D Phung, A Tiwari, J Tow, S Biderman, ...	Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023	85*	2023
What Language Model to Train if You Have One Million GPU Hours?	T Le Scao, T Wang, D Hesslow, L Saulnier, S Bekman, MS Bari, ...	Findings of Empirical Methods in Natural Language Processing (EMNLP), 2022	78	2022
Magic: The gathering is Turing complete	A Churchill, S Biderman, A Herrick	10th International Conference on Fun with Algorithms (FUN), 2020	72*	2020
Llemma: An open language model for mathematics	Z Azerbayev, H Schoelkopf, K Paster, MD Santos, S McAleer, AQ Jiang, ...	NeurIPS Workshop on Math and AI, 2023	68	2023
Eliciting latent predictions from transformers with the tuned lens	N Belrose, Z Furman, L Smith, D Halawi, I Ostrovsky, L McKinney, ...	arXiv preprint arXiv:2303.08112, 2023	66	2023
