Sebastian Ruder
#156,542
Most Influential Person Now
Sebastian Ruder's AcademicInfluence.com Rankings
Sebastian Ruder Computer Science Degrees
- Computer Science: World Rank #8651, Historical Rank #9096
- Algorithms: World Rank #343, Historical Rank #348
- Computational Linguistics: World Rank #1964, Historical Rank #1985
- Machine Learning: World Rank #3597, Historical Rank #3641

Why Is Sebastian Ruder Influential?
Sebastian Ruder's Published Works
[Figure: Number of citations in a given year to any of this author's works.]
[Figure: Total number of citations to an author for the works they published in a given year. This highlights publication of the author's most important work(s).]
Published Works
- An overview of gradient descent optimization algorithms (2016) (4449)
- Universal Language Model Fine-tuning for Text Classification (2018) (2796)
- An Overview of Multi-Task Learning in Deep Neural Networks (2017) (1966)
- XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalization (2020) (568)
- On the Cross-lingual Transferability of Monolingual Representations (2019) (436)
- A Survey of Cross-lingual Word Embedding Models (2017) (422)
- To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks (2019) (326)
- Transfer Learning in Natural Language Processing (2019) (305)
- Long Range Arena: A Benchmark for Efficient Transformers (2020) (280)
- MAD-X: An Adapter-based Framework for Multi-task Cross-lingual Transfer (2020) (276)
- AdapterHub: A Framework for Adapting Transformers (2020) (265)
- Fine-tuned Language Models for Text Classification (2018) (247)
- On the Limitations of Unsupervised Bilingual Dictionary Induction (2018) (226)
- A Hierarchical Model of Reviews for Aspect-based Sentiment Analysis (2016) (224)
- Latent Multi-Task Architecture Learning (2017) (206)
- Neural transfer learning for natural language processing (2019) (185)
- Sluice networks: Learning what to share between loosely related tasks (2017) (182)
- A Hierarchical Multi-task Approach for Learning Embeddings from Semantic Tasks (2018) (176)
- Episodic Memory in Lifelong Language Learning (2019) (154)
- How to (Properly) Evaluate Cross-Lingual Word Embeddings: On Strong Baselines, Comparative Analyses, and Some Misconceptions (2019) (144)
- Strong Baselines for Neural Semi-Supervised Learning under Domain Shift (2018) (143)
- Learning to select data for transfer learning with Bayesian Optimization (2017) (140)
- A survey of cross-lingual embedding models (2017) (122)
- Unsupervised Cross-Lingual Representation Learning (2019) (97)
- How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models (2020) (97)
- Character-level and Multi-channel Convolutional Neural Networks for Large-scale Authorship Attribution (2016) (93)
- INSIGHT-1 at SemEval-2016 Task 5: Deep Learning for Multilingual Aspect-based Sentiment Analysis (2016) (91)
- Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks (2021) (89)
- XTREME-R: Towards More Challenging and Nuanced Multilingual Evaluation (2021) (89)
- Rethinking embedding coupling in pre-trained language models (2020) (83)
- Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate Label Spaces (2018) (71)
- MultiFiT: Efficient Multi-lingual Language Model Fine-tuning (2019) (67)
- MasakhaNER: Named Entity Recognition for African Languages (2021) (67)
- Charformer: Fast Character Transformers via Gradient-based Subword Tokenization (2021) (63)
- UNKs Everywhere: Adapting Multilingual Language Models to New Scripts (2020) (62)
- A Call for More Rigor in Unsupervised Cross-lingual Learning (2020) (50)
- Are All Good Word Vector Spaces Isomorphic? (2020) (39)
- Knowledge Adaptation: Teaching to Adapt (2017) (38)
- AxCell: Automatic Extraction of Results from Machine Learning Papers (2020) (37)
- Generalizing Procrustes Analysis for Better Bilingual Dictionary Induction (2018) (35)
- Pitfalls of Static Language Modelling (2021) (34)
- Don’t Forget the Long Tail! A Comprehensive Analysis of Morphological Generalization in Bilingual Lexicon Induction (2019) (33)
- Emoji as Emotion Tags for Tweets (2016) (32)
- Multi-view Subword Regularization (2021) (30)
- A Discriminative Latent-Variable Model for Bilingual Lexicon Induction (2018) (27)
- Latent Multitask Architecture Learning (2018) (26)
- Cross-Lingual Word Embeddings (2019) (25)
- IndoNLG: Benchmark and Resources for Evaluating Indonesian Natural Language Generation (2021) (25)
- Data Selection Strategies for Multi-Domain Sentiment Analysis (2017) (20)
- On Memory in Human and Artificial Language Processing Systems (2020) (13)
- INSIGHT-1 at SemEval-2016 Task 4: Convolutional Neural Networks for Sentiment Classification and Quantification (2016) (12)
- 360° Stance Detection (2018) (9)
- BERT memorisation and pitfalls in low-resource scenarios (2021) (8)
- Towards a continuous modeling of natural language domains (2016) (8)
- Multi-Domain Multilingual Question Answering (2021) (7)
- Modular Deep Learning (2023) (5)
- Analogy Training Multilingual Encoders (2021) (4)
- What do Deep Networks Like to Read? (2019) (4)
- Off-the-Shelf Unsupervised NMT (2018) (1)
- 360° Stance Detection (2018) (1)
- Compacter: Efficient Low-Rank Hypercomplex Adapter Layers (2021) (1)
- Morphologically Aware Word-Level Translation (2020) (0)
- Recent Developments in Computational Typology and Multilingual Natural Language Processing (0)
- Part 1: Knowledgeable and Robust NLG Models Auxiliary Knowledge (Entailment, Saliency) External Commonsense Sensitivity to Negations/Antonyms Robustness to Missing Words, Spelling/Grammar Errors, Paraphrases Auto-Adversary Generation (2019) (0)