Charles Peter Elkan
#127,522
Most Influential Person Now
Charles Peter Elkan's AcademicInfluence.com Rankings
Charles Peter Elkan's Computer Science Degrees
Computer Science: World Rank #5431, Historical Rank #5739
Data Mining: World Rank #103, Historical Rank #103
Machine Learning: World Rank #1445, Historical Rank #1467
Artificial Intelligence: World Rank #1676, Historical Rank #1708

Computer Science
Charles Peter Elkan's Degrees
- PhD Computer Science Stanford University
- Bachelors Mathematics Stanford University
Why Is Charles Peter Elkan Influential?
Charles Peter Elkan's Published Works
Published Works
- Fitting a Mixture Model By Expectation Maximization To Discover Motifs In Biopolymer (1994) (5049)
- The Foundations of Cost-Sensitive Learning (2001) (2128)
- Transforming classifier scores into accurate multiclass probability estimates (2002) (1054)
- Learning to Diagnose with LSTM Recurrent Neural Networks (2015) (940)
- Learning the k in k-means (2003) (932)
- Learning classifiers from only positive and unlabeled data (2008) (928)
- Using the Triangle Inequality to Accelerate k-Means (2003) (827)
- Obtaining calibrated probability estimates from decision trees and naive Bayesian classifiers (2001) (801)
- The Value of Prior Knowledge in Discovering Motifs with MEME (1995) (744)
- Topic models (2008) (618)
- The Field Matching Problem: Algorithms and Applications (1996) (587)
- Link Prediction via Matrix Factorization (2011) (525)
- Learning and making decisions when costs and probabilities are both unknown (2001) (508)
- Alternatives to the k-means algorithm that find better clusterings (2002) (506)
- An Efficient Domain-Independent Algorithm for Detecting Approximately Duplicate Database Records (1997) (454)
- Unsupervised learning of multiple motifs in biopolymers using expectation maximization (1995) (438)
- The Transporter Classification Database: recent advances (2008) (378)
- Unsupervised Learning of Multiple Motifs in Biopolymers Using Expectation Maximization (2004) (335)
- Results of the KDD'99 classifier learning (2000) (327)
- Modeling word burstiness using the Dirichlet distribution (2005) (309)
- The paradoxical success of fuzzy logic (1993) (306)
- Scalability for clustering algorithms revisited (2000) (266)
- Optimal Thresholding of Classifiers to Maximize F1 Measure (2014) (263)
- Bayesian approaches to failure prediction for disk drives (2001) (226)
- Meta-MEME: motif-based hidden Markov models of protein families (1997) (217)
- The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text (2011) (212)
- Quadratic Programming Feature Selection (2010) (210)
- Differential Privacy and Machine Learning: a Survey and Review (2014) (198)
- Boosting and Naive Bayesian learning (1997) (175)
- A Positive and Unlabeled Learning Algorithm for One-Class Classification of Remote-Sensing Data (2011) (162)
- Principled Methods for Advising Reinforcement Learning Agents (2003) (162)
- KDD Cup and workshop 2007 (2007) (151)
- Clustering documents with an exponential-family approximation of the Dirichlet compound multinomial distribution (2006) (147)
- A Rational Reconstruction of Nonmonotonic Truth Maintenance Systems (1990) (111)
- Accounting for burstiness in topic models (2009) (110)
- Magical thinking in data mining: lessons from CoIL challenge 2000 (2001) (109)
- Learning gene regulatory networks from only positive and unlabeled data (2010) (106)
- ParaMEME: a parallel implementation and a web interface for a DNA and protein motif discovery tool (1996) (95)
- Independence of logic database queries and update (1990) (89)
- Fast recognition of musical genres using RBF networks (2005) (89)
- Thresholding Classifiers to Maximize F1 Score (2014) (83)
- Can we model the probability of presence of species without absence data? (2011) (76)
- Beam search algorithms for multilabel learning (2013) (73)
- A Log-Linear Model with Latent Features for Dyadic Prediction (2010) (71)
- An artificial intelligence approach to motif discovery in protein sequences: Application to steroid dehydrogenases (1997) (70)
- Predictive analytics and data mining (2010) (65)
- Predicting accurate probabilities with a ranking loss (2012) (59)
- Estimating the Accuracy of Learned Concepts (1993) (57)
- Fast Algorithms for Approximating the Singular Value Decomposition (2011) (56)
- Predicting labels for dyadic data (2010) (50)
- Deriving TF-IDF as a Fisher Kernel (2005) (49)
- Spinach CSP41, an mRNA-binding protein and ribonuclease, is homologous to nucleotide-sugar epimerases and hydroxysteroid dehydrogenases (1998) (47)
- Learning and Inference in Probabilistic Classifier Chains with Beam Search (2012) (44)
- Hidden Markov model analysis of motifs in steroid dehydrogenases and their homologs (1997) (41)
- Nearest Neighbor Classification (2007) (39)
- Conspiracy Numbers and Caching for Searching And/Or Trees and Theorem-Proving (1989) (37)
- Latent semantic indexing (LSI) fails for TREC collections (2011) (37)
- Incremental, Approximate Planning (1990) (36)
- KDD Cup and Workshop 2007 (2007) (35)
- Inhibition in Multiclass Classification (2012) (35)
- Differential privacy based on importance weighting (2013) (34)
- Log-linear models and conditional random fields (2007) (32)
- Reasoning about Action in First-Order Logic (1992) (31)
- End-to-End Offline Goal-Oriented Dialog Policy Learning via Policy Gradient (2017) (30)
- Learning structure and concepts in data through data clustering (2003) (30)
- Nonlinear Support Vector Machines Can Systematically Identify Stocks with High and Low Future Returns (2012) (29)
- Making generative classifiers robust to selection bias (2007) (29)
- A Bayesian network framework for reject inference (2004) (26)
- Evaluating Classifiers (2006) (22)
- MLSys: The New Frontier of Machine Learning Systems (2019) (21)
- A High-Performance Explanation-Based Learning Algorithm (1994) (21)
- Policy mining: learning decision policies from fixed sets of data (2003) (21)
- Discovering motifs in DNA and protein sequences: the approximate common substring problem (1995) (20)
- SysML: The New Frontier of Machine Learning Systems (2019) (20)
- Probabilistic Modeling of a Sales Funnel to Prioritize Leads (2015) (19)
- Elkan's Reply: The Paradoxical Controversy over Fuzzy Logic (1994) (19)
- Learning to Find Relevant Biological Articles without Negative Training Examples (2008) (18)
- A decision procedure for conjunctive query disjointness (1989) (18)
- Predicting Surgery Duration with Neural Heteroscedastic Regression (2017) (18)
- Identifying Relevant Data for a Biological Database: Handcrafted Rules versus Machine Learning (2011) (18)
- Measuring and Improving the Effectiveness of Representations (1991) (17)
- Finding Transport Proteins in a General Protein Database (2007) (17)
- A common ancestor for a subunit in the mitochondrial proton-translocating NADH:ubiquinone oxidoreductase (complex I) and short-chain dehydrogenases/reductases (1999) (16)
- The Promising Future of Fuzzy Logic (1994) (15)
- Automated Inductive Reasoning about Logic Programs (1988) (14)
- Sources of Success for Boosted Wrapper Induction (2004) (13)
- D. B. Lenat and R. V. Guha, Building Large Knowledge-Based Systems: Representation and Inference in the Cyc Project (1993) (13)
- MEME, MAST, and Meta-MEME: New Tools for Motif Discovery in Protein Sequences (1999) (12)
- Logical Characterizations of Nonmonotonic TMSs (1989) (10)
- Dyadic Prediction Using a Latent Feature Log-Linear Model (2010) (10)
- One-Class Remote Sensing Classification From Positive and Unlabeled Background Data (2021) (10)
- On Solving the Qualification Problem (1995) (10)
- Visualizing the Consequences of Evidence in Bayesian Networks (2017) (10)
- A Modified Logistic Regression for Positive and Unlabeled Learning (2019) (9)
- Sources of Success for Information Extraction Methods (2001) (9)
- The WEBFIND tool for finding scientific papers over the worldwide web (2007) (8)
- Cost-Sensitive Learning and Decision-Making When Costs Are Unknown (2000) (8)
- What we need to learn if we want to do and not just talk (2018) (8)
- Preserving Privacy in Data Mining via Importance Weighting (2010) (8)
- Contributions to research on machine translation (2006) (7)
- Paradoxes of fuzzy logic, revisited (2001) (6)
- Text mining and topic models (2010) (6)
- A Taxonomy of Computational and Social Learning (2001) (6)
- Learning Rules to Improve a Machine Translation System (2003) (6)
- A bayesian approach to motif-based protein modeling (1998) (5)
- Policy Iteration Based on a Learned Transition Model (2012) (5)
- Nowcasting with Numerous Candidate Predictors (2014) (4)
- Conditional Random Fields for Word Hyphenation (2010) (4)
- Efficient Elastic Net Regularization for Sparse Linear Models (2015) (4)
- Incremental, Approximate Planning (1990) (3)
- Playing the Imitation Game with deep learning (2016) (3)
- Shared challenges in data mining and computational biology (abstract of invited talk) (2001) (3)
- Differential privacy based on importance weighting (2013) (3)
- Reinforcement Learning with a Bilinear Q Function (2011) (3)
- Adaptive Locking (1987) (3)
- Learning meanings for sentences (2012) (3)
- Reasoning about Unknown, Counterfactual, and Nondeterministic Actions in First-Order Logic (1996) (3)
- Learning the k in k-means (3)
- Log-linear models and conditional random fields: Notes for a tutorial at CIKM '08 (2008) (2)
- A Model-Independent Measure of Regression Difficulty (2000) (2)
- F1-Optimal Thresholding in the Multi-Label Setting (2014) (2)
- Integrating external information sources to guide worldwide web information retrieval (2007) (1)
- Low-Rank Decomposition and Logistic Regression Methods for Link Prediction in Terrorist Networks: CSE 293 MS Project Report, Fall 2010 (2010) (1)
- Exploratory Analysis of Speedup Learning Data Using Expectation Maximization (1996) (1)
- Guest Editorial for Special Issue KDD’10 (2012) (1)
- A Bayesian Network Framework for Reject Inference [Extended Abstract] (0)
- Probabilistic learning (2012) (0)
- Hidden Markov models (2012) (0)
- A Pipeline to Automate the Updating of a Specialized Protein Database (2007) (0)
- Cross-validation and Modal Theories (1995) (0)
- Reply to Comments on The Paradoxical Success of Fuzzy Logic (1997) (0)
- Web-scale information retrieval and data mining List of papers (2008) (0)
- Best of the Journal of Computational and Graphical Statistics (Invited Session) Organizer and Chair: (2002) (0)
- Flexible concurrency control by reasoning about database queries and updates (1990) (0)
- Learning to Re-rank for Interactive Problem Resolution and Query Refinement (2014) (0)
- Analysis of the budget of the Jacobs School of Engineering (2010) (0)
- ControlFlag: A Self-supervised Idiosyncratic Pattern Detection System for Software Control Structures (2020) (0)
- Formalizing Counterfactual and Nondeterministic Actions in First-Order Logic (1995) (0)
- Theory versus practice in data science (2015) (0)
- Achieving Fluency and Coherency in Task-oriented Dialog (2018) (0)
- LPMEME: A Statistical Method for Inductive Logic Programming (1996) (0)
- Learning to Diagnose with LSTM Recurrent Neural Networks (2015) (0)
- Notes on Machine Learning Projects and Reports (2010) (0)
- Lessons learned from contests in data mining (2011) (0)
- Beam search algorithms for multilabel learning (2013) (0)
What Schools Are Affiliated With Charles Peter Elkan?
Charles Peter Elkan is affiliated with the following schools: