Raymond T. Ng
#133,649
Most Influential Person Now
Raymond T. Ng's AcademicInfluence.com Rankings
Raymond T. Ng's Computer Science Degrees
Computer Science
#6012
World Rank
#6340
Historical Rank
Data Science
#104
World Rank
#107
Historical Rank
Data Mining
#131
World Rank
#131
Historical Rank
Database
#3137
World Rank
#3269
Historical Rank

Computer Science
Raymond T. Ng's Degrees
- PhD Computer Science Stanford University
- Masters Computer Science Stanford University
- Bachelors Computer Science University of British Columbia
Why Is Raymond T. Ng Influential?
Raymond T. Ng's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- LOF: identifying density-based local outliers (2000) (5834)
- Efficient and Effective Clustering Methods for Spatial Data Mining (1994) (2090)
- Algorithms for Mining Distance-Based Outliers in Large Datasets (1998) (1863)
- Distance-based outliers: algorithms and applications (2000) (1223)
- CLARANS: A Method for Clustering Objects for Spatial Data Mining (2002) (1173)
- Exploratory mining and pruning optimizations of constrained associations rules (1998) (839)
- On The Marriage of Lp-norms and Edit Distance (2004) (749)
- Predicting source code changes by mining change history (2004) (572)
- Finding Intensional Knowledge of Distance-Based Outliers (1999) (510)
- Indexing spatio-temporal trajectories with Chebyshev polynomials (2004) (355)
- A Unified Notion of Outliers: Properties and Computation (1997) (352)
- The New Jersey Data Reduction Report (1997) (283)
- Fast Computation of 2-Dimensional Depth Contours (1998) (232)
- Extracting knowledge from evaluative text (2005) (217)
- Exploratory Mining and Pruning Optimizations of Constrained Association Rules (1998) (203)
- Constraint-Based Multidimensional Data Mining (1999) (201)
- Optimization of constrained frequent set queries with 2-variable constraints (1999) (201)
- OPTICS-OF: Identifying Local Outliers (1999) (194)
- Probabilistic Logic Programming (1992) (189)
- Constraint-based clustering in large databases (2001) (181)
- Parametric query optimization (1992) (172)
- Abstractive Summarization of Product Reviews Using Discourse Structure (2014) (170)
- CODRA: A Novel Discriminative Framework for Rhetorical Analysis (2015) (167)
- Combining Intra- and Multi-sentential Rhetorical Parsing for Document-level Discourse Analysis (2013) (166)
- Integrating copy number polymorphisms into array CGH analysis using a robust HMM (2006) (166)
- Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining (2002) (140)
- Counting twig matches in a tree (2001) (140)
- Efficient and Effective Clustering Methods for Spatial Data Mining (1994) (138)
- Summarizing email conversations with clue words (2007) (128)
- Finding Aggregate Proximity Relationships and Commonalities in Spatial Data Mining (1996) (125)
- Genome-wide profiling of follicular lymphoma by array comparative genomic hybridization reveals prognostically significant DNA copy number imbalances. (2009) (124)
- Very large data bases (1994) (116)
- Proceedings of the 2008 ACM SIGMOD international conference on Management of data (2008) (112)
- Stable Semantics for Probabilistic Deductive Databases (1994) (100)
- A unified approach for mining outliers (1997) (98)
- Mixed integer programming methods for computing nonmonotonic deductive databases (1994) (94)
- Semantic Compression and Pattern Extraction with Fascicles (1999) (92)
- Explaining Outliers by Subspace Separability (2013) (91)
- Inference of transcriptional regulation relationships from gene expression data (2003) (90)
- Iceberg-cube computation with PC clusters (2001) (89)
- Exploiting succinct constraints using FP-trees (2002) (86)
- Detecting potential labeling errors in microarrays by data perturbation (2006) (85)
- Modeling recurrent DNA copy number alterations in array CGH data (2007) (85)
- A Template-based Abstractive Meeting Summarization: Leveraging Summary and Source Text Relationships (2014) (83)
- Efficient dynamic mining of constrained frequent sets (2003) (82)
- Summarizing Emails with Conversational Cohesion and Subjectivity (2008) (79)
- Discriminative features for identifying and interpreting outliers (2014) (76)
- A Novel Discriminative Framework for Sentence-Level Discourse Analysis (2012) (75)
- Flexible buffer allocation based on marginal gains (1991) (74)
- Preferential expression of antioxidant response element mediated gene expression in astrocytes (2001) (71)
- The 3W Model and Algebra for Unified Data Mining (2000) (70)
- Abstractive Meeting Summarization with Entailment and Fusion (2013) (69)
- A semantical framework for supporting subjective and conditional probabilities in deductive databases (1990) (68)
- Interactive multimedia summaries of evaluative text (2006) (67)
- Substring selectivity estimation (1999) (66)
- Maximizing Buffer and Disk Utilizations for News On-Demand (1994) (65)
- The Generalized MDL Approach for Summarization (2002) (64)
- Generating and Validating Abstracts of Meeting Conversations: a User Study (2010) (63)
- Local Outlier Detection with Interpretation (2013) (60)
- Predictive Load Control for Flexible Buffer Allocation (1991) (60)
- Flexible and Adaptable Buffer Management Techniques for Database Management Systems (1995) (59)
- Extending Q-Grams to Estimate Selectivity of String Matching with Low Edit Distance (2007) (59)
- The segment support map: scalable mining of frequent itemsets (2000) (58)
- Topic Segmentation and Labeling in Asynchronous Conversations (2013) (58)
- An Extendible Hash for Multi-Precision Similarity Querying of Image Databases (2001) (57)
- SQUIRE: sequential pattern mining with quantities (2004) (57)
- Evaluating multidimensional indexing structures for images transformed by principal component analysis (1996) (55)
- ItCompress: an iterative semantic compression algorithm (2004) (53)
- Robust space transformations for distance-based operations (2001) (51)
- Implementing Stable Semantics by Linear Programming (1993) (48)
- Abstractive Summarization of Spoken and Written Conversations Based on Phrasal Queries (2014) (47)
- Searching for dependencies at multiple abstraction levels (2002) (46)
- Similarity Join Size Estimation using Locality Sensitive Hashing (2011) (46)
- Dialogue Act Recognition in Synchronous and Asynchronous Conversations (2013) (44)
- MDQC: a new quality assessment method for microarrays based on quality control reports (2007) (42)
- To do or not to do: the dilemma of disclosing anonymized data (2005) (41)
- Exploratory mining via constrained frequent set queries (1999) (39)
- Implementing deductive databases by linear programming (1992) (39)
- SIGMA2: A system for the integrative genomic multi-dimensional analysis of cancer genomes, epigenomes, and transcriptomes (2008) (38)
- Exploring Joint Neural Model for Sentence Level Discourse Parsing and Sentiment Analysis (2017) (38)
- Modeling content and structure for abstractive review summarization (2016) (37)
- Expressive power of an algebra for data mining (2006) (37)
- Preservation Of Patterns and Input-Output Privacy (2007) (37)
- Detecting Disagreement in Conversations using Pseudo-Monologic Rhetorical Structure (2014) (36)
- Power-Law Based Estimation of Set Similarity Join Size (2009) (36)
- MDL Summarization with Holes (2005) (34)
- Multi-Dimensional Substring Selectivity Estimation (1999) (32)
- MD-SeeGH: a platform for integrative analysis of multi-dimensional genomic data (2008) (32)
- Hierarchical cluster analysis of SAGE data for cancer profiling (2001) (32)
- Stable Model Semantics for Probabilistic Deductive Databases (1990) (32)
- Finding Boundary Shape Matching Relationships in Spatial Data (1997) (31)
- Model-based clustering of array CGH data (2009) (31)
- Interpretation and Transformation for Abstracting Conversations (2010) (31)
- Exploiting Conversation Structure in Unsupervised Topic Segmentation for Emails (2010) (30)
- Computing Circumscriptive Databases: I. Theory and Algorithms (1995) (28)
- Schemes for Implementing Buffer Sharing in Continuous-Media Systems (1995) (28)
- One-dimensional and multi-dimensional substring selectivity estimation (2000) (28)
- Cooperative Query Answering Using Multiple Layered Databases (1994) (27)
- Extraction of Spatial Proximity Patterns by Concept Generalization (1996) (26)
- Implementing deductive databases by mixed integer programming (1996) (25)
- Outliers and data mining: finding exceptions in data (2002) (25)
- Using the Omega Index for Evaluating Abstractive Community Detection (2012) (24)
- Evolution and Revolutions in LDAP Directory Caches (2000) (24)
- Detecting outliers from large datasets (2001) (23)
- Parallel Computation of High-Dimensional Robust Correlation and Covariance Matrices (2006) (23)
- Towards Topic Labeling with Phrase Entailment and Aggregation (2013) (22)
- OSSM: a segmentation approach to optimize frequency counting (2002) (22)
- Complex Group-By Queries for XML (2007) (22)
- Discovering roll-up dependencies (1999) (21)
- Reducing bias and increasing utility by federated generative modeling of medical images using a centralized adversary (2021) (21)
- A Model-Based Ensembling Approach for Developing QSARs (2009) (21)
- Visual mining of power sets with large alphabets (2005) (21)
- Relating Dempster-Shafer Theory to Stable Semantics (1991) (20)
- Computation and implementation of non-monotonic deductive databases (1991) (20)
- Semantics, Consistency, and Query Processing of Empirical Deductive Databases (1997) (19)
- Discourse Analysis and Its Applications (2019) (17)
- Domain Adaptation to Summarize Human Conversations (2010) (17)
- Approximate substring selectivity estimation (2009) (17)
- Scalable discovery of hidden emails from large folders (2005) (17)
- Data Mining: The Next Generation (2004) (16)
- Outlier Detection with Space Transformation and Spectral Analysis (2013) (16)
- ChemModLab: A Web-Based Cheminformatics Modeling Laboratory (2012) (16)
- Introduction to the special issue on data mining for health informatics (2007) (16)
- Empirical Probabilities in Monadic Deductive Databases (1992) (15)
- Regression-Based Summarization of Email Conversations (2009) (15)
- On disclosure risk analysis of anonymized itemsets in the presence of prior knowledge (2008) (15)
- A methodology for analyzing SAGE libraries for cancer profiling (2005) (14)
- An expressive language and interface for image querying (1997) (14)
- Temporal Dependencies Generalized for Spatial and Other Dimensions (1999) (14)
- Parallel computation of high dimensional robust correlation and covariance matrices (2004) (13)
- A High Precision Pipeline for Financial Knowledge Graph Construction (2020) (13)
- Geo-Spatial Clustering with User-Specified Constraints (2000) (12)
- Dealing with Semantic Heterogeneity by Generalization-Based Data Mining Techniques (2002) (12)
- Identification of novel blood biomarkers of treatment response in cystic fibrosis pulmonary exacerbations by label-free quantitative proteomics (2019) (11)
- Statistical Modeling and Buffer Allocation for Mpeg Streams (1996) (11)
- Discovery and regeneration of hidden emails (2005) (11)
- Supervised Topic Segmentation of Email Conversations (2011) (11)
- Assessment of SVM Reliability for Microarray Data Analysis (2004) (10)
- Aggregate query processing in the presence of duplicates in wireless sensor networks (2015) (10)
- Multilevel Filtering for High-Dimensional Image Data: Why and How (1999) (9)
- Reasoning with Uncertainty in Deductive Databases and Logic Programs (1997) (9)
- Multiresolution subimage similarity matching for large image databases (1997) (9)
- Efficient compilation of large rule bases using logical access paths (1990) (9)
- Data mining and knowledge discovery in molecular databases (1998) (9)
- Optimization-based Content Selection for Opinion Summarization (2009) (9)
- The University of British Columbia at TAC 2008 (2008) (8)
- Training Data Enrichment for Infrequent Discourse Relations (2016) (8)
- Perspectives on Business Intelligence (2013) (8)
- Optimal clip ordering for multi-clip queries (1998) (7)
- Multiscale Similarity Matching for Subimage Queries of Arbitrary Size (1998) (6)
- A Comprehensive Survey on Online Anomaly Detection (2015) (6)
- Optical Mass Storage Systems and their Performance (1988) (6)
- Non-Monotonic Negation in Probabilistic Deductive Databases (1991) (6)
- Inferring RNA sequence preferences for poorly studied RNA-binding proteins based on co-evolution (2018) (6)
- Analysis of multilevel color histograms (1997) (5)
- Guest Editor's Introduction to the Special Section on the IEEE International Conference on Data Engineering (2010) (5)
- The impact of ASR on abstractive vs. extractive meeting summaries (2010) (5)
- Computational modeling of stigmatized behaviour in pro-vaccination and anti-vaccination discussions on social media (2019) (5)
- Outlier detection in personalized medicine (2013) (4)
- Buffer Sharing Schemes for Continuous-Media Systems (1995) (4)
- Probabilistic reasoning in logic programming (1991) (4)
- Neural Prediction of Patient Needs in an Ovarian Cancer Online Discussion Forum (2019) (3)
- ProbeRating: a recommender system to infer binding profiles for nucleic acid-binding proteins (2020) (3)
- Building Trust & Protecting Privacy: Analyzing Evidentiary Quality in a Blockchain Proof-of-Concept for Health Research Data Consent Management (2018) (3)
- Stigma Annotation Scheme and Stigmatized Language Detection in Health-Care Discussions on Social Media (2020) (3)
- Semantics and Consistency of Empirical Databases (1993) (3)
- 3D PET image generation with tumour masks using TGAN (2021) (2)
- EXQUISI: an expressive query interface for similar images (1996) (2)
- Towards a Toolkit for Data Analysis and Mining (1999) (2)
- Private data sharing between decentralized users through the privGAN architecture (2020) (2)
- Differences in DNA methylation of white blood cell types at birth and in adulthood reflect postnatal immune maturation and influence accuracy of cell type prediction (2018) (2)
- A Visual Interface for Analyzing Text Conversations (2012) (2)
- Efficient Aggregation Processing in the Presence of Duplicately Detected Objects in WSNs (2019) (2)
- Incremental Algorithms for Optimizing Model Computation Based on Partial Instantiation (1997) (2)
- Finding Topics in Emails: Is LDA enough? (2009) (2)
- Predictive modelling of stigmatized behaviour in vaccination discussions on Facebook (2019) (2)
- Dempster-Shafer Logic Programs and Stable Semantics (1993) (1)
- Discourse Processing and Its Applications in Text Mining (2018) (1)
- The Optimized Segment Support Map for the Mining of Frequent Patterns (2001) (1)
- Review - Clustering Categorical Data: An Approach Based on Dynamical Systems (1999) (1)
- Automated Analysis of Public Health Laboratory Test Results. (2020) (1)
- Exploiting Conversation Features for Finding Topics in Emails (2010) (1)
- An analysis of buffer sharing and prefetching techniques for multimedia systems (1996) (1)
- Designing a Discourse Parser for the Evaluative Text Genre (2010) (1)
- Generating and Evaluating Summaries for Partial Email Threads: Conversational Bayesian Surprise and Silver Standards (2017) (1)
- Automatic Topic Labeling in Asynchronous Conversations (2012) (1)
- GEA: a toolkit for gene expression analysis (2002) (1)
- Incompleteness in Data Mining (2001) (1)
- Guest Editorial (2004) (0)
- Comments using Tree Structured Conditional Random Fields (2012) (0)
- Incremental Methods for Optimizing Partial Instantiation (1995) (0)
- On the Complexity of Mining Quantitative Association Rules (1998) (0)
- Use of DNS SRV records for host selection (2009) (0)
- Data Mining and Knowledge Discovery in Molecular Databases - Session Introduction (1999) (0)
- lationships ata Mining (1996) (0)
- Natural Language Processing, Wearables, and Their Combination in Healthcare: Opportunities, Challenges, and Considerations (2020) (0)
- Natural Language Summarization of Evaluative Arguments (0)
- Privacy-Preserving Data Publishing: A Constraint-Based Clustering Approach (2009) (0)
- Generalizing Temporal Dependencies for Non-Temporal Dimensions (2003) (0)
- Constraint-Based Clustering in Large Databases (2000) (0)
- Performing boundary shape matching in spatial data (1996) (0)
- Group GAN (2022) (0)
- Towards Multi-modal Extraction and Summarization of Conversations (2009) (0)
- Dense Forecasting of Wildfire Smoke Particulate Matter Using Sparsity Invariant Convolutional Neural Networks (2020) (0)
- Guest Editorial (2020) (0)
- Tutorial notes of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining, KDD 2000, Boston, Massachusetts, USA, August 20-23, 2000 (2000) (0)
- Scientific Data Management for ecology and evolution (2020) (0)
- Inferring RNA sequence preferences for poorly studied RNA-binding proteins based on co-evolution (2018) (0)
What Schools Are Affiliated With Raymond T. Ng?
Raymond T. Ng is affiliated with the following schools: