Jeffrey Naughton

Q: What Schools Are Affiliated With Jeffrey Naughton?

Jeffrey Naughton is affiliated with the following schools: University of California, Berkeley; Princeton University; University of Wisconsin–Madison; University of Chicago; Stanford University.

Jeffrey Naughton's AcademicInfluence.com Rankings

Computer Science: World Rank #2759, Historical Rank #2887, USA Rank #1160

Database: World Rank #890, Historical Rank #935, USA Rank #270

Why Is Jeffrey Naughton Influential?

According to Wikipedia, Jeffrey Naughton is a computer scientist and former professor and department chair of Computer Sciences at the University of Wisconsin–Madison, where he was one of the leaders of the Wisconsin Database Group. He was lead of Google's Madison office until 2022.

Jeffrey Naughton's Published Works

[Chart: number of citations in a given year to any of this author's works]

[Chart: total number of citations to the works the author published in a given year, highlighting publication of the author's most important work(s)]

Relational Databases for Querying XML Documents: Limitations and Opportunities (1999) (1131)
On supporting containment queries in relational database management systems (2001) (900)
On the Computation of Multidimensional Aggregates (1996) (643)
Generalized Search Trees for Database Systems (1995) (606)
Shoring up persistent applications (1994) (500)
An array-based algorithm for simultaneous multidimensional aggregates (1997) (469)
Evaluating window joins over unbounded streams (2003) (412)
The OO7 Benchmark (1993) (392)
Covering indexes for branching path queries (2002) (358)
Practical Skew Handling in Parallel Joins (1992) (328)
Materialized View Selection for Multidimensional Datasets (1998) (314)
Sampling-Based Estimation of the Number of Distinct Values of an Attribute (1995) (309)
Practical selectivity estimation through adaptive sampling (1990) (308)
Maximizing the Output Rate of Multi-Way Join Queries over Streaming Information Sources (2003) (298)
The OO7 Benchmark (1993) (284)
The Asilomar report on database research (1998) (272)
Rate-based query optimization for streaming information sources (2002) (271)
On schema matching with opaque column names and data values (2003) (268)
Anonymization of Set-Valued Data via Top-Down, Local Generalization (2009) (249)
Corleone: hands-off crowdsourcing for entity matching (2014) (240)
Cache Conscious Algorithms for Relational Query Processing (1994) (239)
Caching multidimensional queries using chunks (1998) (237)
The Niagara Internet Query System (2001) (235)
The Lowell database research self-assessment (2003) (234)
Estimating the Selectivity of XML Path Expressions for Internet Scale Applications (2001) (229)
Declarative Information Extraction Using Datalog with Embedded Extraction Predicates (2007) (226)
Bolt-on Differential Privacy for Scalable Stochastic Gradient Descent-based Analytics (2016) (190)
Technical Perspective: Toward Building Entity Matching Management Systems (2016) (190)
On the integration of structure indexes and inverted lists (2004) (181)
On the provenance of non-answers to queries over extracted data (2008) (177)
Middle-tier database caching for e-business (2002) (177)
Storage Estimation for Multidimensional Aggregates in the Presence of Hierarchies (1996) (176)
Parallel sorting on a shared-nothing architecture using probabilistic splitting (1991) (164)
Combining keyword search and forms for ad hoc querying of databases (2009) (162)
A general technique for querying XML documents using a relational database system (2001) (162)
Low-Latency, Concurrent Checkpointing for Parallel Programs (1994) (162)
Predicting query execution time: Are optimizer cost models really unusable? (2013) (159)
An Evaluation of Non-Equijoin Algorithms (1991) (156)
Query Size Estimation by Adaptive Sampling (1995) (153)
Adaptive parallel aggregation algorithms (1995) (142)
XML-SQL Query Translation Literature: The State of the Art and Open Problems (2003) (136)
Real-time, concurrent checkpoint for parallel programs (1990) (133)
Selectivity and Cost Estimation for Joins Based on Random Sampling (1996) (133)
Learning Generalized Linear Models Over Normalized Data (2015) (131)
The Beckman Report on Database Research (2014) (129)
Toward a progress indicator for database queries (2004) (124)
K-Anonymization as Spatial Indexing: Toward Scalable and Incremental Anonymization (2007) (124)
Model Selection Management Systems: The Next Frontier of Advanced Analytics (2016) (122)
On the performance of object clustering techniques (1992) (121)
A Methodology for Formalizing Model-Inversion Attacks (2016) (119)
Fast algorithms for mining association rules and sequential patterns (1996) (118)
Building a scaleable geo-spatial DBMS: technology, implementation, and evaluation (1997) (117)
Static optimization of conjunctive queries with sliding windows over infinite streams (2004) (117)
Design and evaluation of alternative selection placement strategies in optimizing continuous queries (2002) (115)
Clustera: an integrated computation and data management system (2008) (115)
On differentially private frequent itemset mining (2012) (111)
Magellan: Toward Building Entity Matching Management Systems over Data Science Stacks (2016) (111)
Query execution techniques for caching expensive methods (1996) (110)
Extending RDBMSs To Support Sparse Datasets Using An Interpreted Attribute Storage Format (2006) (103)
Data independent recursion in deductive databases (1985) (103)
Mixed Mode XML Query Processing (2003) (101)
Efficient evaluation of right-, left-, and multi-linear rules (1989) (99)
Turbocharging DBMS buffer pool using SSDs (2011) (98)
Efficient Sampling Strategies for Relational Database Operations (1993) (96)
A stochastic approach for clustering in object bases (1991) (93)
The BUCKY object-relational benchmark (1997) (92)
Checkpointing multicomputer applications (1991) (89)
A scalable hash ripple join algorithm (2002) (89)
Active Query Caching for Database Web Servers (2000) (88)
Set Containment Joins: The Good, The Bad and The Ugly (2000) (87)
Information extraction challenges in managing unstructured data (2009) (87)
Towards Predicting Query Execution Time for Concurrent and Dynamic Database Workloads (2013) (87)
Falcon: Scaling Up Hands-Off Crowdsourced Entity Matching to Build Cloud Services (2017) (85)
A status report on the OO7 OODBMS benchmarking effort (1994) (84)
To Join or Not to Join?: Thinking Twice about Joins before Feature Selection (2016) (80)
Simultaneous optimization and evaluation of multiple dimensional queries (1998) (80)
A decidable class of bounded recursions (1987) (79)
Estimating the Size of Generalized Transitive Closures (1989) (79)
Towards Linear Algebra over Normalized Data (2016) (77)
Query size estimation by adaptive sampling (extended abstract) (1990) (77)
Toward scalable keyword search over relational data (2010) (76)
Form-based proxy caching for database-backed web sites: keywords and functions (2001) (75)
Recursive XML schemas, recursive XML queries, and relational storage: XML-to-SQL query translation (2004) (73)
On energy management, load balancing and replication (2010) (71)
Efficiently incorporating user feedback into information extraction and integration programs (2009) (70)
Multi-query SQL Progress Indicators (2006) (70)
Accurate estimation of the cost of spatial selections (2000) (69)
One-sided recursions (1987) (66)
A Relational Approach to Incrementally Extracting and Querying Structure in Unstructured Data (2007) (63)
Increasing the accuracy and coverage of SQL progress indicators (2005) (59)
End-biased Samples for Join Cardinality Estimation (2006) (59)
Architecting a Network Query Engine for Producing Partial Results (2000) (59)
The case for a wide-table approach to manage sparse relational data sets (2007) (58)
A non-blocking parallel spatial join algorithm (2002) (58)
Fixed-precision estimation of join selectivity (1993) (57)
On the relative cost of sampling for join selectivity estimation (1994) (56)
Argument Reduction by Factoring (1989) (56)
Integrating databases and workflow systems (2005) (55)
Generating Synthetic Complex-Structured XML Data (2001) (55)
Multiprocessor Main Memory Transaction Processing (1988) (54)
Efficient XML-to-SQL Query Translation: Where to Add the Intelligence? (2004) (53)
Sampling-Based Query Re-Optimization (2016) (52)
Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, May 16-18, 2000, Dallas, Texas (2000) (51)
DIFF (2018) (51)
Preventing equivalence attacks in updated, anonymized data (2011) (50)
Bottom-Up Evaluation of Logic Programs (1991) (49)
Sampling Issues in Parallel Database Systems (1992) (48)
YAWN! (Yet Another Window on NAIL!) (1987) (48)
Materialized View Selection for Multi-Cube Data Models (2000) (48)
Nested loops revisited (1993) (47)
Updates for Structure Indexes (2002) (45)
Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data (1980) (44)
Storage reclamation and reorganization in client-server persistent object stores (1994) (43)
Using shared virtual memory for parallel join processing (1993) (43)
The Case for a Structured Approach to Managing Unstructured Data (2009) (41)
The Beckman report on database research (2016) (41)
GSLPI: A Cost-Based Query Progress Indicator (2012) (40)
Minimizing function-free recursive inference rules (1989) (40)
Modeling entity evolution for temporal record matching (2014) (38)
Compiling separable recursions (1988) (38)
On Estimating the Size of Projections (1990) (37)
Avi Pfeffer: Generalized Search Trees for Database Systems (1995) (37)
Partial results in database systems (2014) (36)
On relational support for XML publishing: beyond sorting and tagging (2003) (36)
A software-defined networking based approach for performance management of analytical queries on distributed data stores (2014) (35)
Locking protocols for materialized aggregate join views (2003) (35)
On Load Shedding in Complex Event Processing (2013) (34)
Following the paths of XML Data: An algebraic framework for XML query evaluation (2001) (33)
On the complexity of privacy-preserving complex event processing (2011) (32)
Counting methods for cyclic relations (1988) (32)
DIFF: a relational interface for large-scale data explanation (2018) (31)
Impact of disk corruption on open-source DBMS (2010) (30)
Uncertainty Aware Query Execution Time Prediction (2014) (30)
Cubing Algorithms, Storage Estimation, and Storage and Processing Alternatives for OLAP (1997) (29)
Tracking Entities in the Dynamic World: A Fast Algorithm for Matching Temporal Records (2014) (29)
Schema Matching Using Interattribute Dependencies (2008) (28)
Parallelizing OODBMS traversals: a performance evaluation (1996) (28)
Revisiting Differentially Private Regression: Lessons From Learning Theory and their Consequences (2015) (28)
Aggregate Aware Caching for Multi-Dimensional Queries (2000) (27)
OODB Bulk Loading Revisited: The Partitioned-List Approach (1995) (27)
Bulk Loading into an OODB: A Performance Study (1994) (25)
Utility-maximizing event stream suppression (2013) (24)
Demonstration of Santoku: Optimizing Machine Learning over Normalized Data (2015) (23)
Operator and Query Progress Estimation in Microsoft SQL Server Live Query Statistics (2016) (23)
Differentially Private Stochastic Gradient Descent for in-RDBMS Analytics (2016) (22)
A comparison of three methods for join view maintenance in parallel RDBMS (2003) (22)
m-tables: Representing Missing Data (2017) (22)
The BUCKY Object-Relational Benchmark (Experience Paper) (1997) (21)
On the Difficulty of Finding Optimal Relational Decompositions for XML Workloads: A Complexity Theoretic Perspective (2003) (21)
JECB: a join-extension, code-based approach to OLTP data partitioning (2014) (20)
Approximating Streaming Window Joins Under CPU Limitations (2006) (18)
DIAMetrics: Benchmarking Query Engines at Scale (2020) (18)
Transaction Reordering and Grouping for Continuous Data Loading (2006) (18)
Resource Bricolage for Parallel Database Systems (2014) (18)
Array-based evaluation of multi-dimensional queries in object-relational database systems (1998) (18)
Putting XML Query Algebras into Context (2002) (16)
Synopses for query optimization: A space-complexity perspective (2004) (16)
Clocked adversaries for hashing (1993) (16)
Exploring Provenance in a Distributed Job Execution System (2006) (16)
Distribution-Based Query Scheduling (2013) (15)
On the complexity of join predicates (2001) (15)
Bridging relational technology and XML (2001) (15)
Towards Interactive Debugging of Rule-based Entity Matching (2017) (14)
Tuple-oriented Compression for Large-scale Mini-batch Stochastic Gradient Descent (2019) (14)
ParSets for parallelizing OODBMS traversals: implementation and performance (1994) (14)
F1 lightning (2020) (13)
On the expected size of recursive Datalog queries (1991) (13)
Remote load-sensitive caching for multi-server database systems (1998) (13)
Memory management for scalable Web data servers (1997) (12)
Transparently Gathering Provenance with Provenance Aware Condor (2009) (12)
The impact of data placement on memory management for multi-server OODBMS (1995) (12)
Space optimization in the bottom-up evaluation of logic programs (1991) (12)
A Counting Algorithm for a Cyclic Binary Query (1991) (11)
Comprehensive and Efficient Workload Compression (2020) (11)
A Simple Characterization of Uniform Boundedness for a Class of Recursions (1991) (11)
Building XML statistics for the hidden web (2003) (11)
On Transactional Memory, Spinlocks, and Database Transactions (2010) (11)
Database support for matching: limitations and opportunities (2006) (11)
Transaction reordering (2010) (11)
On Debugging Non-Answers in Keyword Search Systems (2015) (11)
How to forget the past without repeating it (1990) (10)
In-RDBMS inverted indexes revisited (2014) (10)
Napa: Powering Scalable Data Warehousing with Robust Query Performance at Google (2021) (10)
Toward Progress Indicators on Steroids for Big Data Systems (2013) (9)
Minimizing Expansions of Recursions (1989) (9)
Parallel and Distributed Information Systems (2010) (8)
Building a Scalable GeoSpatial DBMS: Technology, Implementation, and Evaluation (1997) (8)
XML views as integrity constraints and their use in query translation (2005) (8)
Space optimization in deductive databases (1995) (8)
An efficient checkpointing method for multicomputers with wormhole routing (1991) (7)
Issues in applying data mining to grid job failure detection and diagnosis (2008) (7)
Efficient storage and query processing of set-valued attributes (2001) (7)
Global memory management for multi-server database systems (1996) (7)
Processing Aggregates in Parallel Database Systems (1994) (7)
Unraveling the duplicate-elimination problem in XML-to-SQL query translation (2004) (6)
The Token Distribution Filter for Approximate String Membership (2011) (6)
A Survey of the Existing Landscape of ML Systems (2015) (6)
Transaction reordering with application to synchronized scans (2008) (6)
XML-to-SQL query translation (2004) (6)
Redundancy in Function-Free Recursive Rules (1986) (6)
Toward industrial-strength keyword search systems over relational data (2010) (6)
Short Notes Low-Latency, Concurrent Checkpointing for Parallel Programs (1994) (5)
When Lempel-Ziv-Welch Meets Machine Learning: A Case Study of Accelerating Machine Learning using Coding (2017) (5)
Query Execution Strategies for Caching Expensive Methods (1996) (5)
Exploiting Data Partitioning To Provide Approximate Results (2018) (5)
Approximate String Membership Checking: A Multiple Filter, Optimization-Based Approach (2012) (4)
TRAC: toward recency and consistency reporting in a database with distributed data sources (2006) (4)
Anonymization techniques for large and dynamic data sets (2007) (4)
Efficient Sorting, Duplicate Removal, Grouping, and Aggregation (2020) (4)
Optimizing Fixed-Schema XML to SQL Query Translation (2002) (4)
Novel query optimization and evaluation techniques (2003) (4)
Incremental Loading of Object Databases (1996) (3)
Optimization of recursive database query languages (1987) (3)
Efficient Evaluation of Right-, Left-, and Multi-Linear Rules (1989) (3)
RESEARCH SELF-ASSESSMENT (2005) (3)
K-relevance: a spectrum of relevance for data sources impacting a query (2007) (2)
DIAMetrics (2021) (2)
Resource bricolage and resource selection for parallel database systems (2017) (2)
Database Support for Weighted Match Joins (2007) (2)
A Unified Approach to Logic Program Evaluation (1989) (2)
Static Optimization of Conjunctive Queries with Sliding Windows over Unbounded Streaming Information Sources (2003) (2)
We are drowning in a sea of least publishable units (LPUs) (2013) (2)
Sparse relational data sets: issues and an application (2008) (2)
Performance issues of multi-dimensional data analysis (1998) (2)
Instrumenting a logic programming language to gather provenance from an information extraction application (2012) (2)
Graph summarization for indexing paths in graph-structured data (2003) (2)
Relational databases for XML indexing (2002) (2)
On optimal differentially private mechanisms for count-range queries (2013) (2)
Transaction Reordering and Grouping for Continuous (2007) (1)
DBMS: Lessons from the first 50 years, speculations for the next 50 (2010) (1)
SIGMOD Digital Symposium Collection (DiSC) Editor's Message. (2000) (1)
IDB: Toward the Scalable Integration of Queryable Internet Data Sources (2000) (1)
HighSim: Highly Effective Similarity Measurement in Large Heterogeneous Information Networks (2016) (1)
InfoNames: An Information-Based Naming Scheme for Multimedia Content (2010) (1)
Technical Perspective: Broadening and Deepening Query Optimization Yet Still Making Progress (2016) (1)
Dealing with (un)structuredness in XML Data and Queries Using Relational Databases (1)
Techniques for operational data warehousing (2004) (1)
Technical Perspective: Natural Language to SQL Translation by Iteratively Exploring a Middle Ground (2016) (1)
On Differentially Private Inductive Logic Programming (2013) (1)
Efficient database support for OLAP queries (on-line analytical processing) (2000) (1)
Transaction reordering with application to synchronized scans (2008) (1)
Towards Building XML Statistics for the Hidden Web (2003) (1)
Implementing information retrieval using a combined object-oriented database/file system paradigm (1996) (0)
The Management of ClassAds in RDBMS and XML Native Storage Systems (2004) (0)
On privacy-preserving data publishing and analysis (2012) (0)
Comprehensive and Efficient Workload Summarization (2022) (0)
Providing Insights for Queries affected by Failures and Stragglers (2020) (0)
Resource bricolage and resource selection for parallel database systems (2016) (0)
Guest Editors' Introduction (2004) (0)
Supporting match joins in relational database management systems (2007) (0)
A Case Study on A Miner Dataset: Identifying leading research through various Models (2019) (0)
Storage and query processing optimizations for hierarchically-organized data (2006) (0)
DIAMETRICS (2022) (0)
Technical Perspective: Optimized Wandering for Online Aggregation (2017) (0)
Have Developed Strategies for Estimating Selectivity Lor, a Linear-time Probabilistic Counting Algorithm for Database Applications," ACM Trans. on 8 a Strawman" Random Sampling Scheme Rs (0)
Suppression Strikes Back: On the Interaction of Thresholding and Differential Privacy (2015) (0)
Resource Bricolage for Parallel DBMSs on Heterogeneous Clusters (2016) (0)
Holistic Cube Analysis: A Query Framework for Data Insights (2023) (0)
The OO7 Benchmark: Current Status & Future Directions (1993) (0)
Applications of data mining to cluster scheduling and failure diagnosis (2009) (0)
Session details: Tutorial 2 (2010) (0)
A Review on Query Result Caching using dynamic data cache (2014) (0)
SIGMOD'2000 Program Chair's Message (2000) (0)
Transparent gathering of provenance during program execution (2010) (0)
Auxiliary Relations for Join View Maintenance in Parallel RDBMS (2002) (0)
Salvaging failing and straggling queries (2022) (0)
DIAMetrics (2020) (0)
Optimization and approximation techniques for data streaming queries (2006) (0)
Locality-Aware Distribution Schemes (2021) (0)
Efficient Tabular Dataset Preparations by the Aggregations in SQL: A Survey (2016) (0)
Relational database management system support for sparse data sets (2006) (0)
Optimizing Function-Free Recursive Inference Rules (1998) (0)
An Intuition of the Necessitate of Column-Oriented Database Systems (2017) (0)
Toward scalable keyword search over structured data (2011) (0)
B No Lockout "An Efficient Protocol for Checkpointing Recovery in Distributed Systems", to Appear in IEEE Trans. Parallel and Appendix (proofs) (1993) (0)
Session details: Distributed data management (2007) (0)
Realizing parallelism in OLTP workloads (2013) (0)
Cost estimation techniques for database systems (2002) (0)
Paradise and Direct Broadcast Satellite: A Solution to Battlefield Data Dissemination for the 21st Century (2001) (0)
IS1 Collected Scientific Publications: 2004 - 2007 References (2007) (0)
Caching for web-based database applications (2002) (0)
Chapter 8: Interactive Analytics (2016) (0)
On interpreting and debugging results of database queries over imprecise data (2008) (0)

Other Resources About Jeffrey Naughton

en.wikipedia.org