Rajeev Rastogi
#149,713
Most Influential Person Now
Rajeev Rastogi's AcademicInfluence.com Rankings
Rajeev Rastogicomputer-science Degrees
Computer Science
#7784
World Rank
#8190
Historical Rank
Data Mining
#189
World Rank
#190
Historical Rank
Machine Learning
#3000
World Rank
#3037
Historical Rank
Database
#4835
World Rank
#5023
Historical Rank

Download Badge
Computer Science
Rajeev Rastogi's Degrees
- PhD Computer Science University of Texas at Austin
- Masters Computer Science University of Texas at Austin
Similar Degrees You Can Earn
Why Is Rajeev Rastogi Influential?
(Suggest an Edit or Addition)Rajeev Rastogi's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- CURE: an efficient clustering algorithm for large databases (1998) (3304)
- Efficient algorithms for mining outliers from large data sets (2000) (2176)
- ROCK: a robust clustering algorithm for categorical attributes (1999) (2078)
- SPIRIT: Sequential Pattern Mining with Regular Expression Constraints (1999) (612)
- on Knowledge and Data Engineering, (1990) (527)
- Approximate query processing using wavelets (2001) (526)
- A cost-based model and effective heuristic for repairing constraints by value modification (2005) (447)
- Efficient filtering of XML documents with XPath expressions (2002) (428)
- WALRUS: a similarity retrieval algorithm for image databases (1999) (367)
- Processing complex aggregate queries over data streams (2002) (354)
- Graph summarization with bounded error (2008) (347)
- Provisioning a virtual private network: a network design problem for multicommodity flow (2001) (302)
- XTRACT: a system for extracting document type descriptors from XML documents (2000) (268)
- PUBLIC: A Decision Tree Classifier that Integrates Building and Pruning (1998) (242)
- Holistic aggregates in a networked world: distributed tracking of approximate quantiles (2005) (218)
- Topology discovery in heterogeneous IP networks (2000) (183)
- Robust Monitoring of Link Delays and Faults in IP Networks (2003) (183)
- Mining optimized association rules with categorical and numeric attributes (1998) (167)
- Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (2016) (166)
- Independence is good: dependency-based histogram synopses for high-dimensional data (2001) (154)
- Disk striping in video server environments (1996) (152)
- Topology discovery in heterogeneous IP networks: the NetInventory system (2004) (150)
- Mining Sequential Patterns with Regular Expression Constraints (2002) (142)
- Data Stream Management: Processing High-Speed Data Streams (Data-Centric Systems and Applications) (2019) (141)
- A Low-Cost Storage Server for Movie on Demand Databases (1994) (131)
- Efficiently monitoring bandwidth and latency in IP networks (2001) (127)
- Data mining and the Web: past, present and future (1999) (125)
- Recommendations to boost content spread in social networks (2012) (123)
- Algorithms for provisioning virtual private networks in the hose model (2001) (121)
- Algorithms for provisioning virtual private networks in the hose model (2002) (113)
- Update propagation protocols for replicated databates (1999) (111)
- Physical topology discovery for large multisubnet networks (2003) (104)
- Entity disambiguation with hierarchical topic models (2011) (103)
- The Fellini Multimedia Storage Server (1996) (99)
- Dalí: A High Performance Main Memory Storage Manager (1994) (99)
- Mining (Social) Network Graphs to Detect Random Link Attacks (2008) (98)
- ConTracts - A Low-Level Mechanism for Building General-Purpose Workflow Management-Systems. (1995) (98)
- Processing set expressions over continuous update streams (2003) (94)
- Main-memory index structures with fixed-size partial keys (2001) (93)
- Buffer replacement algorithms for multimedia storage systems (1996) (92)
- Web-scale information extraction with vertex (2011) (89)
- SPARTAN: a model-based semantic compression system for massive data tables (2001) (87)
- The concurrency control problem in multidatabases: characteristics and solutions (1992) (86)
- Optimal configuration for BGP route selection (2003) (83)
- XTRACT: Learning Document Type Descriptors from XML Document Collections (2004) (80)
- DataBlitz: A High Performance Main-Memory Storage Manager (1994) (78)
- Distributed Set Expression Cardinality Estimation (2004) (76)
- Tree Pattern Aggregation for Scalable XML Data Dissemination (2002) (76)
- A transaction model for multidatabase systems (1992) (75)
- Efficient gossip-based aggregate computation (2006) (73)
- Non-serializable executions in heterogeneous distributed database systems (1991) (73)
- Traveling with a Pez dispenser (or, routing issues in MPLS) (2001) (66)
- Mining optimized support rules for numeric attributes (1999) (65)
- Optimal configuration of OSPF aggregates (2002) (64)
- DTD-Directed Publishing with Attribute Translation Grammars (2002) (62)
- Algorithms for computing QoS paths with restoration (2005) (62)
- Efficient Detection of Distributed Constraint Violations (2007) (61)
- Capturing both types and constraints in data integration (2003) (61)
- Update Propagation Protocols For Replicated Databases (1999) (59)
- Restoration algorithms for virtual private networks in the hose model (2002) (57)
- Building Decision Trees with Constraints (2003) (55)
- Streaming Algorithms for Robust, Real-Time Detection of DDoS Attacks (2007) (55)
- Query translation from XPath to SQL in the presence of recursive DTDs (2009) (55)
- Efficient Constraint Monitoring Using Adaptive Thresholds (2008) (54)
- A framework for the storage and retrieval of continuous media data (1995) (54)
- Mining optimized gain rules for numeric attributes (1999) (54)
- Sketch-Based Multi-Query Processing over Data Streams (2004) (49)
- The Architecture of the Dalí Main-Memory Storage Manager (1997) (49)
- RE-tree: an efficient index structure for regular expressions (2003) (47)
- Tracking set-expression cardinalities over continuous update streams (2004) (45)
- Processing Data-Stream Join Aggregates Using Skimmed Sketches (2004) (44)
- A New Channel Assignment Mechanism for Rural Wireless Mesh Networks (2008) (44)
- Data Stream Management (2016) (42)
- Efficient algorithms for constructing decision trees with constraints (2000) (42)
- Logical and Physical Versioning in Main Memory Databases (1997) (42)
- Ensuring transaction atomicity in multidatabase systems (1992) (41)
- Scalable regular expression matching on data streams (2008) (39)
- Demand paging for video-on-demand servers (1995) (38)
- Scalable Content-Based Routing in Pub/Sub Systems (2009) (38)
- Joint Routing and Scheduling in Multi-hop Wireless Networks with Directional Antennas (2010) (37)
- Exploring the trade-off between label size and stack depth in MPLS routing (2003) (37)
- Exploiting content redundancy for web information extraction (2010) (36)
- Routing and Channel Allocation in Rural Wireless Mesh Networks (2007) (36)
- Fault-tolerant architectures for continuous media servers (1996) (35)
- Robust Monitoring of Link Delays and Faults (2006) (35)
- Algorithms for computing QoS paths with restoration (2003) (33)
- LogUCB: an explore-exploit algorithm for comments recommendation (2012) (33)
- Diagnosing Link-Level Anomalies Using Passive Probes (2007) (31)
- Matching product titles using web-based enrichment (2012) (30)
- Distributed Multi-Level Recovery in Main-Memory Databases (1996) (30)
- Minimum Cost Topology Construction for Rural Wireless Mesh Networks (2008) (29)
- Detecting Anomalies Using End-to-End Path Measurements (2008) (28)
- Web information extraction using markov logic networks (2011) (28)
- The fellini multimedia storage system (1996) (27)
- Workshop report: 2000 ACM SIGMOD workshop on research issues in data mining and knowledge discovery (2000) (27)
- Recommending Product Sizes to Customers (2017) (27)
- Bayesian Models for Product Size Recommendations (2018) (25)
- Robust monitoring of link delays and faults in IP networks (2006) (24)
- On correctness of non-serializable executions (1993) (24)
- Optimal schemes for robust web extraction (2011) (24)
- Data Mining Meets Network Management: The NEMESIS Project (2001) (22)
- Chapter 23 – RE-Tree: An Efficient Index Structure for Regular Expressions (2002) (22)
- Relaxing serializability in multidatabase systems (1992) (22)
- VillageNet: A low-cost, 802.11-based mesh network for rural regions (2007) (22)
- Scalable Filtering of XML Data for Web Services (2003) (21)
- Join-distinct aggregate estimation over update streams (2005) (20)
- A Disk-Based Storage Architecture for Movie on Demand Servers (1995) (19)
- Client-based logging for high performance distributed architectures (1996) (19)
- Exploiting content redundancy for web information extraction (2010) (19)
- Multimedia support for databases (1997) (18)
- Proceedings of the 22nd ACM international conference on Information & Knowledge Management (2013) (18)
- Detection and Recovery Techniques for Database Corruption (2003) (17)
- Buffer Replacement Algorithms for Multimedia Databases (1996) (17)
- On the design of a low-cost video-on-demand storage system (1996) (17)
- Proceedings of the 29th ACM International Conference on Information & Knowledge Management (2013) (16)
- Strict histories in object-based database systems (1993) (15)
- Scheduling and data replication to improve tape jukebox performance (1999) (15)
- Memory-constrained aggregate computation over data streams (2011) (14)
- On the storage and retrieval of continuous media data (1994) (14)
- Scalable data mining with model constraints (2000) (14)
- Research issues in multimedia storage servers (1995) (14)
- DTD Inference from XML Documents: The XTRACT Approach (2003) (14)
- A Clustering Algorithm for Categorical Attributes (1997) (13)
- Improving Predictability of Transaction Execution Times in Real-time Databases (2000) (13)
- VillageNet: A low-cost, IEEE 802.11-based mesh network for connecting rural areas (2007) (13)
- Ensuring consistency in multidatabases by preserving two-level serializability (1998) (13)
- Using semantic knowledge of distributed objects to increase reliability and availability (2001) (12)
- On-line reorganization in object databases (2000) (12)
- Machine Learning in the Real World (2016) (12)
- Scalable algorithms for mining large databases (1999) (12)
- Exploiting transaction semantics in multidatabase systems (1995) (11)
- Oss architecture and requirements for VoIP networks (2005) (10)
- Maintaining Database Consistency in Heterogeneous Distributed DatabaseSystems (1991) (10)
- CRISP: A Probabilistic Model for Individual-Level COVID-19 Infection Risk Estimation Based on Contact Data (2020) (10)
- Accelerating Lookups in P2P Systems using Peer Caching (2008) (10)
- Web information extraction using Markov logic networks (2011) (10)
- Multi-query optimization for sketch-based estimation (2009) (10)
- Monitoring infrastructure for converged networks and services (2007) (9)
- Fine-granularity Locking and Client-Based Logging for Distributed Architectures (1996) (9)
- The architecture of the Dalí main memory storage manager (1997) (9)
- Design of active and passive probes for VoIP service quality monitoring (2006) (9)
- Semi-supervised correction of biased comment ratings (2012) (9)
- Of crawlers, portals, mice, and men: is there more to mining the Web? (1999) (8)
- Using codewords to protect database data from a class of software errors (1999) (8)
- Overcoming Heterogeneity and Autonomy in Multidatabase Systems (2001) (8)
- Data Stream Management: A Brave New World (2016) (8)
- VoIP service quality monitoring using active and passive probes (2006) (8)
- Periodic retrieval of videos from disk arrays (1997) (8)
- Efficient Aggregate Computation over Data Streams (2008) (8)
- Disk striping in video server environments (1995) (7)
- Machine Learning @ Amazon (2017) (7)
- On Correctness of Nonserializable Executions (1998) (6)
- Optimal scheduling for dynamic channel allocation in wireless LANs (2007) (6)
- Guest Editor Introduction: Special Section on Online Analysis and Querying of Continuous Data Streams (2003) (5)
- Strict Histories in Object-Based Database (1992) (5)
- Conclusions and Looking Forward (2016) (5)
- SPARTAN: using constrained models for guaranteed-error semantic compression (2002) (5)
- Demand Paging for Movie-on-demand Servers. in 5 Buuer Management Issues 3 Disk Striping Issues 4 Fault-tolerance Issues Research Issues in Multimedia Storage Servers (1995) (5)
- Transaction Management Issues in a Failure-prone Multidatabase System Environment (1992) (4)
- Transcending the Serializability Requirement (1993) (4)
- Physical and service topology discovery in heterogeneous networks: the NetInventory system (2004) (3)
- Gossip-Based Aggregate Computation with Low Communication Overhead (2006) (3)
- Connecting the next billion web users (2011) (3)
- Network Data Mining and Analysis: The NEMESIS Project (2002) (3)
- Machine Learning @ Amazon (2018) (2)
- THE DATABLITZ MAIN-MEMORY STORAGE MANAGER: ARCHITECTURE, PERFORMANCE, AND EXPERIENCE (1998) (2)
- Techniques for Clustering Massive Data Sets (2003) (2)
- Model-Based Semantic Compression for Network-Data Tables (2001) (2)
- On Configuring BGP Route Reflectors (2007) (2)
- Ensuring integrity of network inventory and configuration data (2004) (2)
- Proceedings of the 20th International Middleware Conference Industrial Track (2010) (2)
- Architecture issues in multimedia storage systems (1997) (1)
- Probabilistic matrix factorization system based on personas (2015) (1)
- Indexed Regular Expression Matching (2016) (1)
- Analysis and Querying of Continuous Data Streams (2003) (1)
- DataBlitz Storage Manager: Main Memory Database Performance for Critical Applications (1999) (1)
- Machine Learning @ Amazon (2015) (1)
- Efficient Global Transaction Management in Multidatabase Systems (1993) (1)
- Network Data Mining and Analysis: The \( \mathcal{N}\mathcal{E}\mathcal{M}\mathcal{E}\mathcal{S}\mathcal{I}\mathcal{S} \) Project (2002) (1)
- MobiCom'18 Panel: Hammer & Nail vis-a-vis AI / ML Applications to Networked Systems (2018) (1)
- A Scalable Algorithm for Higher-order Features Generation using MinHash (2018) (1)
- Efficient Design of End-to-End Probes for Source-Routed Networks (2007) (1)
- Granularity of Locks and Degr of Consistency in a Shared Data Base. in Ifip Working Conference on Modeling of D Base Management Systems, Pages 1{29, 1975. [gm83] H. Garcia-molina. Using Semantic Knowledge for Transaction Processing in a Distribu Database (2010) (0)
- Video service provision method and video server (1996) (0)
- VLDB 2002, Proceedings of 28th International Conference on Very Large Data Bases, August 20-23, 2002, Hong Kong, China (2002) (0)
- Traveling with a Pez (cid:3) Dispenser (Or, Routing Issues in MPLS) (2008) (0)
- Augmenting handset capacity through virtual storage (2007) (0)
- Proceedings, Fourth IEEE International Conference on Data Mining, ICDM 2004, 1-4 November 2004, Brighton, United Kingdom (2004) (0)
- Internet Research: What's hot in Search, Advertizing, and Cloud Computing. (2008) (0)
- Welcome Message from the Conference Chairs (2005) (0)
- Tion of Buuer Management Strategies for Rela- Tional Database Systems. in Proceedings of The (1996) (0)
- 7 Related Work 8 Concluding Remarks 6 Reducing Response Time (1994) (0)
- Peer Caching for Faster Lookups in P 2 P Systems (2006) (0)
- R Regular Expression Indexing Regular Expression Indexing (0)
- Of Crawlers, Portals, Mice and Men: Is there more to Mining the Web? (Panel) (1999) (0)
- Optimal Scheduling forDynamicChannel Allocation inWireless LANs (2007) (0)
- Proceedings of the 31st International Conference on Very Large Data Bases, Trondheim, Norway, August 30 - September 2, 2005 (2005) (0)
- Session details: Research Session 13: Graphs II (2008) (0)
- Enhancing Pre-existing Data Managers with Atomicity and Durability (1994) (0)
- Building Knowledge Bases from the Web (2012) (0)
- Overcoming Heterogeneity and Autonomy in Multidatabase Systems1 (2022) (0)
- Can Be Computed Recursively Using the Definitions of (0)
- Globalization: challenges to database community (2006) (0)
- Reminiscences on influential papers (2001) (0)
- Regular Expression Indexing (2008) (0)
- A Greedy Scheme for Designing Delay Monitoring Systems of IP Networks (2008) (0)
- High-Precision Web Extraction Using Site Knowledge (2010) (0)
- Industrial Track Panel Globalization: Challenges to Database Community (2006) (0)
This paper list is powered by the following services: