Héctor García-molina
#66,569
Most Influential Person Now
Héctor García-molina's AcademicInfluence.com Rankings
Héctor García-molinacomputer-science Degrees
Computer Science
#2201
World Rank
#2290
Historical Rank
Information Technology
#2
World Rank
#2
Historical Rank
Database
#195
World Rank
#202
Historical Rank

Download Badge
Computer Science
Héctor García-molina's Degrees
- PhD Computer Science Princeton University
- Masters Computer Science Stanford University
- Bachelors Computer Science National University of Rosario
Similar Degrees You Can Earn
Why Is Héctor García-molina Influential?
(Suggest an Edit or Addition)Héctor García-molina's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- The Eigentrust algorithm for reputation management in P2P networks (2003) (3938)
- Similarity flooding: a versatile graph matching algorithm and its application to schema matching (2002) (1666)
- Database Systems: The Complete Book (2001) (1479)
- The TSIMMIS Project: Integration of Heterogeneous Information Sources (1994) (1278)
- Combating Web Spam with TrustRank (2004) (1258)
- Designing a super-peer network (2003) (1042)
- Efficient Crawling Through URL Ordering (1998) (1029)
- Object exchange across heterogeneous information sources (1995) (1010)
- Routing indices for peer-to-peer systems (2002) (920)
- Crawling the Hidden Web (2001) (849)
- Improving search in peer-to-peer networks (2002) (814)
- Scheduling real-time transactions: a performance evaluation (1988) (778)
- The TSIMMIS Approach to Mediation: Data Models and Languages (1997) (750)
- Searching the Web (2001) (715)
- Elections in a Distributed Computing System (1982) (673)
- Web Spam Taxonomy (2005) (662)
- The Evolution of the Web and Implications for an Incremental Crawler (2000) (646)
- Semantic Overlay Networks for P2P Systems (2004) (631)
- How to assign votes in a distributed system (1985) (623)
- Change detection in hierarchically structured information (1996) (614)
- View maintenance in a warehousing environment (1995) (596)
- Copy detection mechanisms for digital documents (1995) (586)
- Sagas (1987) (577)
- Using semantic knowledge for transaction processing in a distributed database (1983) (574)
- Main Memory Database Systems: An Overview (1992) (573)
- Consistency in a partitioned network: a survey (1985) (547)
- Can social bookmarking improve web search? (2008) (544)
- Swoosh: a generic approach to entity resolution (2008) (543)
- The Management of Probabilistic Data (1992) (530)
- Taxonomy of trust: Categorizing P2P reputation systems (2006) (507)
- From User Access Patterns to Dynamic Hypertext Linking (1996) (504)
- Database System Implementation (2000) (464)
- Extracting structured data from Web pages (2003) (446)
- Collaborative Creation of Communal Hierarchical Taxonomies in Social Tagging Systems (2006) (436)
- Synchronizing a database to improve freshness (2000) (434)
- Computing Iceberg Queries Efficiently (1998) (434)
- Overview of multidatabase transaction management (1992) (403)
- Exploiting hierarchical domain structure to compute similarity (2003) (396)
- Estimating frequency of change (2003) (384)
- Extracting Semistructured Information from the Web. (1997) (382)
- Social tag prediction (2008) (378)
- GlOSS: text-source discovery over the Internet (1999) (376)
- SCAM: A Copy Detection Mechanism for Digital Documents (1995) (369)
- Data caching issues in an information retrieval system (1990) (368)
- Meaningful change detection in structured data (1997) (368)
- Parallel crawlers (2002) (368)
- Disk striping (1986) (361)
- EigenRep: Reputation Management in P2P Networks (2003) (356)
- Efficient search in peer to peer networks (2004) (344)
- SIFT - a Tool for Wide-Area Information Dissemination (1995) (338)
- Time as essence for photo browsing through personal digital libraries (2002) (336)
- Comparing Hybrid Peer-to-Peer Systems (2001) (331)
- Two Can Keep A Secret: A Distributed Architecture for Secure Database Services (2005) (329)
- Seeing the whole in parts: text summarization for web browsing on handheld devices (2001) (328)
- Object Fusion in Mediator Systems (1996) (319)
- Generalizing GlOSS to Vector-Space Databases and Broker Hierarchies (1995) (317)
- Power browser: efficient Web browsing for PDAs (2000) (312)
- Publish/Subscribe in a Mobile Environment (2004) (311)
- Fighting Spam on Social Web Sites: A Survey of Approaches and Future Challenges (2007) (291)
- Online Balancing of Range-Partitioned Data with Applications to Peer-to-Peer Systems (2004) (288)
- Effective page refresh policies for Web crawlers (2003) (283)
- The SIFT information dissemination system (1999) (277)
- A measure of transaction processing power (1985) (276)
- PPay: micropayments for peer-to-peer systems (2003) (276)
- Deadline assignment in a distributed soft real-time system (1993) (274)
- The Asilomar report on database research (1998) (272)
- One torus to rule them all: multi-dimensional queries in P2P systems (2004) (272)
- Template-based wrappers in the TSIMMIS system (1997) (272)
- Streaming Live Media over a Peer-to-Peer Network (2001) (270)
- Limited reputation sharing in P2P systems (2004) (268)
- Open Problems in Data-Sharing Peer-to-Peer Systems (2003) (265)
- Clustering the tagged web (2009) (264)
- STARTS: Stanford proposal for Internet meta-searching (1997) (257)
- Integrating and Accessing Heterogeneous Information Sources in TSIMMIS (1994) (251)
- The effectiveness of GIOSS for the text database discovery problem (1994) (251)
- Entity resolution with iterative blocking (2009) (251)
- MedMaker: a mediation system based on declarative specifications (1996) (250)
- CrowdScreen: algorithms for filtering data with humans (2012) (246)
- Proximity Search in Databases (1998) (241)
- The Claremont report on database research (2008) (238)
- A Query Translation Scheme for Rapid Implementation of Wrappers (1995) (236)
- Canon in G major: designing DHTs with hierarchical structure (2004) (235)
- The Lowell database research self-assessment (2003) (234)
- The Stanford Data Warehousing Project (1995) (226)
- WebBase: a repository of Web pages (2000) (225)
- Index structures for selective dissemination of information under the Boolean model (1994) (225)
- Simrank++: query rewriting through link analysis of the click graph (2007) (224)
- Web graph similarity for anomaly detection (2010) (222)
- Scheduling real-time transactions (1988) (221)
- Data Leakage Detection (2011) (220)
- Automatic organization for digital photographs with geographic coordinates (2004) (219)
- Ordered and reliable multicast communication (1991) (218)
- Context data in geo-referenced digital photo collections (2004) (211)
- Read-only transactions in a distributed database (1982) (208)
- Link Spam Alliances (2005) (208)
- Representing Web graphs (2003) (199)
- Exploiting Geographical Location Information of Web Pages (1999) (194)
- The Strobe algorithms for multi-source warehouse consistency (1996) (194)
- Scheduling Real-Time Transactions with Disk Resident Data (1989) (189)
- Building a scalable and accurate copy detection mechanism (1996) (188)
- So who won?: dynamic max discovery with the crowd (2012) (188)
- Applying update streams in a soft real-time database system (1995) (187)
- Incremental updates of inverted lists for text document retrieval (1994) (184)
- Finding replicated Web collections (2000) (178)
- Leveraging context to resolve identity in photo albums (2005) (178)
- Pay-As-You-Go Entity Resolution (2013) (176)
- Link spam detection based on mass estimation (2006) (175)
- Accordion summarization for end-game browsing on PDAs and cellular phones (2001) (174)
- Interoperability for digital libraries worldwide (1998) (171)
- Question Selection for Crowd Entity Resolution (2013) (167)
- Efficient web browsing on handheld devices using page and form summarization (2002) (167)
- FlexRecs: expressing and combining flexible recommendations (2009) (165)
- Similarity Flooding: A Versatile Graph Matching Algorithm (Extended Technical Report) (2001) (162)
- Human-assisted graph search: it's okay to ask questions (2011) (153)
- Semistructured Data: The Tsimmis Experience (1997) (152)
- Scheduling I/O requests with deadlines: A performance evaluation (1990) (152)
- Modeling long-running activities as nested sagas (1991) (151)
- Publish/Subscribe in a mobile enviroment (2001) (146)
- An Overview of Real-Time Database Systems (1995) (145)
- Building a distributed full-text index for the Web (2001) (145)
- Clustering for Approximate Similarity Search in High-Dimensional Spaces (2002) (144)
- Combating spam in tagging systems (2007) (143)
- Deco: declarative crowdsourcing (2012) (140)
- Recommendation systems with complex constraints: A course recommendation perspective (2011) (140)
- Capability based mediation in TSIMMIS (1998) (140)
- SPROUT: P2P Routing with Social Networks (2004) (137)
- Efficient Snapshot Differential Algorithms for Data Warehousing (1996) (134)
- Database systems - the complete book (2. ed.) (2009) (133)
- Finding near-replicas of documents on the Web (1999) (133)
- The Reliability of Voting Mechanisms (1987) (132)
- Correcting for missing data in information cascades (2011) (131)
- Performance of inverted indices in shared-nothing distributed text document information retrieval systems (1993) (121)
- The price of validity in dynamic networks (2004) (121)
- Graph structured views and their incremental maintenance (1998) (120)
- Building a distributed full-text index for the web (2001) (119)
- Max algorithms in crowdsourcing environments (2012) (119)
- YAPPERS: a peer-to-peer lookup service over arbitrary topology (2003) (118)
- Query-flood DoS attacks in gnutella (2002) (117)
- DHT Routing Using Social Links (2004) (116)
- Finding Near-Replicas of Documents and Servers on the Web (1998) (115)
- Questioning Yahoo! Answers (2007) (114)
- Identity crisis: anonymity vs reputation in P2P systems (2003) (113)
- SLIC: a selfish link-based incentive mechanism for unstructured peer-to-peer networks (2004) (112)
- Publish/Subscribe Tree Construction in Wireless Ad-Hoc Networks (2003) (110)
- STARTS: Stanford Protocol Proposal for Internet Retrieval and Search (1997) (109)
- Concurrency Control and Recovery for Global Procedures in Federated Database Systems (1987) (109)
- Multiple view consistency for data warehousing (1997) (107)
- Data clouds: summarizing keyword search results over structured data (2009) (106)
- Boolean Query Mapping Across Heterogeneous Information Sources (1996) (106)
- Evaluating the crowd with confidence (2013) (105)
- Mind your vocabulary: query mapping across heterogeneous information sources (1999) (105)
- Challenges in Data Crowdsourcing (2016) (103)
- Debugging a Distributed Computing System (1984) (102)
- Using Distributed Objects for Digital Library Interoperability (1996) (101)
- A Massive Memory Machine (1984) (100)
- Expiring Data in a Warehouse (1998) (100)
- Information translation, mediation, and mosaic-based browsing in the TSIMMIS system (1995) (99)
- Interoperability, Scaling, and the Digital Libraries Research Agenda. (1996) (98)
- Spam: it's not just for inboxes anymore (2005) (96)
- A System Prototype for Warehouse View Maintenance (1996) (96)
- Index structures for information filtering under the vector space model (1993) (93)
- Making trust explicit in distributed commerce transactions (1996) (92)
- Computing capabilities of mediators (1999) (91)
- Indexing Boolean Expressions (2009) (91)
- From Where to What: Metadata Sharing for Digital Photographs with Geographic Coordinates (2003) (89)
- Deco: A System for Declarative Crowdsourcing (2012) (89)
- Entity resolution with evolving rules (2010) (89)
- Applications of Byzantine agreement in database systems (1986) (88)
- Focused Web searching with PDAs (2000) (88)
- Coordinating multi-transaction activities (1990) (87)
- The Demarcation Protocol: A Technique for Maintaining Linear Arithmetic Constraints in Distributed Database Systems (1992) (87)
- Increasing availability under mutual exclusion constraints with dynamic vote reassignment (1989) (87)
- Performance of update algorithms for replicated data in a distributed database (1979) (86)
- Node Autonomy In Distributed Systems (1988) (86)
- Performance Issues in Incremental Warehouse Maintenance (2000) (85)
- Improving Search in Peer-to-Peer Systems (2001) (85)
- Evaluating entity resolution results (2010) (85)
- Synthesizing view definitions from data (2010) (84)
- Information seeking (2011) (82)
- Incentives for Combatting Freeriding on P2P Networks (2003) (80)
- Message ordering in a multicast environment (1989) (79)
- Checkpointing memory-resident databases (1989) (79)
- Stanford WebBase components and applications (2006) (79)
- The WHIPS prototype for data warehouse creation and maintenance (1997) (79)
- Streaming Live Media over Peers (2002) (78)
- System M: A Transaction Processing Testbed for Memory Resident Data (1990) (76)
- Merging Ranks from Heterogeneous Internet Sources (1997) (76)
- Altruistic locking (1994) (75)
- Adaptive algorithms for set containment joins (2003) (75)
- Distributed Databases (1995) (74)
- Crawler-Friendly Web Servers (2000) (74)
- On the selection of tags for tag clouds (2011) (73)
- Adaptive peer-to-peer topologies (2004) (72)
- Peer-to-peer data trading to preserve information (2002) (72)
- Management of a remote backup copy for disaster recovery (1991) (71)
- Combating spam in tagging systems: An evaluation (2008) (71)
- Effective Memory Use in a Media Server (1997) (71)
- Ad Hoc, self-supervising peer-to-peer search networks (2005) (70)
- Vision Paper: Enabling Privacy for the Paranoids (2004) (68)
- Consistency Algorithms for Multi-Source Warehouse View Maintenance (2004) (68)
- Optimizing Large Join Queries in Mediation Systems (1999) (68)
- Efficient resumption of interrupted warehouse loads (2000) (68)
- Distributing data for secure database services (2011) (67)
- Quasi-Copies: Efficient Data Sharing for Information Retrieval Systems (1988) (67)
- Efficient Web form entry on PDAs (2001) (66)
- Shrinking the warehouse update Window (1999) (65)
- Entity Resolution with crowd errors (2015) (65)
- Predicate rewriting for translating Boolean queries in a heterogeneous information system (1999) (64)
- The vulnerability of vote assignments (1986) (61)
- Emulating soft real-time scheduling using traditional operating system schedulers (1994) (60)
- Crowd-powered find algorithms (2014) (59)
- Turkalytics: analytics for human computation (2011) (59)
- Coordinating activities through extended sagas: a summary (1991) (59)
- Incremental entity resolution on rules and data (2014) (58)
- Parallel and Distributed SystemS (2013) (58)
- Reliability issues for fully replicated distributed databases (1982) (57)
- Database Support for Efficiently Maintaining Derived Data (1996) (57)
- Bidding for storage space in a peer-to-peer data preservation system (2002) (57)
- Subtask deadline assignment for complex distributed soft real-time tasks (1994) (56)
- Query Merging: Improving Query Subscription Processing in a Multicast Environment (2003) (55)
- The Claremont report on database research (2009) (54)
- Archival storage for digital libraries (1998) (54)
- The case for controlled inconsistency in replicated data (1990) (53)
- Generic Entity Resolution with Data Confidences (2006) (53)
- Data-Pach: Integrating Inconsistent Copies of a Database After a Partition (1983) (52)
- Generic entity resolution with negative rules (2009) (52)
- SLiMFast: Guaranteed Results for Data Fusion and Source Reliability (2015) (52)
- Tagging human knowledge (2010) (52)
- Reliable distributed database management (1987) (52)
- Third generation TP monitors: a database challenge (1993) (52)
- Precision and recall of GlOSS estimators for database discovery (1994) (51)
- Towards the web of concepts (2010) (51)
- SeeDB: visualizing database queries efficiently (2013) (51)
- Peer-to-peer research at Stanford (2003) (49)
- D-Swoosh: A Family of Algorithms for Generic, Distributed Entity Resolution (2007) (49)
- Transience of peers & streaming media (2003) (49)
- A Probalilistic Relational Data Model (1990) (49)
- Interactive data exploration with smart drill-down (2014) (49)
- The demarcation protocol: A technique for maintaining constraints in distributed database systems (1994) (49)
- Comprehensive and reliable crowd assessment algorithms (2014) (48)
- Quality control for comparison microtasks (2012) (48)
- Where in the world is my data? (2011) (48)
- Distributed selective dissemination of information (1994) (48)
- Waldo: An Adaptive Human Interface for Crowd Entity Resolution (2017) (48)
- Evaluating GUESS and non-forwarding peer-to-peer search (2004) (47)
- Estimating Aggregates on a Peer-to-Peer Network (2003) (47)
- Using Distributed Objects to Build the Stanford Digital Library Infobus (1999) (47)
- Optimal Crowd-Powered Rating and Filtering Algorithms (2014) (46)
- Joint Entity Resolution (2012) (44)
- BubbleUp: low latency fast-scan for media servers (1997) (44)
- Issues in disaster recovery (1990) (44)
- Integrating Diverse Information Management Systems: A Brief Survey (2001) (43)
- Policies for Dynamic Vote Reassignment (1986) (43)
- Replicated Data Management in Mobile Environments: Anything New Under the Sun? (1994) (43)
- A mediation infrastructure for digital library services (2000) (43)
- Smart Filesystems (1991) (42)
- The Efficacy of GlOSS for the Text Database Discovery Problem (1993) (42)
- Aggressive Transmissions of Short Messages Over Redundant Paths (1994) (41)
- The SCAM Approach to Copy Detection in Digital Libraries (1995) (41)
- Overview of the STanford Real-time Information Processor (STRIP) (1996) (41)
- dSCAM: finding document copies across multiple databases (1996) (41)
- Proceedings of the 1990 ACM SIGMOD International Conference on Management of Data, Atlantic City, NJ, USA, May 23-25, 1990 (1990) (40)
- Report on the workshop on heterogenous database systems held at Northwestern University Evanston, Illinois, December 11-13, 1989 sponsored by NSF (1990) (40)
- Generic Entity Resolution in the SERF Project (2006) (40)
- Altruistic Locking: A Strategy for Coping with Long Lived Transactions (1987) (40)
- Reliable scheduling in a TMR database system (1989) (39)
- A toolkit for constraint management in heterogeneous information systems (1996) (39)
- Addressing the Non-Cooperation Problem in Competitive P2P Systems (2003) (39)
- Performance of Inverted Indices in Distributed Text Document Retrieval Systems (1993) (39)
- Implementing a Reliable Digital Object Archive (2000) (39)
- Privacy, Preservation and Performance: The 3 P's of Distributed Data Management (2008) (38)
- PhotoSpread: A Spreadsheet for Managing Photos (2008) (38)
- A Model for Data Leakage Detection (2009) (38)
- Evaluation of remote backup algorithms for transaction-processing systems (1994) (38)
- Scheduling Soft Real-Time Jobs Over Dual Non-Real-Time Servers (1996) (38)
- Beyond document similarity: understanding value-based search and browsing technologies (2000) (36)
- Evaluating, combining and generalizing recommendations with prerequisites (2010) (35)
- Synthetic workload performance analysis of incremental updates (1994) (35)
- Capability-sensitive query processing on Internet sources (1999) (35)
- Query processing and inverted indices in shared-nothing text document information retrieval systems (1993) (34)
- Protocols for dynamic vote reassignment (1986) (34)
- Exploiting symmetries for low-cost comparison of file copies (1988) (34)
- Complex Queries over Web Repositories (2003) (34)
- Safeguarding and charging for information on the Internet (1998) (34)
- Research directions for distributed databases (1990) (33)
- SIL: Modeling and Measuring Scalable Peer-to-Peer Search Networks (2003) (33)
- Wave-indices: indexing evolving databases (1997) (33)
- Using Ad-hoc Inter-vehicle Networks For Regional Alerts (2004) (33)
- An overview of the deco system: data model and query language; query processing and optimization (2013) (33)
- Caching and database scaling in distributed shared-nothing information retrieval systems (1993) (32)
- Approximate Query Translation Across Heterogeneous Information Sources (2000) (32)
- Achieving high availability in distributed databases (1987) (32)
- Contrasting Controlled Vocabulary and Tagging: Experts Choose the Right Names to Label the Wrong Things (2009) (32)
- Recsplorer: recommendation algorithms based on precedence mining (2010) (31)
- Mutual exclusion in partitioned distributed systems (1986) (30)
- Towards Interoperability in Digital Libraries: Overview and Selected Highlights of the Stanford Digital Library Project (1997) (30)
- Recommendations with prerequisites (2009) (30)
- The STRIP rule system for efficiently maintaining derived data (1997) (29)
- Web Content Categorization Using Link Information (2006) (29)
- Duplicate Removal in Information Dissemination (1998) (29)
- Shopping models: a flexible architecture for information commerce (1997) (29)
- InterPay: Managing Multiple Payment Mechanisms in Digital Libraries (1995) (29)
- CourseRank: a social system for course planning (2009) (29)
- Non-Cooperation in Competitive P2P Networks (2005) (29)
- Human-Powered Top-k Lists (2013) (28)
- Fusion Queries over Internet Databases (1998) (28)
- Peer-to-Peer Resource Trading in a Reliable Distributed System (2002) (28)
- Efficient Query Subscription Processing in a Multicast Environment (2000) (28)
- Approximate query mapping: Accounting for translation closeness (2001) (28)
- Transaction Management in Multidatabase Systems (1995) (27)
- Identifying users in social networks with limited information (2015) (27)
- Performance Comparison of Two Update Algorithms for Distributed Databases (1978) (27)
- Display advertising impact: search lift and social influence (2011) (27)
- Smart Drill-Down: A New Data Exploration Operator (2015) (27)
- Optimizing the Reliability Provided by Voting Mechanisms (1984) (26)
- CrowdDQS: Dynamic Question Selection in Crowdsourcing Systems (2017) (26)
- P-Swoosh: Parallel Algorithm for Generic Entity Resolution (2006) (26)
- Performance through memory (1987) (25)
- Optimal schemes for robust web extraction (2011) (24)
- Adlib: a self-tuning index for dynamic peer-to-peer systems (2005) (24)
- Reducing Initial Latency in Media Servers (1997) (24)
- Conjunctive constraint mapping for data translation (1998) (24)
- Simrank++: query rewriting through link analysis of the clickgraph (poster) (2008) (24)
- The Future of Data Replication (1986) (23)
- Report on the May 18-19 1995 IITA Digital Libraries Workshop: Final Draft for Participant Review, August 4, 1995 (1997) (23)
- Eecient Snapshot Diierential Algorithms for Data Warehousing (1996) (23)
- Peer-to-Peer Data Management (2002) (23)
- Two Epoch Algorithms for Disaster Recovery (1990) (23)
- Computing Iceberg Queries E ciently (1998) (22)
- CourseRank: A Closed-Community Social System through the Magnifying Glass (2009) (22)
- tDP: An Optimal-Latency Budget Allocation Strategy for Crowdsourced MAXIMUM Operations (2015) (22)
- An implementation of reliable broadcast using an unreliable multicast facility (1988) (22)
- Creating trading networks of digital archives (2001) (21)
- Elections inaDistributed Computing System (1982) (21)
- DataSift: An Expressive and Accurate Crowd-Powered Search Toolkit (2013) (21)
- Distributed and parallel computing issues in data warehousing (abstract) (1998) (20)
- Should Ad Networks Bother Fighting Click Fraud? (Yes, They Should.) (2008) (20)
- File system design using large memories (1990) (20)
- Assigning textual names to sets of geographic coordinates (2006) (20)
- Entity Resolution: Overview and Challenges (2004) (19)
- Exactly-once semantics in a replicated messaging system (2001) (19)
- Divide-and-Conquer Algorithm for Computing Set Containment Joins (2002) (19)
- Indexing in a Hypertext Database (1990) (19)
- Filtering with Approximate Predicates (1998) (19)
- Hybrid Strategies for Finding the Max with the Crowd: Technical Report (2014) (18)
- CourseCloud: summarizing and refining keyword searches over structured data (2009) (18)
- Cost-driven design for archival repositories (2001) (18)
- SWAPEROO: A Simple Wallet Architecture for Payments, Exchanges, Refunds, and Other Operations (1998) (17)
- Modeling Archival Repositories for Digital Libraries (2000) (17)
- An experimental evaluation of crash recovery machanisms (1985) (17)
- U-PAI: A Universal Payment Application Interface (1996) (17)
- Studying Search Networks with SIL (2003) (17)
- Evaluation of ESI and Class-Based Delta Encoding (2003) (17)
- Pong-cache poisoning in GUESS (2004) (16)
- Tagging with Queries: How and Why? (2009) (16)
- The Gold Mailer (1993) (16)
- Flexible Constraint Management for Autonomous Distributed Databases (1994) (16)
- Database systems - the complete book (international edition) (2002) (16)
- Flexible recommendations over rich data (2008) (16)
- The design of a document database (2000) (16)
- Attribute-based Crowd Entity Resolution (2016) (16)
- Peer-to-peer data preservation through storage auctions (2005) (16)
- Reliable broadcast in networks with nonprogrammable servers (1988) (15)
- Performance Issues in Distributed Shared-Nothing Information-Retrieval Systems (1996) (15)
- Overview of disaster recovery for transaction processing systems (1990) (15)
- Parameterized subscriptions in publish/subscribe systems (2007) (15)
- Evaluating the cost of Boolean query mapping (1997) (14)
- Recommendation Systems with Complex Constraints: A CourseRank Perspective (2011) (14)
- Summarization of Web pages on Handheld Devices (2001) (14)
- Global consistency constraints considered harmful for heterogeneous database systems (1991) (14)
- Interactive Data Exploration with Smart Drill-Down (2019) (14)
- Reducing initial latency in a multimedia storage system (1996) (14)
- Constraint Management in Loosely Coupled Distributed Databases (1993) (14)
- Web graph similarity for anomaly detection (poster) (2008) (14)
- DataSift: a crowd-powered search toolkit (2014) (13)
- Coping with Limited Capabilities of Source (1999) (13)
- Update propagation in Bakunin data networks (1987) (13)
- Aggressive transmissions over redundant paths (1991) (13)
- Semantic Overlay Networks (2003) (13)
- Pair-Wise entity resolution: overview and challenges (2006) (13)
- Social Systems: Can We Do More Than Just Poke Friends? (2009) (13)
- Joint entity resolution on multiple datasets (2013) (13)
- Distributed and Parallel Computing Issues in Data Warehousing (Invited Talk) (1998) (13)
- Evaluation of remote backup algorithms for transaction processing systems (1992) (13)
- Database Processing with Triple Modular Redundancy (1986) (13)
- Flexible Recommendations for Course Planning (2009) (12)
- Efficient Dissemination of Information on the Internet (1996) (12)
- Object Fusion in Mediator Systems (Extended Version) (1995) (12)
- Enabling Privacy for the Paranoids (2004) (12)
- Proceedings of the first international conference on Parallel and distributed information systems (1991) (12)
- Awareness Services for Digital Libraries (1997) (12)
- Optimizing Shadow Recovery Algorithms (1988) (12)
- How Expensive is Data Replication? An Example (1982) (12)
- Requirements Specification for a Temporal Extension to the Relationsl Model. (1988) (11)
- InfoMonitor: unobtrusively archiving a World Wide Web server (2005) (11)
- The Performance of a Concurrency Control Mechanism that Exploits Semantic Knowledge (1985) (11)
- Clindex: Clustering for Similarity Queries in High-Dimensional Spaces. (1999) (11)
- A Model for Quantifying Information Leakage (2012) (11)
- Services for a Workflow Management System (1994) (11)
- Building the InfoBus: A Review of Technical Choices in the Stanford Digital Library Project (2000) (11)
- Replicated condition monitoring (2001) (11)
- Automatically generating metadata for digital photographs with geographic coordinates (2004) (11)
- SIL: A model for analyzing scalable peer-to-peer search networks (2006) (10)
- MEDIC: a memory and disk cache for multimedia clients (1999) (10)
- Exploiting Correlations for Expensive Predicate Evaluation (2014) (10)
- Compare Me Maybe : Crowd Entity Resolution Interfaces (2012) (10)
- Protecting the PIPE from malicious peers (2002) (10)
- Developments in Generic Entity Resolution (2011) (10)
- A path-based approach for web page retrieval (2012) (10)
- A Dynamic Navigation Guide for Webpages (2009) (10)
- How To Safeguard Your Sensitive Data (2006) (10)
- 2D BubbleUp: Managing Parallel Disks for Media Servers (1998) (10)
- STARTS: Stanford Proposal for Internet Meta-Searching (Experience Paper) (1997) (10)
- The Vulnerability of Voting Mechanisms (1984) (9)
- Dewey Meets Turing: Librarians, Computer Scientists, and the Digital Libraries Initiative (2005) (9)
- Partial lookup services (2003) (9)
- Collaborative Value Filtering on the Web (1998) (9)
- Updating an Existing Social Graph Snapshot via a Limited API (2016) (9)
- Modeling Reputation and Incentives in Online Trade (2004) (9)
- Reprint of: Efficient crawling through URL ordering (2012) (9)
- Disinformation techniques for entity resolution (2013) (9)
- Performance of update algorithms for replicated data (1981) (8)
- The cost of data replication (1981) (8)
- Is byzantine agreement useful in a distributed database? (1984) (8)
- Managing Information Leakage (2011) (8)
- An extensible constructor tool for the rapid, interactive design of query synthesizers (1998) (8)
- Maximizing remote work in flooding-based peer-to-peer systems (2003) (8)
- Evaluating Reputation Systems for Document Authenticity (2003) (8)
- Top-K Entity Resolution with Adaptive Locality-Sensitive Hashing (2019) (8)
- Dynamic Max Algorithms in Crowdsourcing Environments (2012) (8)
- Comparing Hybrid Peer-to-Peer Systems (extended) (2000) (8)
- Recovery in a Triple Modular Redundant Database System (1987) (7)
- Taxonomy of trust : Categorizing P 2 P reputation systems q (2005) (7)
- Extracting structured data from Web pages (Poster) (2003) (7)
- Can Tagging Organize Human Knowledge (2008) (7)
- Performance evaluation of reliable distributed systems (1987) (7)
- Reliability Issues for Fully Replicated Databases. (1982) (7)
- Query and data mapping across heterogeneous information sources (2001) (7)
- Authenticity and availability in PIPE networks (2005) (7)
- Duplicate Removal in Information System Dissemination (1995) (7)
- Multicasting a changing repository (2003) (7)
- Information finding in a digital library: the Stanford perspective (1995) (7)
- Apocrypha: Making P2P Overlays Network-aware (2003) (6)
- Managing the quality of CPC traffic (2009) (6)
- Adlib: A Self-Tuning Index for Dynamic P2P Systems. (2005) (6)
- Issues in Parallel Information Retrieval (1994) (6)
- Self-Maintainability of Graph Structured Views (1999) (6)
- Boolean Query Mapping Across Heterogeneous Information Sources (Extended Version) (1997) (6)
- Examining Metrics for Peer-to-Peer Reputation Systems (2003) (6)
- Cloud databases (2010) (6)
- A Concurrency Control Mechanism for Distributed Databases Which Users Centralized Locking Controllers (1979) (6)
- A Sound and Complete Distributed Algorithm for Distributed Commerce Transactions (1996) (6)
- Evaluating Response Time in a Faulty Distributed Computing System (1985) (6)
- Client Clustering for Hiring Modeling in Work Marketplaces (2015) (6)
- Predictive Pricing and Revenue Sharing (2008) (6)
- A sound and complete algorithm for distributed commerce transactions (1999) (6)
- Competitive sourcing for Internet commerce (1998) (5)
- Secure Score Management in Peer-to-Peer Systems (2004) (5)
- The PhotoSpread Query Language (2007) (5)
- Comparing Very Large Database Snapshots (1995) (5)
- Processing of read-only queries at a remote backup (1994) (5)
- On building distributed soft real-time systems (1995) (5)
- How important is metadata? (2002) (5)
- Configurations: Understanding Alternatives for Safeguarding Data (2005) (5)
- Flexible Recommendations in CourseRank (2008) (5)
- Distributed processing of filtering queries in HyperFile (1991) (5)
- Quantifying agent strategies under reputation (2005) (5)
- Cost-based media server design (1998) (5)
- Evaluation of Delivery Techniques for Dynamic Web Content (2003) (5)
- Caching and Database Scaling in Distributed Shard-Nothing Information Retrieval Systems (1993) (4)
- Distributed Commerce Transactions (1997) (4)
- HyperFile: A data and query model for documents (2005) (4)
- A Generalized Digital Wallet Architecture (2000) (4)
- Multicasting a Web Repository (2001) (4)
- An Expressive Model for Comparing Tree-Structured Data (1997) (4)
- Web information management: past, present and future (2008) (4)
- Sponsored search auctions with conflict constraints (2012) (4)
- Fusion Query Optimization (1996) (4)
- Exploiting Hierarchical Domain Structure to Compute Similarity 1 (4)
- Non-deterministic queue operations (1991) (4)
- Smart Drill Down (2014) (4)
- Divide-and-Conquer Algorithm for Computing Set Containment Joins (Extended Technical Report) (2001) (4)
- Evaluating Entity Resolution Results (Extended version) (2009) (4)
- Distributed and parallel computing issues in data warehousing (abstract) (1998) (3)
- Output URL Bidding (2010) (3)
- Interoperability In Multidatabases: Semantic and System Issues (Panel) (1991) (3)
- Data Management with Massive Memory: A Summary (1991) (3)
- Challenges in Crawling the Web (2003) (3)
- Accounting for Memory Use, Cost, Througput, and Latency in the Design of a Media Server (1998) (3)
- Maintaining Availability of Replicated Data in a Dynamic Failure Environment (1987) (3)
- Efficient Queries in Peer-to-Peer Systems (2005) (3)
- The Stanford Archival Repository Project: Preserving our digital past (2009) (3)
- Finding with the Crowd Anish (2013) (3)
- Reliably networking a multicast repository (2003) (3)
- Social sites research through CourseRank (2010) (3)
- The Claremont Report on Database (3)
- Extreme Temporal Photo Browsing (2002) (3)
- Interoperability for Digital Libraries : Problems and Directions (1998) (3)
- Distributed Commerce Transactions with Timing Deadlines and Direct Trust (1997) (3)
- Output Bidding: A New Search Advertising Model Complementary to Keyword Bidding (2009) (3)
- Adaptive P2P Topologies (2004) (3)
- Query Processing and Inverted Indices in Distributed Text Document Retrieval Systems (1993) (2)
- Proposal for I**3 Client Server Protocol (1998) (2)
- Economic Design of Reputation Systems (2004) (2)
- Webbase: building a web warehouse (2004) (2)
- On managing continuous media data (1999) (2)
- Event Dissemination in High-Mobility Ad-hoc Networks (2005) (2)
- Multicasting a Web Repository [extended version] (2001) (2)
- Publish-Subscribe Event Dissemination in High-Mobility Networks (2005) (2)
- Handling data quality in entity resolution (2005) (2)
- Attribute-based Crowd Entity Resolution: Technical Report (2016) (2)
- Slicing Broadcast Disks [extended version] (2003) (2)
- Beyond Just Data Privacy (2007) (2)
- Pong-Cache Poisoning in GUESS (Extended Technical Report) (2003) (2)
- The Role Of Massive Memory In Knowledge-Base Management Systems (1986) (2)
- Graph Structured Views and Their Incremental Maintenance (Full version) (1997) (2)
- A Case for Locally-Organized Peer-to-Peer Lookup Services (2002) (2)
- Modeling Archival Repositories for Digital Libraries (Extended Version (1999) (2)
- Applications of web link analysis (2008) (2)
- Approximate Query Translation (Extended Version) (1999) (1)
- Interoperability with unstructured data and services (1993) (1)
- Updating an Existing Social Network Snapshot via a Limited API (2016) (1)
- Multicasting a Changing Repository [extended version] (2002) (1)
- HighSim : Highly Effective Similarity Measurement in Large Heterogeneous Information Networks (2016) (1)
- Partial Lookup Services (Extended Version) (2002) (1)
- Aggressive transmissions over redundant paths for time critical messages (1993) (1)
- Design of Efficient Query Interfaces for Web Sources (2000) (1)
- Review - The Notions of Consistency and Predicate Locks in a Database System (1999) (1)
- Configurations: a model for distributed data storage (2007) (1)
- Dealing with web data (2010) (1)
- Comparing Hybrid Peer-to-Peer Systems (25 page) (2001) (1)
- Implementing a Reliable Digital Object Archive (Extended Version) (2000) (1)
- Exploiting Features for Data Source Quality Estimation (2015) (1)
- Additional Experiments on Negative Rules (2008) (1)
- Merging Hierarchies Using Object Placement (2008) (1)
- Navigating the Web with Query Tags (2009) (1)
- Reliably Networking a Multicast Repository [extended version] (2002) (1)
- Using crowdsourcing for data analytics (2013) (1)
- Bufoosh: Buffering Algorithms for Generic Entity Resolution (2006) (1)
- Implementing Long Lived Transactions . Using Log Record Forwarding (2010) (1)
- Managing parallel disks for continuous media data (2000) (1)
- Joint entity resolution on multiple datasets (2013) (0)
- Power Email: Efficient Email Entry on Pen-Based Handheld Devices (2001) (0)
- Where Have You Been? A Comparison of Three Web Tracking Technologies (1999) (0)
- The Des ign of a Document Database (1988) (0)
- Data crowdsourcing: Is it for real? (2015) (0)
- Link Prediction and Hybrid Strategies for Updating a Social Graph Snapshot via a Limited API (2017) (0)
- Mapping Across HeterogeneousInformation Sources ( Extended Version ) (1996) (0)
- Have Developed Strategies for Estimating Selectivity Lor, a Linear-time Probabilistic Counting Algo- Rithm for Database Applications," Acm Trans. on 8 a Strawman" Random Sampling Scheme Rs (0)
- Chair Vice-Chair Secretary/Treasurer (2005) (0)
- Chapter 1 MANAGING PARALLEL DISKS FOR CONTINUOUS MEDIA DATA (0)
- 4 Future Work X Copy of X Copy of X Copy of X Client Server Location Server Replication and Mobility (0)
- Implementing Multicast Data Dissemination (2004) (0)
- Overview of Search Engine Spamming (2006) (0)
- Future Directions in Database Research (Panel) (1998) (0)
- Title, Subject Index 1(1)-2(4), Author Index 1(1)-2(4), Reviewers (1993) (0)
- Table 11: Result for D2 When Maximum Weights Are Estimated Gloss to Vector-space Databases and Broker Hierar- Gloss to Vector-space Databases and Broker Hier (1999) (0)
- LOADING IN INFORMATION WAREHOUSES (2017) (0)
- Assignment-based partitioning in a condition monitoring system (2002) (0)
- Title, Foreword, Calls for Articles (1993) (0)
- - Note that examples are not curriculum, meaning that the exam will not require knowl- edge of any particular example. (On the other hand it may be useful to read some of the examples to help you understand the text.) (2005) (0)
- Query Recommendation in Hidden Web Search Engine using Web Log Mining Techniques (2020) (0)
- Distributed Computing Research at Princeton (1985) (0)
- Are Disk Arrays Useful for Database Systems? (Panel) (1991) (0)
- Mining Web Activity Logs (2009) (0)
- Reconstruction of Objects Using Lineage (2009) (0)
- An Automatic Annotation Technique for Web Search Results (2020) (0)
- Second Workshop on the Management of Replicated Data, November 12-13, 1992, Monterey, California (1992) (0)
- U-PAI : A Universal Payment Application Interface , v 0 . 93 (1996) (0)
- Privacy,PreservationandPerformance: The3P'sofDistributedDataManagement (2008) (0)
- Approximate Query Translation (1999) (0)
- Collusion and Data Privacy (2007) (0)
- Guest editors' introduction (2005) (0)
- OVLDB 181 Overview of Multidatabase Transaction Management (0)
- A Case Study on A Miner Dataset: Identifying leading research through various Models (2019) (0)
- Introduction to PhotoSpread (2008) (0)
- The Efficacy of GlOSS for the Text Database Retrieval Problem (1994) (0)
- Online Information Search from Tamil Document Images in World Wide Web (2015) (0)
- A Novel Disk Scheduling Algorithm in Real-time Database Systems (2016) (0)
- Title, Preface by the Editors-in-Chief, Preface to Special Issue on PDIS (1993) (0)
- Polynets: providing reliable communications for distributed systems (1990) (0)
- Computing Iceberg Queries Eeciently Paper Number 234 (1998) (0)
- The Stanford Archival Vault: A reliable, long-term data archive (1999) (0)
- Project synopsis: evaluating STRIP (1996) (0)
- Implementing Multicast Data Distribution (0)
- Title, Foreword (1980) (0)
- Scientific Journals: Extinction or Explosion? (Panel) (1995) (0)
- Proceedings of the 1990 ACM SIGMOD International Conference on Management of Data, May 23-25, 1990, Atlantic City, NJ (1990) (0)
- What is Workflow and Who needs it? (1993) (0)
- Recent Advances in Multimedia Systems Research (1999) (0)
- Data Leakag e Detection (2011) (0)
- Human Processing ( Position Paper ) (2010) (0)
- Cl Event Dissemination in High-Mobility Ad-hoc Networks (2008) (0)
- 6. References [spec87] A. Z. Spector Et Al, ''camelot: a Distributed Transaction Facility for Mach and the Internet — Figure 7: Server Module 3. Layer 1 of the Model 2.1 Examples for Layer 0 Coordinating Multi-transaction Activities (1990) (0)
- Soft Real-Time Communication Over Dual Non-Real-Time Networks (1993) (0)
- ACHIEVING HIGH A V AILABILITY IN DISTRIBUTED DATABASES (1987) (0)
- Bibliometric Landscape of the ACM Digital Library (2005) (0)
- No . STAN-CS-80-787 April 1980 (1998) (0)
- WebBase and the Stanford InterLib Project (Extended Abstract) (2001) (0)
- WWW and the Internet - Did We Miss the Boat? (Panel) (1998) (0)
- Semantic Information and Web based Product Recommendation System – A Novel Approach (2020) (0)
- Recipe-XML Change Detection Use Case (2019) (0)
- A path-based approach for web page retrieval (2011) (0)
- Evolving Source Interfaces over the WebRamana Yerneni (2007) (0)
- IS1 Collected Scientific Publications: 2004 - 2007 References (2007) (0)
- Incremental entity resolution on rules and data (2013) (0)
- Blasting in Chord (0)
- 4 Extension of the Parallel Merge Protocol to the General Case (0)
- Adaptive Algorithms for Set Containment Joins (Technical Report) (2001) (0)
This paper list is powered by the following services: