Ihab Ilyas
#68,474
Most Influential Person Now
Canadian-Egyptian computer scientist
Ihab Ilyas's AcademicInfluence.com Rankings
Ihab Ilyas Computer Science Degrees
- Computer Science: World Rank #2556, Historical Rank #2669
- Big Data: World Rank #29, Historical Rank #29
- Information Technology: World Rank #81, Historical Rank #82
- Database: World Rank #5891, Historical Rank #6110
Computer Science
Ihab Ilyas's Degrees
- PhD Computer Science Purdue University
- Masters Computer Science Purdue University
- Bachelors Computer Science Cairo University
Why Is Ihab Ilyas Influential?
According to Wikipedia, Ihab Francis Ilyas is a computer scientist who works in data science. He is currently a professor of computer science in the David R. Cheriton School of Computer Science at the University of Waterloo, where he holds the Thomson Reuters-NSERC Industrial Research Chair in Data Cleaning. He also leads the Knowledge Platform team at Apple Inc.
Ihab Ilyas's Published Works
- A survey of top-k query processing techniques in relational database systems (2008) (918)
- Top-k Query Processing in Uncertain Databases (2007) (471)
- HoloClean: Holistic Data Repairs with Probabilistic Inference (2017) (343)
- CORDS: automatic discovery of correlations and soft functional dependencies (2004) (338)
- RankSQL: query algebra and optimization for relational top-k queries (2005) (314)
- NADEEF: a commodity data cleaning system (2013) (290)
- Holistic data cleaning: Putting violations into context (2013) (289)
- Data Cleaning: Overview and Emerging Challenges (2016) (256)
- KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing (2015) (254)
- Data Curation at Scale: The Data Tamer System (2013) (216)
- Guided data repair (2011) (213)
- Discovering Denial Constraints (2013) (192)
- Detecting Data Errors: Where are we and what needs to be done? (2016) (177)
- Efficient search for the top-k probable nearest neighbors in uncertain databases (2008) (160)
- Supporting top-k join queries in relational databases (2004) (157)
- BigDansing: A System for Big Data Cleansing (2015) (148)
- Rank-aware query optimization (2004) (138)
- Nile: a query processing engine for data streams (2004) (138)
- Trends in Cleaning Relational Data: Consistency and Deduplication (2015) (137)
- The Data Civilizer System (2017) (130)
- Sampling the repairs of functional dependency violations under hard constraints (2010) (119)
- Joining Ranked Inputs in Practice (2002) (105)
- Ranking with Uncertain Scores (2009) (91)
- Probabilistic top-k and ranking-aggregate queries (2008) (88)
- Data Integration: The Current Status and the Way Forward (2018) (86)
- Seeping Semantics: Linking Datasets Using Word Embeddings for Data Discovery (2018) (82)
- Supporting ranking queries on uncertain and incomplete data (2010) (80)
- On the relative trust between inconsistent data and inaccurate constraints (2012) (79)
- Supporting ad-hoc ranking aggregates (2006) (77)
- HoloDetect: Few-Shot Learning for Error Detection (2019) (77)
- XSEED: Accurate and Fast Cardinality Estimation for XPath Queries (2006) (76)
- Creating Competitive Products (2009) (75)
- Expressive and flexible access to web-extracted data: a keyword-based structured query language (2010) (73)
- SP-GiST: An Extensible Database Index for Supporting Space Partitioning Trees (2001) (67)
- CLAMS: Bringing Quality to Data Lakes (2016) (66)
- Distributed Data Deduplication (2016) (66)
- Interpreting keyword queries over web knowledge bases (2012) (65)
- Adaptive rank-aware query optimization in relational databases (2006) (63)
- Ranking with uncertain scoring functions: semantics and sensitivity measures (2011) (62)
- Descriptive and prescriptive data cleaning (2014) (58)
- Top-k Nearest Neighbor Search In Uncertain Data Series (2014) (57)
- DataXFormer: A robust transformation discovery system (2016) (56)
- FIX: feature-based indexing technique for XML documents (2006) (51)
- Learning to identify relevant studies for systematic reviews using random forest and external information (2016) (49)
- Modeling and Querying Possible Repairs in Duplicate Detection (2009) (46)
- Benchmarking Smart Meter Data Analytics (2015) (44)
- Sampling from repairs of conditional functional dependency violations (2014) (43)
- NADEEF: A Generalized Data Cleaning System (2013) (37)
- Smart Meter Data Analytics (2016) (35)
- A Formal Framework For Probabilistic Unclean Databases (2018) (33)
- KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing (2015) (33)
- RankSQL: Supporting Ranking Queries in Relational Database Management Systems (2005) (33)
- A Study of Ontology-based Query Expansion (2011) (32)
- Data Quality: The Role of Empiricism (2018) (31)
- A Demo of the Data Civilizer System (2017) (31)
- SMAS: A smart meter data analytics system (2015) (31)
- Attention-based Learning for Missing Data Imputation in HoloClean (2020) (29)
- Qualitative Data Cleaning (2016) (29)
- APEx: Accuracy-Aware Differentially Private Data Exploration (2017) (29)
- RuleMiner: Data quality rules discovery (2014) (28)
- Estimating compilation time of a query optimizer (2003) (28)
- Video query processing in the VDBMS testbed for video database research (2003) (27)
- NADEEF/ER: generic and interactive entity resolution (2014) (27)
- DataXFormer: An Interactive Data Transformation Tool (2015) (25)
- Discovering and Exploiting Statistical Properties for Query Optimization in Relational Databases: A Survey (2009) (25)
- Farewell Freebase: Migrating the SimpleQuestions Dataset to DBpedia (2018) (25)
- Dataxformer: Leveraging the Web for Semantic Transformations (2015) (24)
- Collecting and Maintaining Just-in-Time Statistics (2007) (23)
- An extensible index for spatial databases (2001) (23)
- URank: formulation and efficient evaluation of top-k queries in uncertain databases (2007) (23)
- Approximate Denial Constraints (2020) (22)
- A Video Database Management System for Advancing Video Database Research (2002) (19)
- Probabilistic Ranking Techniques in Relational Databases (2011) (19)
- StatAdvisor: Recommending Statistical Views (2009) (17)
- Kamino: Constraint-Aware Differentially Private Data Synthesis (2020) (17)
- Unsupervised String Transformation Learning for Entity Consolidation (2017) (16)
- Effective Data Cleaning with Continuous Evaluation (2016) (16)
- CORDS: Automatic Generation of Correlation Statistics in DB2 (2004) (14)
- Distributed Implementations of Dependency Discovery Algorithms (2019) (14)
- Matching Entities Across Different Knowledge Graphs with Graph Embeddings (2019) (13)
- A distributed database server for continuous media (2002) (13)
- Secure Multi-Party Functional Dependency Discovery (2019) (12)
- ProbClean: A probabilistic duplicate detection system (2010) (12)
- Distributed Discovery of Functional Dependencies (2019) (12)
- Editorial: Special Issue on Web Data Quality (2016) (12)
- InterJoin: Exploiting Indexes and Materialized Views in XPath Evaluation (2006) (11)
- VDBMS: A testbed facility for research in video database benchmarking (2004) (10)
- Dark Data: Are we solving the right problems? (2016) (10)
- Building Data Civilizer Pipelines with an Advanced Workflow Engine (2018) (10)
- Just-in-time information extraction using extraction views (2012) (9)
- Building ranked mashups of unstructured sources with uncertain information (2010) (9)
- Rank-aware query processing and optimization (2005) (9)
- Rank-Join Algorithms for Search Computing (2009) (9)
- Scalable Knowledge Graph Construction from Text Collections (2019) (8)
- Entity Consolidation: The Golden Record Problem (2017) (8)
- MashRank: Towards uncertainty-aware and rank-aware mashups (2010) (8)
- LONLIES: Estimating Property Values for Long Tail Entities (2016) (8)
- Semi-supervised clustering for de-duplication (2018) (8)
- Automatic relationship discovery in self-managing database systems (2004) (8)
- Supporting Top-k Join Queries in Relational Databases (2003) (7)
- Ember: No-Code Context Enrichment via Similarity-Based Keyless Joins (2021) (7)
- Principles of Progress Indicators for Database Repairing (2019) (7)
- Finding Skyline and Top-k Bargaining Solutions (2007) (7)
- Properties of Inconsistency Measures for Databases (2019) (6)
- Saga: A Platform for Continuous Construction and Serving of Knowledge at Scale (2022) (6)
- Approximate Inference in Structured Instances with Noisy Categorical Observations (2019) (6)
- PSALM: Cardinality Estimation in the Presence of Fine-Grained Access Controls (2009) (6)
- Distributed Dependency Discovery (2019) (5)
- LOT: A Robust Overlay for Distributed Range Query Processing (2006) (5)
- Private Exploration Primitives for Data Cleaning (2017) (5)
- HoloDetect (2019) (5)
- QUICK: Expressive and Flexible Search over Knowledge Bases and Text Collections (2010) (5)
- A Semi-Supervised Framework of Clustering Selection for De-Duplication (2019) (4)
- Record fusion: A learning approach (2020) (4)
- Machine Learning and Data Cleaning: Which Serves the Other? (2022) (4)
- ExplIQuE: Interactive Databases Exploration with SQL (2019) (4)
- Efficient Processing of Ad-Hoc Top-k Aggregate Queries in OLAP (2005) (3)
- APEx (2019) (3)
- Real-Time LSM-Trees for HTAP Workloads (2021) (3)
- A framework for supporting the class of space partitioning trees (2001) (2)
- JTop Algorithms for Top-k Join Queries (2008) (2)
- QUICK: Queries Using Inferred Concepts from Keywords (Technical Report CS-2009-18) (2009) (2)
- PCOR: Private Contextual Outlier Release via Differentially Private Search (2021) (2)
- Data unification at scale: data tamer (2018) (2)
- We are drowning in a sea of least publishable units (LPUs) (2013) (2)
- Modeling Uncertainty in Duplicate Elimination (2008) (1)
- Guest editorial: special issue on ranking in databases (2009) (1)
- Report on the First International Workshop on Ranking in Databases (DBRank'07) (2007) (1)
- Machine learning and probabilistic data cleaning (2019) (1)
- Data quality rule definition and discovery (2019) (1)
- Skyline and Top-k Processing in Web Bargaining (2006) (1)
- The data analytics group at the qatar computing research institute (2013) (1)
- Rule-based data cleaning (2019) (1)
- Technical Report: Optimizing Human Involvement for Entity Matching and Consolidation (2019) (1)
- Uncertainty in Rank Join (2010) (0)
- Working Group: Lineage/Provenance (2008) (0)
- Top-k Queries (2018) (0)
- Learning to identify relevant studies for systematic reviews using random forest and external information (2015) (0)
- Ember (2021) (0)
- Building Scalable Machine Learning Solutions for Data Cleaning (2019) (0)
- Probabilistic Web Data Management (2013) (0)
- InterJoin: Exploiting Materialized Views in XML Query Processing (0)
- Introduction (2019) (0)
- Working Group: Classification, Representation and Modeling (2009) (0)
- Conclusion and future thoughts (2019) (0)
- SIGMOD officers, committees, and awardees (2018) (0)
- Session details: Research session 21: entity matching (2014) (0)
- Knowledge Graph Imputation (2021) (0)
- 08421 Working Group: Lineage/Provenance (2008) (0)
- EMMA - Workshop Chairs (2005) (0)
- Data deduplication (2019) (0)
- An Efficient Duplication Record Detection Algorithm for Data Cleansing (2018) (0)
- Reminiscences on influential papers (2004) (0)
- Preface (2019) (0)
- High-Throughput Vector Similarity Search in Knowledge Graphs (2023) (0)
- Sampling from repairs of conditional functional dependency violations (2013) (0)
- SIGMOD Executive Committee (2020) (0)
- Provenance in Collaborative in Silico Scientific Research: A Survey (2020) (0)
- Welcome to the December 2018 issue of the ACM SIGMOD Record! (2019) (0)
- Data transformation (2019) (0)
- Outlier detection (2019) (0)
- ExplIQuE (2019) (0)
- Message from the DBRANK'08 program co-chairs (2008) (0)
- Batchwise Probabilistic Incremental Data Cleaning (2020) (0)
- Editorial (2020) (0)
- Rank-Aware Query Processing (2018) (0)
- Smart Meter Data Analytics (2015) (0)
- Working Group Report: Lineage/Provenance (2008) (0)
- PSALM: Accurate Sampling for Cardinality Estimation in a Multi-user Environment (2007) (0)
- 08421 Working Group: Classification, Representation and Modeling (2009) (0)
- Trends in Rank Join (2010) (0)
- Semi-supervised clustering for deduplication (2018) (0)
- Kamino (2021) (0)
- References (2019) (0)
- Supplementary: Semi-supervised clustering for deduplication (2019) (0)
- On sampling from data with duplicate records (2020) (0)
What Schools Are Affiliated With Ihab Ilyas?
Ihab Ilyas is affiliated with the following schools: