Magdalena Bałazińska
#83,897
Most Influential Person Now
Computer scientist
Magdalena Bałazińska's AcademicInfluence.com Rankings
Magdalena Bałazińskacomputer-science Degrees
Computer Science
#3375
World Rank
#3541
Historical Rank
Database
#7003
World Rank
#7247
Historical Rank
Download Badge
Computer Science
Magdalena Bałazińska's Degrees
- PhD Computer Science Stanford University
- Masters Computer Science Stanford University
- Bachelors Computer Science University of Warsaw
Similar Degrees You Can Earn
Why Is Magdalena Bałazińska Influential?
(Suggest an Edit or Addition)According to Wikipedia, Magdalena Bałazińska is a computer scientist whose research concerns databases and data streams. Born in Poland and educated in Algeria, Canada, and the US, she works at the University of Washington, where she directs the Paul G. Allen School of Computer Science & Engineering.
Magdalena Bałazińska's Published Works
Published Works
- The Design of the Borealis Stream Processing Engine (2005) (1582)
- HaLoop: Efficient Iterative Data Processing on Large Clusters (2010) (874)
- Building the Internet of Things Using RFID: The RFID Ecosystem Experience (2009) (680)
- Scalable Distributed Stream Processing (2003) (629)
- Characterizing mobility and network usage in a corporate wireless local-area network (2003) (559)
- SkewTune: mitigating skew in mapreduce applications (2012) (494)
- Fault-tolerance in the borealis distributed stream processing system (2005) (386)
- High-availability algorithms for distributed stream processing (2005) (304)
- INS/Twine: A Scalable Peer-to-Peer Architecture for Intentional Resource Discovery (2002) (298)
- Data Management in the Worldwide Sensor Web (2007) (222)
- Event queries on correlated probabilistic streams (2008) (221)
- Advanced clone-analysis to support object-oriented system refactoring (2000) (212)
- ParaTimer: a progress indicator for MapReduce DAGs (2010) (202)
- A Demonstration of SciDB: A Science-Oriented DBMS (2009) (194)
- SnipSuggest: Context-Aware Autocompletion for SQL (2010) (183)
- The Aurora and Medusa Projects (2003) (175)
- Infranet: Circumventing Web Censorship and Surveillance (2002) (169)
- Contract-Based Load Management in Federated Distributed Systems (2004) (165)
- Skew-resistant parallel processing of feature-extracting scientific user-defined functions (2010) (163)
- Retrospective on Aurora (2004) (162)
- Measuring clone based reengineering opportunities (1999) (151)
- The HaLoop approach to large-scale iterative data analysis (2012) (150)
- Estimating the progress of MapReduce pipelines (2010) (149)
- Query-Based Data Pricing (2015) (142)
- Learning State Representations for Query Optimization with Deep Reinforcement Learning (2018) (136)
- ArrayStore: a storage manager for complex parallel array processing (2011) (136)
- Towards correcting input data errors probabilistically using integrity constraints (2006) (130)
- The Beckman Report on Database Research (2014) (129)
- Data Markets in the Cloud: An Opportunity for the Database Community (2011) (121)
- A Demonstration of the BigDAWG Polystore System (2015) (120)
- From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System (2015) (111)
- A Study of Skew in MapReduce Applications (2011) (110)
- Partial redesign of Java software systems based on clone analysis (1999) (98)
- Demonstration of the Myria big data management service (2014) (97)
- A Case for A Collaborative Query Management System (2009) (97)
- An analysis of Hadoop usage in scientific workloads (2013) (93)
- Scalable Clustering Algorithm for N-Body Simulations in a Shared-Nothing Cluster (2010) (87)
- Fault-tolerant stream processing using a distributed, replicated file system (2008) (83)
- Toward practical query pricing with QueryMarket (2013) (80)
- Analyzing massive astrophysical datasets: Can Pig/Hadoop or a relational DBMS help? (2009) (77)
- Cascadia: A System for Specifying, Detecting, and Managing RFID Events (2008) (76)
- The Myria Big Data Management and Analytics System and Cloud Services (2017) (76)
- Homeviews: peer-to-peer middleware for personal data sharing applications (2007) (70)
- Building the Internet of Things Using Rfid (2009) (68)
- PerfXplain: Debugging MapReduce Job Performance (2012) (68)
- Astronomy in the Cloud: Using MapReduce for Image Co-Addition (2010) (67)
- Load management and high availability in the Medusa distributed stream processing system (2004) (66)
- Probabilistic Event Extraction from RFID Data (2008) (66)
- Asynchronous and Fault-Tolerant Recursive Datalog Evaluation in Shared-Nothing Engines (2015) (66)
- Support the Data Enthusiast: Challenges for Next-Generation Data-Analysis Systems (2014) (64)
- Access Methods for Markovian Streams (2009) (61)
- Thwarting Web Censorship with Untrusted Messenger Discovery (2003) (60)
- Moirae: History-Enhanced Monitoring (2007) (59)
- Comparative Evaluation of Big-Data Systems on Scientific Image Analytics Workloads (2016) (57)
- Challenges for Pervasive RFID-Based Infrastructures (2007) (56)
- Physical Access Control for Captured RFID Data (2007) (55)
- The Seattle Report on Database Research (2020) (54)
- Managing Skew in Hadoop (2013) (53)
- A latency and fault-tolerance optimizer for online parallel query plans (2011) (52)
- Believe It or Not: Adding Belief Annotations to Databases (2009) (50)
- PipeGen: Data Pipe Generator for Hybrid Analytics (2016) (46)
- QueryMarket Demonstration: Pricing for Online Data Markets (2012) (43)
- Pessimistic Cardinality Estimation: Tighter Upper Bounds for Intermediate Join Cardinalities (2019) (41)
- The Beckman report on database research (2016) (41)
- An Empirical Analysis of Deep Learning for Cardinality Estimation (2019) (40)
- How to Price Shared Optimizations in the Cloud (2012) (38)
- Big data research (2015) (36)
- Hadoop ’ s Adolescence : A Comparative Workload Analysis from Three Research Clusters (2012) (36)
- Longitudinal study of a building-scale RFID ecosystem (2009) (36)
- Hadoop's Adolescence (2013) (35)
- PEEX : Extracting Probabilistic Events from RFID Data (2007) (35)
- Time travel in a scientific array database (2013) (35)
- Changing the Face of Database Cloud Services with Personalized Service Level Agreements (2015) (32)
- SkewTune in Action: Mitigating Skew in MapReduce Applications (2012) (32)
- Data markets in the cloud (2011) (31)
- The Aurora and Borealis Stream Processing Engines (2016) (30)
- Public Data and Visualizations: How are Many Eyes and Tableau Public Used for Collaborative Analytics? (2014) (30)
- Biology and data-intensive scientific discovery in the beginning of the 21st century. (2011) (30)
- Automatic Enforcement of Data Use Policies with DataLawyer (2015) (30)
- Automated detection of glaucoma with interpretable machine learning using clinical data and multi-modal retinal images. (2021) (29)
- Probabilistic RFID Data Management (2007) (28)
- Session-Based Browsing for More Effective Query Reuse (2011) (27)
- A Comparison of Stream-Oriented High-Availability Algorithms (2003) (26)
- PerfEnforce Demonstration: Data Analytics with Performance Guarantees (2016) (26)
- Machine Learning and Databases: The Sound of Things to Come or a Cacophony of Hype? (2015) (25)
- Abstract: Hadoop's Adolescence; A Comparative Workloads Analysis from Three Research Clusters (2012) (25)
- Astronomical Image Processing with Hadoop (2011) (24)
- Efficient iterative processing in the SciDB parallel array engine (2015) (23)
- Query-based data pricing (2012) (23)
- Cuttlefish: A Lightweight Primitive for Adaptive Query Processing (2018) (23)
- A Discussion on Pricing Relational Data (2013) (23)
- Price-Optimal Querying with Data APIs (2016) (22)
- LightDB: A DBMS for Virtual Reality Video (2018) (21)
- CRAWDAD dataset ibm/watson (v.2003-02-19) (2003) (21)
- A vision for personalized service level agreements in the cloud (2013) (21)
- HaLoop (2010) (20)
- Fault-Tolerance and High Availability in Data Stream Management Systems (2009) (20)
- Approximation trade-offs in Markovian stream processing: An empirical study (2010) (19)
- Elastic Memory Management for Cloud Data Analytics (2017) (19)
- Visual Road: A Video Data Management Benchmark (2019) (19)
- Probabilistic Database Summarization for Interactive Data Exploration (2017) (18)
- Federated Database Systems (2009) (18)
- Specification and Verification of Complex Location Events with Panoramic (2010) (17)
- TASM: A Tile-Based Storage Manager for Video Analytics (2020) (16)
- SLAOrchestrator: Reducing the Cost of Performance SLAs for Cloud Data Analytics (2018) (16)
- Clustering Events on Streams Using Complex Context Information (2008) (15)
- Squeezing a Big Orange into Little Boxes: The AscotDB System for Parallel Processing of Data on a Sphere (2013) (15)
- Challenges for Event Queries over Markovian Streams (2008) (14)
- On-Demand View Materialization and Indexing for Network Forensic Analysis (2007) (14)
- Fault-tolerance and load management in a distributed stream processing system (2005) (13)
- Perceptual Compression for Video Storage and Processing Systems (2019) (12)
- Lahar Demonstration: Warehousing Markovian Streams (2009) (11)
- PerfEnforce: A Dynamic Scaling Engine for Analytics with Performance Guarantees (2016) (10)
- VisualWorldDB: A DBMS for the Visual World (2020) (10)
- Toward elastic memory management for cloud data analytics (2016) (10)
- Multilabel multiclass classification of OCT images augmented with age, gender and visual acuity data (2018) (9)
- Designing good algorithms for MapReduce and beyond (2012) (9)
- VisualCloud Demonstration: A DBMS for Virtual Reality (2017) (9)
- Gaussian Mixture Models Use-Case: In-Memory Analysis with Myria (2015) (9)
- Expressing Privacy Policies Using Authorization Views (2007) (9)
- A Demonstration of Iterative Parallel Array Processing in Support of Telescope Image Analysis (2013) (9)
- Automated detection of glaucoma with interpretable machine learning using clinical data and multi-modal retinal images (2020) (9)
- Sample Debiasing in the Themis Open World Database System (2020) (9)
- EntropyDB: a probabilistic approach to approximate query processing (2019) (8)
- 2009 IEEE International Conference on Cluster Computing and Workshops (2009) (7)
- Hybrid merge/overlap execution technique for parallel array processing (2011) (7)
- The Next 5 Years: What Opportunities Should the Database Community Seize to Maximize its Impact? (2020) (7)
- Design Issues for Second Generation Stream Processing Engines (2005) (6)
- Toward Sampling for Deep Learning Model Diagnosis (2020) (6)
- Big-Data Management Use-Case: A Cloud Service for Creating and Analyzing Galactic Merger Trees (2014) (6)
- A demonstration of Cascadia through a digital diary application (2008) (6)
- Poster: Hadoop's Adolescence; A Comparative Workloads Analysis from Three Research Clusters (2012) (6)
- VOCAL: Video Organization and Interactive Compositional AnaLytics (2022) (5)
- Proceedings of the Sixth ACM Symposium on Cloud Computing (2015) (5)
- A Measurement Study of Two Web-based Collaborative Visual Analytics Systems (2012) (5)
- Stop That Query! The Need for Managing Data Use (2013) (5)
- Availability-Consistency Trade-Offs in a Fault-Tolerant Stream Processing System (2004) (5)
- Fuzzy MCDM (2009) (5)
- Proceedings of the 25th International Conference on Scientific and Statistical Database Management (2013) (4)
- Approximation trade-offs in a Markovian stream warehouse: An empirical study (2014) (4)
- Deluceva: Delta-Based Neural Network Inference for Fast Video Analytics (2020) (4)
- A Demonstration of Interactive Analysis of Performance Measurements with Viska (2017) (4)
- False Negative Rate (2009) (3)
- Enabling Computer and Information Science and Engineering Research and Education in the Cloud (2018) (3)
- Mosaic: A Sample-Based Database System for Open World Query Processing (2019) (3)
- Beyond MapReduce: New Requirements for Scalable Data Processing (2012) (3)
- Sensor Data Stream Exploration for Monitoring Applications (2011) (3)
- Fault Tolerance and High Availability in Data Stream Management Systems (2018) (3)
- The power of data use management in action (2013) (3)
- Capability-Based Access Control for Peer-to-Peer Data Sharing (2006) (3)
- SQB : Session-based Query Browsing for More Effective Query Reuse (2011) (3)
- Automated Analysis of Muscle X-ray Diffraction Imaging with MCMC (2015) (3)
- DeepEverest: Accelerating Declarative Top-K Queries for Deep Neural Network Interpretation Technical Report (2021) (2)
- Sampling for Deep Learning Model Diagnosis (Technical Report) (2020) (2)
- Lahar: warehousing markovian streams (2010) (2)
- Affordable Analytics on Expensive Data (2014) (2)
- A study on eye movement strategies for a depth discrimination task in a "pseudo" natural context (1996) (2)
- VSS: A Storage System for Video Analytics [Technical Report] (2021) (2)
- The Medusa Distributed Stream-Processing System (2003) (2)
- Lineage for Markovian stream event queries (2011) (2)
- The case for being lazy: how to leverage lazy evaluation in MapReduce (2011) (2)
- Cascadia (2008) (2)
- Degree Sequence Bound For Join Cardinality Estimation (2022) (2)
- The Seattle report on database research (2022) (2)
- View-Driven Deduplication with Active Learning (2016) (1)
- Education and career paths for data scientists (2013) (1)
- LightDB (2018) (1)
- Fully Temporal Relation (2009) (1)
- Towards Efficient and Precise Queries over Ten Million Asteroid Trajectory Models (2011) (1)
- Fact-Oriented Modeling (2009) (1)
- Winds from seattle (2020) (1)
- PipeGen (2016) (1)
- Support the data enthusiast (2014) (1)
- Big Data Research: Will Industry Solve all the Problems? (2015) (1)
- Specification, Detection, and Notification of RFID Events with Cascadia (2008) (1)
- Leveraging Usage History to Enhance Database Usability (2012) (1)
- Sampling for Deep Learning Model Diagnosis (2020) (1)
- Fault Tolerant Applications (2009) (1)
- Systems aspects of probabilistic data management (2008) (1)
- VOCALExplore: Pay-as-You-Go Video Data Exploration and Model Building (2023) (0)
- Physical Access Control for Captured Rfid Data Privacy and Utility in Pervasive Architectures (0)
- Session details: Systems and prototypes (2008) (0)
- Creating a Desktop Search Application That Utilizes RFID Ecosystem © and Google Desktop © (2008) (0)
- The HaLoop approach to large-scale iterative data analysis (2012) (0)
- WCRE 2000 Most Influential Paper (2010) (0)
- Demo Program Committee (2005) (0)
- ASTROstream: Automated claSsification of Transient astRonomical phenOmena in the streaming mode (2019) (0)
- The Science of Cloud Computing – PI Meeting Application (2011) (0)
- EntropyDB: a probabilistic approach to approximate query processing (2019) (0)
- Message from the RFDM'80 general co-chairs (2008) (0)
- Service Front-End PSLAManager System Model PerfEnforce Query Scheduling Cluster Provisioning Data Ingest (2018) (0)
- Welcome message from the socc chairs (2015) (0)
- Cloud Data Systems: What are the Opportunities for the Database Research Community? (2022) (0)
- 43 Query-Based Data Pricing (2015) (0)
- Keynote: Research with Real Users (2017) (0)
- UW-CSE-1203-02 Query-Based Data Pricing (2012) (0)
- Editorial for S.I.: VLDB 2020 (2022) (0)
- Proceedings of the 5th Workshop on Data Management for Sensor Networks, in conjunction with VLDB, DMSN 2008, Auckland, New Zealand, August 24, 2008 (2008) (0)
- DeepEverest: Accelerating Declarative Top-K Queries for Deep Neural Network Interpretation (2021) (0)
- The database group at the University of Washington (2014) (0)
- Session details: Industrial session 2: exploiting new hardware (2009) (0)
- Congratulations! You Have Become a Senior Researcher. Now What? (2022) (0)
- Run-Length Encoding Markovian Streams (2010) (0)
- Data management tools for scientific analytics. (2011) (0)
- SafeBound: A Practical System for Generating Cardinality Bounds (2022) (0)
- Enabling end-user specification and debugging of complex events for location systems (2010) (0)
- Demonstration of Apperception: A Database Management System for Geospatial Video Data (2021) (0)
- Report on the Fourth International Workshop on Data Management for Sensor Networks (DMSN 2007) (2007) (0)
- Toward A Progress Indicator for Parallel Queries * (2009) (0)
- EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions [Technical Report] (2023) (0)
- Front matter (2006) (0)
- Apache Spark (2020) (0)
- Cloud data systems (2022) (0)
- SkewTune in action (2012) (0)
- Toward Supporting the Data Enthusiast : Unlocking the Potential of Data for Analysis (2012) (0)
- Running N-body Use Cases on Myria by (2014) (0)
- RFID Event Specification Using Templates in Scenic and Event Notification (2008) (0)
- Letter from the new SIGMOD officers (2013) (0)
- ParaTimer (2010) (0)
- The DB Community vis-à-vis Environmental, Health, and Societal Grand Challenges: Innovation Engine, Plumber, or Bystander? (2022) (0)
- MaskSearch: Querying Image Masks at Scale (2023) (0)
- Message from the DMSN'08 organizing committee (2008) (0)
- Databases meet the stream processing era (2018) (0)
- A Visual Cloud for Virtual Reality Applications (2017) (0)
- Session details: University of Washington (2010) (0)
This paper list is powered by the following services:
Other Resources About Magdalena Bałazińska
What Schools Are Affiliated With Magdalena Bałazińska?
Magdalena Bałazińska is affiliated with the following schools: