Michael O'Boyle
#43,355
Most Influential Person Now
British academic
Michael O'Boyle's AcademicInfluence.com Rankings
Michael O'Boylecomputer-science Degrees
Computer Science
#1881
World Rank
#1955
Historical Rank
Database
#7693
World Rank
#7993
Historical Rank
Download Badge
Computer Science
Why Is Michael O'Boyle Influential?
(Suggest an Edit or Addition)According to Wikipedia, Michael O'Boyle is a professor of Computing and Director of the Institute for Computing Systems Architecture at the University of Edinburgh School of Informatics. Education O'Boyle received a Master of Science degree in computer science from the University of Manchester in 1990. He completed his PhD at the University of Manchester in 1992 under the supervision of John Gurd.
Michael O'Boyle's Published Works
Published Works
- Using machine learning to focus iterative optimization (2006) (455)
- Rapidly Selecting Good Compiler Optimizations using Performance Counters (2007) (260)
- Combined Selection of Tile Sizes and Unroll Factors Using Iterative Compilation (2000) (248)
- Towards a holistic approach to auto-parallelization: integrating profile-driven parallelism detection and machine-learning based mapping (2009) (247)
- Milepost GCC: Machine Learning Enabled Self-tuning Compiler (2011) (240)
- A Static Task Partitioning Approach for Heterogeneous Systems Using OpenCL (2011) (223)
- Mapping parallelism to multi-cores: a machine learning based approach (2009) (216)
- Automatic Feature Generation for Machine Learning Based Optimizing Compilation (2009) (152)
- Portable mapping of data parallel programs to OpenCL for heterogeneous systems (2013) (152)
- Machine Learning in Compiler Optimization (2018) (140)
- Introducing SLAMBench, a performance and accuracy benchmarking methodology for SLAM (2014) (138)
- MILEPOST GCC: machine learning based research compiler (2008) (133)
- Smart multi-task scheduling for OpenCL programs on CPU/GPU heterogeneous platforms (2014) (128)
- Partitioning streaming parallelism for multi-cores: A machine learning based approach (2010) (119)
- Portable compiler optimisation across embedded programs and microarchitectures using machine learning (2009) (117)
- Evaluating Iterative Compilation (2002) (116)
- Method-specific dynamic compilation using logistic regression (2006) (115)
- Automatic performance model construction for the fast software exploration of new hardware designs (2006) (101)
- IATAC: a smart predictor to turn-off L2 cache lines (2005) (98)
- Fast compiler optimisation evaluation using code-feature based performance prediction (2007) (94)
- Microarchitectural Design Space Exploration Using an Architecture-Centric Approach (2007) (93)
- A Predictive Model for Dynamic Microarchitectural Adaptivity Control (2010) (92)
- Non-singular data transformations: definition, validity and applications (1997) (86)
- A Feasibility Study in Iterative Compilation (1999) (86)
- A Practical Method for Quickly Evaluating Program Optimizations (2005) (83)
- Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels (2020) (81)
- Iterative Compilation in Program Optimization (2000) (81)
- A large-scale cross-architecture evaluation of thread-coarsening (2013) (81)
- Probabilistic source-level optimisation of embedded programs (2005) (79)
- OpenCL Task Partitioning in the Presence of GPU Contention (2013) (73)
- Integrating profile-driven parallelism detection and machine-learning-based mapping (2014) (72)
- Automatic Tuning of Inlining Heuristics (2005) (71)
- Automatic optimization of thread-coarsening for graphics processors (2014) (69)
- The effect of cache models on iterative compilation for combined tiling and unrolling (2004) (68)
- Compiler Reduction of Invalidation Traffic in Virtual Shared Memory Systems (1996) (67)
- MiDataSets: Creating the Conditions for a More Realistic Evaluation of Iterative Optimization (2007) (64)
- Smart, adaptive mapping of parallelism in the presence of external workload (2013) (63)
- Integrating loop and data transformations for global optimisation (1998) (62)
- High-Performance Embedded Architecture and Compilation Roadmap (2007) (60)
- Automatic and Portable Mapping of Data Parallel Programs to OpenCL for GPU-Based Heterogeneous Systems (2014) (58)
- Iterative Compilation (2002) (55)
- A Data Partitioning Algorithm for Distributed Memory Compilation (1994) (53)
- Array recovery and high-level transformations for DSP applications (2003) (52)
- A workload-aware mapping approach for data-parallel programs (2011) (51)
- OCEANS: Optimizing Compilers for Embedded Applications (1998) (51)
- SLAMBench2: Multi-Objective Head-to-Head Benchmarking for Visual SLAM (2018) (50)
- A Compiler Strategy for Shared Virtual Memories (1995) (49)
- Adaptive java optimisation using instance-based learning (2004) (46)
- Integrating algorithmic parameters into benchmarking and design space exploration in 3D scene understanding (2016) (43)
- Integrating Loop and Data Transformations for Global Optimization (2002) (42)
- ProGraML: A Graph-based Program Representation for Data Flow Analysis and Compiler Optimizations (2021) (41)
- Using machine learning to partition streaming programs (2013) (41)
- Synchronization Minimization in a SPMD Execution Model (1995) (36)
- Evaluating the Effects of Compiler Optimisations on AVF (2008) (35)
- Hybrid Optimizations: Which Optimization Algorithm to Use? (2006) (35)
- Characterising Across-Stack Optimisations for Deep Convolutional Neural Networks (2018) (35)
- Embedded Processor Design Challenges (2002) (35)
- Merge or Separate?: Multi-job Scheduling for OpenCL Kernels on CPU/GPU Platforms (2017) (34)
- Celebrating diversity: a mixture of experts approach for runtime mapping in dynamic environments (2015) (34)
- Compiler directed early register release (2005) (33)
- Navigating the Landscape for Real-Time Localization and Mapping for Robotics and Virtual and Augmented Reality (2018) (32)
- Exploring and predicting the architecture/optimising compiler co-design space (2008) (31)
- High Performance Embedded Architectures and Compilers (2008) (31)
- Portable and transparent software managed scheduling on accelerators for fair resource sharing (2016) (30)
- Reducing Training Time in a One-Shot Machine Learning-Based Compiler (2009) (30)
- Compiler Transformation of Pointers to Explicit Array Accesses in DSP Applications (2001) (29)
- Data alignment: transformations to reduce communication on distributed memory architectures (1992) (29)
- SLAMBench 3.0: Systematic Automated Reproducible Evaluation of SLAM Systems for Robot Vision Challenges and Scene Understanding (2019) (29)
- Performance Aware Convolutional Neural Network Channel Pruning for Embedded GPUs (2019) (28)
- Pruning neural networks: is it time to nip it in the bud? (2018) (27)
- Compiler reduction of synchronisation in shared virtual memory systems (1995) (27)
- Raced profiles: efficient selection of competing compiler optimizations (2009) (26)
- Partitioning data-parallel programs for heterogeneous MPSoCs: time and energy design space exploration (2014) (26)
- Energy-efficient register caching with compiler assistance (2009) (26)
- Prodigy: Improving the Memory Latency of Data-Indirect Irregular Workloads Using Hardware-Software Co-Design (2021) (25)
- Software directed issue queue power reduction (2005) (25)
- Automatic Matching of Legacy Code to Heterogeneous APIs: An Idiomatic Approach (2018) (24)
- Nonsingular Data Transformations: Definition, Validity, and Applications (1999) (23)
- A Closer Look at Structured Pruning for Neural Network Compression (2018) (23)
- Program and data transformations for efficient execution on distributed memory architectures (1993) (21)
- Quick and Practical Run-Time Evaluation of Multiple Program Optimizations (2007) (21)
- Feedback Assisted Iterative Compilation (2000) (21)
- Compile Time Barrier Synchronization Minimization (2002) (20)
- The effect of cache models on iterative compilation for combined tiling and unrolling: Research Articles (2004) (20)
- Exploitation of GPUs for the Parallelisation of Probably Parallel Legacy Code (2014) (19)
- Optimizing Grouped Convolutions on Edge Devices (2020) (18)
- Accelerating Deep Neural Networks on Low Power Heterogeneous Architectures (2018) (17)
- A fast and accurate method for determining a lower bound on execution time (2004) (16)
- MaxPair: Enhance OpenCL Concurrent Kernel Execution by Weighted Maximum Matching (2018) (16)
- A graph based approach to barrier synchronisation minimisation (1997) (16)
- Discovery and exploitation of general reductions: A constraint based approach (2017) (16)
- Transmuter: Bridging the Efficiency Gap using Memory and Dataflow Reconfiguration (2020) (15)
- Portable and Transparent Host-Device Communication Optimization for GPGPU Environments (2014) (15)
- MARS: A Distributed Memory Approach to Shared Memory Compilation (1998) (15)
- DATE '08 Proceedings of the conference on Design, automation and test in Europe (2008) (14)
- Expert Programmer versus Parallelizing Compiler: A Comparative Study of Two Approaches for Distributed Shared Memory (1996) (14)
- Instruction Cache Energy Saving Through Compiler Way-Placement (2008) (14)
- A complete compiler approach to auto-parallelizing C programs for multi-DSP systems (2005) (14)
- Load Balancing of Parallel Affine Loops by Unimodular Transformations (1992) (13)
- Compiler parallelization of C programs for multi-core DSPs with multiple address spaces (2003) (13)
- Exploiting GPU Hardware Saturation for Fast Compiler Optimization (2014) (12)
- Efficient parallelisation using combined loop and data transformations (1999) (12)
- Automatic generation of specialized direct convolutions for mobile GPUs (2020) (12)
- Exploring the limits of early register release: Exploiting compiler analysis (2009) (12)
- ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI '14, Edinburgh, United Kingdom - June 09 - 11, 2014 (2014) (12)
- M3: Semantic API Migrations (2020) (12)
- CAnDL: a domain specific language for compiler analysis (2018) (11)
- Proceedings of the GCC Developers' Summit (2008) (11)
- A fast and accurate method for determining a lower bound on execution time: Research Articles (2004) (11)
- Predictive search distributions (2006) (11)
- Proceedings of the 1998 Workshop on Profile and Feedback Directed Compilation (PFDC'98) (1998) (10)
- A compiler algorithm to reduce invalidation latency in virtual shared memory systems (1996) (10)
- CoSPARSE: A Software and Hardware Reconfigurable SpMV Framework for Graph Analytics (2021) (9)
- Euro-Par 2003 Parallel Processing (2003) (9)
- Four Metrics to Evaluate Heterogeneous Multicores (2015) (9)
- SparseAdapt: Runtime Control for Sparse Linear Algebra on a Reconfigurable Accelerator (2021) (8)
- Optimising Convolutional Neural Networks Inference on Low-Powered GPUs (2019) (8)
- A New Program Transformation to Minimise Communication in Distributed Memory Architecture (1992) (8)
- Automatic Parameter Tuning of Motion Planning Algorithms (2018) (8)
- An empirical evaluation of high level transformations for embedded processors (2001) (8)
- Cross Component Optimisation in a High Level Category-Based Language (2004) (8)
- Automatically harnessing sparse acceleration (2020) (8)
- Diplomat: Mapping of Multi-kernel Applications Using a Static Dataflow Abstraction (2016) (7)
- Combining program recovery, auto-parallelisation and locality analysis for C programs on multi-processor embedded systems (2003) (7)
- A transformational approach to compiling Sisal for distributed memory architectures (1992) (7)
- Type-Directed Program Synthesis and Constraint Generation for Library Portability (2019) (7)
- Weak heterogeneity as a way of adapting multicores to real workloads (2013) (7)
- PALMOS: A Transparent, Multi-tasking Acceleration Layer for Parallel Heterogeneous Systems (2015) (7)
- Towards general and exact distributed invalidation (2003) (7)
- Distilling with Performance Enhanced Students (2018) (6)
- Neural architecture search as program transformation exploration (2021) (6)
- Measuring flexibility in single-ISA heterogeneous processors (2014) (6)
- Exploring and Predicting the Effects of Microarchitectural Parameters and Compiler Optimizations on Performance and Energy (2012) (6)
- BlockSwap: Fisher-guided Block Substitution for Network Compression (2019) (6)
- An Empirical Architecture-Centric Approach to Microarchitectural Design Space Exploration (2011) (6)
- A compiler framework for automatically mapping data parallel programs to heterogeneous MPSoCs (2014) (6)
- Selecting Heterogeneous Cores for Diversity (2016) (5)
- TASO: Time and Space Optimization for Memory-Constrained DNN Inference (2020) (5)
- Change Detection Based Parallelism Mapping: Exploiting Offline Models and Online Adaptation (2014) (5)
- Modeling black-box components with probabilistic synthesis (2020) (5)
- Compiling multimedia applications on VLIW architecture (1997) (5)
- Full-System Simulation of Mobile CPU/GPU Platforms (2019) (4)
- Deep Data Flow Analysis (2020) (4)
- Loop Rolling for Code Size Reduction (2022) (4)
- A hierarchical locality algorithm for NUMA compilation (1995) (4)
- Incorporating Cache Models in Iterative Compilation for Combined Tiling and Unrolling (2000) (4)
- Algorithmic Performance-Accuracy Trade-off in 3D Vision Applications (2018) (4)
- The MHAOTEU Toolset for Memory Hierarchy Management (2000) (3)
- Efficient Parallelization Using Combined Loop and Data Transformations (1999) (3)
- Bind the gap: compiling real software to hardware FFT accelerators (2022) (3)
- Rapid early-stage microarchitecture design using predictive models (2009) (3)
- Program Lifting using Gray-Box Behavior (2021) (3)
- Proceedings of the 35th ACM SIGPLAN Conference on Programming Language Design and Implementation (2014) (3)
- Practical loop generation (1996) (3)
- A Cross-platform Evaluation of Graphics Shader Compiler Optimization (2018) (3)
- DelayRepay: delayed execution for kernel fusion in Python (2020) (3)
- HAKD: Hardware Aware Knowledge Distillation (2018) (3)
- A Modular Iterative Solver Package in a Categorical Language (2005) (3)
- Iterative Collective Loop Fusion (2006) (3)
- Machine Learning in Compilers (2018) (2)
- Learning C to x86 Translation: An Experiment in Neural Compilation (2021) (2)
- Topic Introduction (2003) (2)
- Augmenting Type Signatures for Program Synthesis (2019) (2)
- Fast and Accurate Evaluation of Memory Performance Upper-Bound (2001) (2)
- Specialization Opportunities in Graphical Workloads (2019) (2)
- F3M: Fast Focused Function Merging (2022) (2)
- Barrier Synchronisation Optimisation (1997) (2)
- Compiler Directed Issue Queue Energy Reduction (2011) (1)
- HETSIM: Simulating Large-Scale Heterogeneous Systems using a Trace-driven, Synchronization and Dependency-Aware Framework (2020) (1)
- CGO 2017 Proceedings of the 2017 International Symposium on Code Generation and Optimization (2017) (1)
- ExeBench: an ML-scale dataset of executable C functions (2022) (1)
- POSTER: Space and Time Optimal DNN Primitive Selection with Integer Linear Programming (2019) (1)
- Towards Automatic Parallelisation for Multi-Processor DSPs (2001) (1)
- Automatic Matching of Legacy Code to Heterogeneous APIs (2018) (1)
- Welcome from the PLDI chairs (2014) (1)
- Exact Distributed Invalidation (2000) (1)
- Retrofitting Symbolic Holes to LLVM IR (2020) (1)
- First Fast Sink: A compiler algorithm for barrier placement optimisation (1998) (1)
- Towards a Parallelising COBOL Compiler Rizos Sakellariou (1996) (0)
- Explorer Four Metrics to Evaluate Heterogeneous Multicores (2017) (0)
- Languages and Compilers for Parallel Computing (1991) (0)
- Interaction with Programers (2001) (0)
- Investigating magic numbers: improving the inlining heuristic in the Glasgow Haskell Compiler (2022) (0)
- Automatic Tuning of Inlining Heuristics for Java JIT Compilation (0)
- ExploringandPredictingthe Architecture/OptimisingCompilerCo-DesignSpace (2008) (0)
- Rovder, S., Cano, J. and O’Boyle, M. (2019) Optimising Convolutional Neural Networks Inference on Low-Powered GPUs. Twelfth International Workshop on Programmability and Architectures for Heterogeneous Multicores (MULTIPROG- (2019) (0)
- M 3 (2020) (0)
- Explorer Selecting Heterogeneous Cores for Diversity (2017) (0)
- HYDA: A HYbrid Dependence Analysis for the adaptive optimisation of OpenCL kernels (2014) (0)
- The MHAOTEU Toolset (2000) (0)
- Topic 4 - Compilers for High Performance (2000) (0)
- Synergy between Compiler Optimizations and Partitioning on the Cell processor (2010) (0)
- Automatic Array Access Recovery in Pointer based DSP Codes. (2000) (0)
- Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques, Edinburgh, United Kingdom, September 7-11, 2013 (2013) (0)
- Matching Linear Algebra and Tensor Code to Specialized Hardware Accelerators (2023) (0)
- Program Graphs for Machine Learning (2020) (0)
- Topic 4 High-Performance Architectures and Compilers (2007) (0)
- HyBF: A Hybrid Branch Fusion Strategy for Code Size Reduction (2023) (0)
- Obituary: Peter Knijnenburg (1961-2007) (2009) (0)
- New Regular Expressions on Old Accelerators (2021) (0)
- Advanced Performance Analysis (2001) (0)
- Introduction to Part 2 (2007) (0)
- Explorer Diversity : A Design Goal for Heterogeneous Processors (2017) (0)
- Euro-Par '99 Parallel Processing, 5th International Euro-Par Conference, Toulouse, France, August 31 - September 3, 1999, Proceedings (1999) (0)
- General chairs' welcome message (2013) (0)
- Barrier Synchronisation (0)
- Type-Directed Program Synthesis and Constraint Generation for Accelerator Library Portability (2019) (0)
- Accelerating Deep Convolutional Neural Networks on Low Power Embedded Devices (2018) (0)
- Edinburgh Research Explorer Navigating the Landscape for Real-time Localisation and Mapping for Robotics, Virtual and Augmented Reality (2018) (0)
- Evaluating Grouped Spatial Pack Convolutions on Edge CPUs (2020) (0)
- Legacy Systems - The Future of HPC (1999) (0)
- Compiler Construction : 21th international conference, CC 2012, held as part of the European Joint Conferences on Theory and Practice of Software, ETAPS 2012, Tallinn, Esstonia, March 24-April 1, 2012 : proceedings (2012) (0)
- Process for Optimizing an Application (2000) (0)
- Proceedings of the 2007 Workshop on the Interaction between Compilers and Computer Architecture (INTERACT'07) (2007) (0)
- Proceedings of the 21st international conference on Compiler Construction (2012) (0)
- Transactions on HiPEAC (2007) (0)
- Edinburgh Research Explorer MILEPOST GCC: machine learning based research compiler (2017) (0)
- Loop versus data scheduling: models, language and application for SVM (1995) (0)
- Obituary: Peter Knijnenburg (1961–2007) (2009) (0)
- Proceedings of the 2000 Workshop on Media Processors and DSPs (MP-DSP'00) (2000) (0)
- Pre-and Post Selection of Compiler Optimizations by Program Execution (2007) (0)
- Designing Efficient Processors Using Compiler-Directed Optimisations (2006) (0)
- Towards a Parallelising COBOL Compiler (2009) (0)
- BLOCKSWAP: FISHER-GUIDED BLOCK SUBSTITUTION (2019) (0)
- Ensemble of Mapping Techniques for Improved Efficiency (2015) (0)
- Workshop on Software & Compilers for Embedded Systems (SCOPES 2001) (2001) (0)
- Data Prefetching and Targeted Loop Optimizations (1999) (0)
- REPORT M 3 . D 3 INTERACTION WITH PROGRAMMERS (2001) (0)
- Rethinking Compilation in a Heterogeneous World (Keynote) (2019) (0)
This paper list is powered by the following services:
Other Resources About Michael O'Boyle
What Schools Are Affiliated With Michael O'Boyle?
Michael O'Boyle is affiliated with the following schools: