Katherine Yelick
#36,923
Most Influential Person Now
American computer scientist and academic
Katherine Yelick's AcademicInfluence.com Rankings
Katherine Yelickcomputer-science Degrees
Computer Science
#1978
World Rank
#2054
Historical Rank
#923
USA Rank
Parallel Computing
#25
World Rank
#25
Historical Rank
#21
USA Rank
Database
#6848
World Rank
#7087
Historical Rank
#834
USA Rank
Download Badge
Computer Science
Katherine Yelick's Degrees
- PhD Electrical Engineering and Computer Science Stanford University
- Masters Electrical Engineering and Computer Science Stanford University
- Bachelors Mathematics University of California, Berkeley
Similar Degrees You Can Earn
Why Is Katherine Yelick Influential?
(Suggest an Edit or Addition)According to Wikipedia, Katherine "Kathy" Anne Yelick, an American computer scientist, is the vice chancellor for research and the Robert S. Pepper Professor of Electrical Engineering and Computer Sciences at the University of California, Berkeley. She is also a faculty scientist at Lawrence Berkeley National Laboratory, where she was Associate Laboratory Director for Computing Sciences from 2010-2019.
Katherine Yelick's Published Works
Published Works
- The Landscape of Parallel Computing Research: A View from Berkeley (2006) (2362)
- Optimization of sparse matrix-vector multiplication on emerging multicore platforms (2007) (826)
- The International Exascale Software Project roadmap (2011) (735)
- A case for intelligent RAM (1997) (658)
- A view of the parallel computing landscape (2009) (653)
- Parallel programming in Split-C (1993) (641)
- Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures (2008) (616)
- Titanium: A High-performance Java Dialect (1998) (559)
- OSKI: A Library of Automatically Tuned Sparse Matrix Kernels (2005) (556)
- Introduction to UPC and Language Specification (2000) (401)
- The potential of the cell processor for scientific computing (2005) (393)
- Sparsity: Optimization Framework for Sparse Matrix Kernels (2004) (361)
- A whole-genome shotgun approach for assembling and anchoring the hexaploid bread wheat genome (2015) (248)
- Optimization and Performance Modeling of Stencil Computations on Modern Microprocessors (2007) (240)
- Self-Adapting Linear Algebra Algorithms and Software (2005) (234)
- Scalable Processors in the Billion-Transistor Era: IRAM (1997) (223)
- UPC: Distributed Shared-Memory Programming (2003) (221)
- UPC Language Specifications V1.1.1 (2003) (216)
- Cluster I/O with River: making the fast case common (1999) (214)
- A Case for Intelligent RAM: IRAM (1997) (207)
- Productivity and performance using partitioned global address space languages (2007) (204)
- UPC++: A PGAS Extension for C++ (2014) (169)
- Optimizing Sparse Matrix Computations for Register Reuse in SPARSITY (2001) (165)
- Performance Optimizations and Bounds for Sparse Matrix-Vector Multiply (2002) (154)
- Implicit and explicit optimizations for stencil computations (2006) (150)
- Avoiding communication in sparse matrix computations (2008) (150)
- Minimizing communication in sparse matrix solvers (2009) (149)
- A performance analysis of the Berkeley UPC compiler (2003) (144)
- Optimizing bandwidth limited problems using one-sided communication and overlap (2005) (139)
- Intelligent RAM (IRAM): chips that remember and compute (1997) (135)
- The Energy Efficiency Of Iram Architectures (1997) (126)
- Scientific Computing Kernels on the Cell Processor (2007) (123)
- Titanium Language Reference Manual (2001) (122)
- Hierarchical Work Stealing on Manycore Clusters (2011) (120)
- SEJITS: Getting Productivity and Performance With Selective Embedded JIT Specialization (2010) (120)
- When cache blocking of sparse matrix vector multiply works and why (2007) (116)
- Lattice Boltzmann simulation optimization on leading multicore platforms (2008) (114)
- Impact of modern memory subsystems on cache optimizations for stencil computations (2005) (113)
- Empirical evaluation of the CRAY-T3D: a compiler perspective (1995) (97)
- Communication optimizations for fine-grained UPC applications (2005) (92)
- An evaluation of current high-performance networks (2003) (87)
- Critical Assessment of Metagenome Interpretation: the second round of challenges (2021) (86)
- Modeling the benefits of mixed data and task parallelism (1995) (84)
- Auto-tuning stencil codes for cache-based multicore platforms (2009) (79)
- Analyses and Optimizations for Shared Address Space Programs (1996) (78)
- HipMer: an extreme-scale de novo genome assembler (2015) (77)
- Parallel De Bruijn Graph Construction and Traversal for De Novo Genome Assembly (2014) (75)
- Performance models for evaluation and automatic tuning of symmetric sparse matrix-vector multiply (2004) (71)
- Optimizing Sparse Matrix Vector Multiplication on SMP (1999) (67)
- Optimizing parallel programs with explicit synchronization (1995) (67)
- Implementing an irregular application on a distributed memory multiprocessor (1993) (65)
- The Parallel Computing Laboratory at U.C. Berkeley: A Research Agenda Based on the Berkeley View (2008) (65)
- Communication lower bounds and optimal algorithms for programs that reference arrays - Part 1 (2013) (62)
- Scaling communication-intensive applications on BlueGene/P using one-sided communication and overlap (2009) (61)
- Intelligent RAM (IRAM): the industrial setting, applications, and architectures (1997) (61)
- Exascale applications: skin in the game (2020) (60)
- Unification in Combinations of Collapse-Free Regular Theories (1987) (59)
- Optimization of a lattice Boltzmann computation on state-of-the-art multicore platforms (2009) (57)
- Connected components on distributed memory machines (1994) (56)
- A Case for Intelligent DRAM: IRAM (1998) (55)
- Deadlock-free scheduling of X10 computations with bounded resources (2007) (54)
- Multi-threading and one-sided communication in parallel LU factorization (2007) (54)
- Languages for High-Productivity Computing: the DARPA HPCS Language Project (2007) (54)
- Reducing Communication in Graph Neural Network Training (2020) (51)
- Titanium Performance and Potential: An NPB Experimental Study (2005) (51)
- Concurrency Analysis for Parallel Programs with Textually Aligned Barriers (2005) (48)
- Avoiding Communication in Computing Krylov Subspaces (2007) (47)
- Making Sequential Consistency Practical in Titanium (2005) (47)
- Memory-intensive benchmarks: IRAM vs. cache-based machines (2002) (47)
- Auto-Tuning the 27-point Stencil for Multicore (2009) (46)
- Introduction to Split-C (1995) (44)
- Hardware/compiler codevelopment for an embedded media processor (2001) (44)
- DARPA's HPCS Program- History, Models, Tools, Languages (2008) (43)
- merAligner: A Fully Parallel Sequence Aligner (2015) (43)
- Communication-Avoiding Parallel Sparse-Dense Matrix-Matrix Multiplication (2016) (42)
- Automatic Performance Tuning and Analysis of Sparse Triangular Solve (2002) (42)
- ROC-1: Hardware Support for Recovery-Oriented Computing (2002) (39)
- UPC: Distributed Shared Memory Programming (Wiley Series on Parallel and Distributed Computing) (2005) (39)
- Autotuning Sparse Matrix-Vector Multiplication for Multicore (2012) (39)
- Hybrid PGAS runtime support for multicore nodes (2010) (38)
- Randomized load balancing for tree-structured computation (1994) (38)
- Communication avoiding and overlapping for numerical linear algebra (2012) (38)
- Parallel Languages and Compilers: Perspective From the Titanium Experience (2007) (37)
- Tuning collective communication for Partitioned Global Address Space programming models (2011) (36)
- Automatic support for irregular computations in a high-level language (2005) (36)
- A Communication-Optimal N-Body Algorithm for Direct Interactions (2013) (35)
- Optimizing Parallel SPMD Programs (1994) (34)
- Type Systems for Distributed Data Sharing (2003) (34)
- LOGAN: High-Performance GPU-Based X-Drop Long-Read Alignment (2020) (30)
- Titanium Language Reference Manual, version 2.19 (2005) (30)
- Optimizing collective communication on multicores (2009) (30)
- Combining Unification Algorithms for Confined Regular Equational Theories (1985) (29)
- Extreme Scale De Novo Metagenome Assembly (2018) (29)
- Memory-efficient optimization of Gyrokinetic particle-to-grid interpolation for multicore processors (2009) (28)
- Evaluating support for global address space languages on the Cray X1 (2004) (28)
- QFAST: Quantum Synthesis Using a Hierarchical Continuous Circuit Space (2020) (28)
- Evaluation of architectural support for global address-based communication in large-scale parallel machines (1996) (27)
- A Parallel Completion Procedure for Term Rewriting Systems (1992) (27)
- Runtime Support for Portable Distributed Data Structures (1995) (27)
- Terabase-scale metagenome coassembly with MetaHipMer (2020) (26)
- Models and Scheduling Algorithms for Mixed Data and Task Parallel Programs (1997) (26)
- Automatic nonblocking communication for partitioned global address space programs (2007) (26)
- Performance Modeling and Analysis of Cache Blocking in Sparse Matrix Vector Multiply (2004) (26)
- Distributed Immersed Boundary Simulation in Titanium (2006) (25)
- SCALABLE PROCESSORS IN THE BILLION-TRANSISTOR THE BILLION-TRANSISTOR ERA :IRAM (1997) (25)
- QFAST: Conflating Search and Numerical Optimization for Scalable Quantum Circuit Synthesis (2021) (24)
- Parallel timing simulation on a distributed memory multiprocessor (1993) (24)
- Porting GASNet to Portals: Partitioned Global Address Space (PGAS) Language Support for the Cray XT (2009) (22)
- Programming models for irregular applications (1993) (21)
- Multipol: A Distributed Data Structure Library (1995) (21)
- A preliminary evaluation of the hardware acceleration of the cray gemini interconnect for PGAS languages and comparison with MPI (2011) (21)
- PERI - auto-tuning memory-intensive kernels for multicore (2008) (21)
- The Optimized Sparse Kernel Interface (OSKI) Library User's Guide for Version 1.0.1h (2007) (20)
- Titanium Language Reference Manual (Version 2.20) (2006) (20)
- Hierarchical Computation in the SPMD Programming Model (2013) (19)
- [Personal health]. (1969) (19)
- [Personal health]. (1969) (19)
- An adaptive mesh refinement benchmark for modern parallel programming languages (2007) (18)
- A proposal for a UPC memory consistency model, v1.0 (2004) (18)
- Multithreading for synchronization tolerance in matrix factorization (2007) (18)
- Accelerating Applications at Scale Using One-Sided Communication (2012) (17)
- Performance Optimizations and Bounds for Sparse Symmetric Matrix-Multiple Vector Multiply (1985) (17)
- BELLA: Berkeley Efficient Long-Read to Long-Read Aligner and Overlapper (2018) (17)
- A preliminary evaluation of the hardware acceleration of the Cray Gemini interconnect for PGAS languages and comparison with MPI (2012) (17)
- Memory Hierarchy Optimizations and Performance ounds for Sparse A (2003) (17)
- An Evaluation of One-Sided and Two-Sided Communication Paradigms on Relaxed-Ordering Interconnect (2014) (17)
- Memory Hierarchy Optimizations and Performance Bounds for Sparse A T Ax (2003) (16)
- Polynomial-Time Algorithms for Enforcing Sequential Consistency in SPMD Programs with Arrays (2003) (16)
- Distributed data structures and algorithms for Gröbner basis computation (1994) (16)
- Enforcing Textual Alignment of Collectives Using Dynamic Checks (2009) (16)
- Generating Permutation Instructions from a High-Level Description (2004) (15)
- ADEPT: a domain independent sequence alignment strategy for gpu architectures (2020) (15)
- Hierarchical Pointer Analysis for Distributed Programs (2007) (15)
- Compiling Verilog into timed finite state machines (1995) (15)
- The roofline model: A pedagogical tool for program analysis and optimization (2008) (14)
- A Local-View Array Library for Partitioned Global Address Space C++ Programs (2014) (14)
- Exploiting On-Chip Memory Bandwidth in the VIRAM Compiler (2000) (14)
- Ten ways to waste a parallel computer (2009) (14)
- Performance analysis of an H.263 video encoder for VIRAM (2000) (14)
- Accelerating Science: A Computing Research Agenda (2016) (13)
- Using abstraction in explicitly parallel programs (1992) (12)
- Performance Portable Optimizations for Loops Containing Communication Operations (2007) (12)
- The parallelism motifs of genomic data analysis (2020) (12)
- diBELLA: Distributed Long Read to Long Read Alignment (2019) (12)
- AI for Science (2020) (11)
- On the conditions for efficient interoperability with threads: an experience with PGAS languages using cray communication domains (2014) (11)
- 10 Years Later: Cloud Computing is Closing the Performance Gap (2020) (11)
- On Holder-Brascamp-Lieb inequalities for torsion-free discrete Abelian groups (2015) (11)
- On the Correctness of a Distributed Memory Gröbner basis Algorithm (1993) (11)
- A Computation- and Communication-Optimal Parallel Direct 3-Body Algorithm (2014) (11)
- BCL: A Cross-Platform Distributed Data Structures Library (2019) (11)
- Common runtime support for high-performance parallel languages parallel compiler runtime consortium (1993) (11)
- Parallelizing the Phylogeny Problem (1995) (11)
- Moded type systems for logic programming (1989) (10)
- Communication-Avoiding Optimization Methods for Distributed Massive-Scale Sparse Inverse Covariance Estimation (2017) (10)
- Array prefetching for irregular array accesses in Titanium (2004) (10)
- Portable Runtime Support for Asynchronous Simulation (1995) (10)
- Identifying performance bottlenecks on modern microarchitectures using an adaptable probe (2004) (10)
- PersGNN: Applying Topological Data Analysis and Geometric Deep Learning to Structure-Based Protein Function Prediction (2020) (9)
- Data Structures for Irregular Applications (1993) (9)
- Scheduling dynamic parallelism on accelerators (2009) (9)
- Yada: Straightforward parallel programming (2011) (9)
- Auto-Tuning Stencil Computations on Multicore and Accelerators (2010) (9)
- Portable Parallel Irregular Applications (1995) (8)
- Hierarchical Additions to the SPMD Programming Model (2012) (8)
- Optimization of Parallel Particle-to-Grid Interpolation on Leading Multicore Platforms (2012) (8)
- An Asynchronous Task-based Fan-Both Sparse Cholesky Solver (2016) (8)
- A GENERALIZED APPROACH TO EQUATIONAL UNIFICATION (1985) (8)
- Performance Characterization of De Novo Genome Assembly on Leading Parallel Systems (2017) (8)
- Parallel Programming Languages (1998) (8)
- Performance Modeling and Optimization of a High Energy Colliding Beam Simulation Code (2006) (8)
- Efficient FFTs on IRAM (2000) (7)
- Parallel Hessian Assembly for Seismic Waveform Inversion Using Global Updates (2015) (7)
- Optimizing partitioned global address space programs for cluster architectures (2007) (7)
- Speech Recognition on Vector Architectures Speech Recognition on Vector Architectures Abstract Speech Recognition on Vector Architectures (2007) (6)
- Distributed-memory parallel algorithms for sparse times tall-skinny-dense matrix multiplication (2021) (6)
- Automatic Communication Performance Debugging in PGAS Languages (2007) (6)
- Chapter 9 Communication Avoiding ( CA ) and Other Innovative Algorithms (2013) (6)
- Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming (2007) (5)
- Implementing High-Performance Geometric Multigrid Solver with Naturally Grained Messages (2015) (5)
- BCL: A Cross-Platform Distributed Container Library (2018) (5)
- Appendix B: UPC Collective Operations Specifications, v1.0 (2005) (5)
- Future Directions for Parallel and Distributed Computing: SPX 2019 Workshop Report (2019) (5)
- Improving Memory Subsystem Performance Using ViVA: Virtual Vector Architecture (2009) (5)
- MerBench: PGAS Benchmarks for High Performance Genome Assembly (2017) (5)
- Dense and Sparse Matrix Operations on the Cell Processor (2005) (5)
- Parallel String Graph Construction and Transitive Reduction for De Novo Genome Assembly (2020) (5)
- Accelerating Time-To-Solution for Computational Science and Engineering (2009) (4)
- A Hartree-Fock Application Using UPC++ and the New DArray Library (2016) (4)
- Evaluation of PGAS Communication Paradigms with Geometric Multigrid (2014) (4)
- Science-Driven Computing: NERSC's Plan for 2006-2010 (2005) (4)
- Compiler and Runtime Support for Scaling Adaptive Mesh Refinement Computations in Titanium (2006) (4)
- Extreme-Scale De Novo Genome Assembly (2017) (4)
- CloudBank: Managed Services to Simplify Cloud Access for Computer Science Research and Education (2021) (4)
- Optimized collectives for PGAS languages with one-sided communication (2006) (3)
- A whole-genome shotgun approach for assembling and anchoring the hexaploid bread wheat genome (2015) (3)
- Indigo: A Domain-Specific Language for Fast, Portable Image Reconstruction (2018) (3)
- Intelligent RAM (IRAM) (1997) (3)
- Workshop Report: Petascale Computing in the Geosciences in View of the National Science Foundation's Recent Announcement Entitled: Leadership- Class System Acquisition -creating a Petascale Computing Environment for Science and Engineering, Which Calls for Deployment of a Petascale Computational Fac (3)
- The Parallel Computing Laboratory at U . C . (2008) (3)
- Communication-optimal iterative methods (2009) (3)
- Performance Engineering: Understanding and Improving thePerformance of Large-Scale Codes (2007) (3)
- Programming models for petascale to exascale (2008) (3)
- Special Issue on Automatic Performance Tuning (2004) (3)
- CS 267: Applications of Parallel Computers (2004) (3)
- The Castle Project (2000) (3)
- A Unified Model for Shared-Memory and Message-Passing Systems (1993) (3)
- Beyond UPC (2009) (2)
- Performance Trade-offs in GPU Communication: A Study of Host and Device-initiated Approaches (2020) (2)
- Advanced Cyberinfrastructure for Science, Engineering, and Public Policy (2017) (2)
- Accelerating Large Scale de novo Metagenome Assembly Using GPUs (2021) (2)
- Lawrence Berkeley National Laboratory Recent Work Title Auto-Tuning the 27-point Stencil for Multicore Permalink (2009) (2)
- Using Moded Type Systems to Support Abstraction in Logic Programs (1992) (2)
- An interface for a self-optimizing sparse matrix kernel library (2005) (2)
- Atos: A Task-Parallel GPU Scheduler for Graph Analytics (2022) (2)
- Data Sharing Analysis for Titanium (2001) (2)
- Compiling to avoid communication (2012) (2)
- Performance modeling and composition: a case study in cell simulation (1996) (2)
- Performance Tuning of Matrix Triple Products Based on Matrix Structure (2004) (1)
- Challenges and Strategies for High End Computing (2007) (1)
- DEGAS: Dynamic Global Address Space programming environments (2015) (1)
- Atos: A Task-Parallel GPU Dynamic Scheduling Framework for Dynamic Irregular Computations (2021) (1)
- Extreme-Scale Many-against-Many Protein Similarity Search (2022) (1)
- Performance Advantages of Partitioned Global Address Space Languages (2006) (1)
- Exascale opportunities and challenges (2011) (1)
- Distributed-Memory k-mer Counting on GPUs (2021) (1)
- Distributed-Memory Parallel Contig Generation for De Novo Long-Read Genome Assembly (2022) (1)
- RDMA vs. RPC for Implementing Distributed Data Structures (2019) (1)
- Panel: Programming Language Paradigms: Past, Present, and Future (2007) (1)
- Dynamic Shared Memory Allocation (2005) (1)
- Communication-Avoiding Optimization Methods for Massive-Scale Graphical Model Structure Learning (2017) (1)
- Concurrency Analysis for Parallel Programs With Textual Barriers (2005) (1)
- REPORT OF THE 2014 Programming Models & Environments Summit (2016) (1)
- Performance Analysis of a High Energy Colliding Beam Simulation Code on Four HPC Architectures (2006) (1)
- Programming View and UPC Data Types (2005) (1)
- BCL (2019) (1)
- Software Roadmap to Plug and Play Petaflop/s (2006) (1)
- Opportunities and Challenges for Next Generation Computing (2020) (1)
- Resource-Efficient, Hierarchical Auto-Tuning of a Hybrid Lattice Boltzmann Computation on the Cray XT4 (2009) (1)
- diBELLA (2019) (1)
- Compilation Techniques for Partitioned Global Address Space Languages (2006) (1)
- The Endgame for Moore's Law: Architecture, Algorithm, and Application Challenges. (2015) (1)
- The Road for Recovery: Aligning COVID-19 efforts and building a more resilient future (2020) (1)
- 1 ACCELERATING SCIENCE : A COMPUTING RESEARCH AGENDA (2016) (1)
- Empirical Evaluation of Global Memory Support on the CRAY-T3D and CRAY-T3E (1998) (0)
- Center for Scalable Application Development Software: Final Report (2014) (0)
- Lawrence Berkeley National Laboratory Lawrence Berkeley National Laboratory Title Optimization of Sparse Matrix-Vector Multiplication on Emerging Multicore Platforms Permalink (2008) (0)
- In View of the National Science Foundation's Recent Announcement Entitled: Leadership-class System Acquisition -creating a Petascale Computing Environment for Science and Engineering, Which Calls for Deployment of a Petascale Computational Facility Capable of Sustained Scientific Applications Perfor (0)
- Session details: Invited panel (2005) (0)
- Edison - A New Cray Supercomputer Advances Discovery at NERSC (2014) (0)
- LBNL Computational Research & Theory Facility Groundbreaking - Full Press Conference. Feb 1st, 2012 (2012) (0)
- Pointers and Arrays (2005) (0)
- Technical perspectiveAbstraction for parallelism (2009) (0)
- What is Supercomputing? A Conversation with Kathy Yelick (2012) (0)
- Appendix A: UPC Language Specifications, v1.1.1 (2005) (0)
- Parallel Completion (1990) (0)
- Asynchrony versus bulk-synchrony for a generalized N-body problem from genomics (2021) (0)
- Appendix D: How to Compile and Run UPC Programs (2005) (0)
- Lawrence Berkeley National Laboratory Recent Work Title An Asynchronous Task-based Fan-Both Sparse Cholesky Solver Permalink (2018) (0)
- Modeling the Bene ts of Mixed Data and Task ParallelismSoumen Chakrabarti (2013) (0)
- An Unsteady Flamelet Solver for Parallel Computers Implementation and Testing CS 267-Spring 2004 - (0)
- Creating Software Technology to Harness the Power of Leadership-class Computing Systems (2007) (0)
- Multicore: Fallout From a Computing Evolution (LBNL Summer Lecture Series) (2008) (0)
- Advanced Scientific Computing Research The Potential of the Cell Processor for Scientific Computing (2006) (2008) (0)
- CHIUW 2018 Keynote (2018) (0)
- Simulation Optimization on Leading Multicore Platforms (2008) (0)
- GPU accelerated partial order multiple sequence alignment for long reads self-correction (2020) (0)
- Expert Structured Linear Operator Backend Optimized Operator CustomGPU CUDA CustomCPU MKL Numpy Platform GPU (2018) (0)
- Lawrence Berkeley National Laboratory Recent Work Title Enforcing textual alignment of collectives using dynamic checks Permalink (2009) (0)
- Compiling Explicitly Parallel Programs (2001) (0)
- Language innovations for HPCS (2005) (0)
- Thread Level Speculation (TLS) Parallelization (2011) (0)
- 2019 Computing Sciences Strategic Plan (2021) (0)
- Optimizing Collective Communication on Multicores 12th Workshop on Hot Topics in Operating Systems (hotos Xii) It 's Dead, Jim (2009) (0)
- Best paper awards: 26th international parallel and distributed processing symposium (IPDPS 2012) (2013) (0)
- Performance Optimizations and Bounds for Sparse Matrix Kernels (2002) (0)
- LBNL Computational Research and Theory Facility Groundbreaking. February 1st, 2012 (2012) (0)
- ADEPT: a domain independent sequence alignment strategy for gpu architectures (2020) (0)
- Optimizations & Bounds for Sparse Symmetric Matrix-Vector Multiply (2004) (0)
- Center for Scalable Application Development Software (2014) (0)
- Systems Software for Irregular Parallel Applications (2001) (0)
- CS 267 : Applications of Parallel Computers Assignment 0 (2013) (0)
- SPAA'21 Panel Paper: Architecture-Friendly Algorithms versus Algorithm-Friendly Architectures (2021) (0)
- Science Driven Supercomputing Architectures: AnalyzingArchitectural Bottlenecks with Applications and Benchmark Probes (2005) (0)
- Lawrence Berkeley National Laboratory Recent Work Title Exascale scientific applications : Scalability and performance portability (2017) (0)
- Final Report from The University of Texas at Austin for DEGAS: Dynamic Global Address Space programming environments (2018) (0)
- Scheduling Dynamic Parallelism on the Cell BE (2009) (0)
- LOGAN: High-Performance GPU-Based X-Drop (2020) (0)
- Computing and Data Challenges in Climate Change (2020) (0)
- Automatic Performance Tuning of Sparse Matrix-Multiple Vector Multiply (2002) (0)
- Scalable Irregular Parallelism with GPUs: Getting CPUs Out of the Way (2022) (0)
- Synchronization and Memory Consistency (2005) (0)
- RUNTIME SYSTEMS REPORT 2014 Runtime Systems Summit (2016) (0)
- that scale with increasing numbers of cores should be as easy as writing programs for sequential computers (2018) (0)
- Scaling Generalized N-Body Problems, A Case Study from Genomics (2021) (0)
- symPACK: a solver for sparse Symmetric Matrices (2016) (0)
- Operating Systems for Exascale (0)
- High-Performance Filters for GPUs (2022) (0)
- Computing, data and COVID-19 (2020) (0)
- Work Sharing and Domain Decomposition (2005) (0)
- Use of a high-level language in high performance biomechanics simulations (2006) (0)
- Autotuning Structured Grid Kernels (2008) (0)
- Systems Support for Irregular Parallel Applications (Abstract) (1996) (0)
- Keynote address: Moving a science workload to exascale computing (2012) (0)
- Performance Tuning and Optimization (2005) (0)
- Randomized Load Balancing for Tree-structured (1994) (0)
- Appendix E: Quick UPC Reference (2005) (0)
- The Energy Efficiencyof IRAM Architectures (1997) (0)
- Multicore: Fallout from a Computing Evolution (2008) (0)
- Appendix C: UPC‐IO Specifications, v1.0 (2005) (0)
- Interacting agents for local search (1999) (0)
- Proceedings of the ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2005, June 15-17, 2005, Chicago, IL, USA (2005) (0)
- Merging processing and memory into a single DRAM chip could revolutionize the semiconductor industry . A CASE FOR INTELLIGENT RAM (1997) (0)
- HPC Performance Improvements Through Innovative Architecture (2019) (0)
- Theory of Mazurkiewicz-Traces (2011) (0)
- Summer Series 2012 - Conversation with Kathy Yelick (LBNL Summer Lecture Series) (2012) (0)
- Session details: Memory models and concurrency analysis (2007) (0)
- Summer Series 2012 - Conversation with Kathy Yelick (2012) (0)
- A Communication-Optimal N-Body Algorithm for Short-Range Interactions (2012) (0)
This paper list is powered by the following services:
Other Resources About Katherine Yelick
What Schools Are Affiliated With Katherine Yelick?
Katherine Yelick is affiliated with the following schools: