Marc Snir

Q: What Schools Are Affiliated With Marc Snir

Marc Snir is affiliated with the following schools: Technion – Israel Institute of Technology, Hebrew University of Jerusalem, University of Chicago, University of Illinois Urbana-Champaign

Marc Snir's AcademicInfluence.com Rankings

Marc Snir

Computer Science

#1509

World Rank

#1563

Historical Rank

#743

USA Rank

Parallel Computing

World Rank

Historical Rank

USA Rank

Database

#2871

World Rank

#2996

Historical Rank

#573

USA Rank

computer-science Degrees

Download Badge

Computer Science

Why Is Marc Snir Influential?

(Suggest an Edit or Addition)

According to Wikipedia, Marc Snir is an Israeli-American computer scientist. He holds a Michael Faiman and Saburo Muroga Professorship in the Department of Computer Science at the University of Illinois at Urbana-Champaign. He currently pursues research in parallel computing. He was the principal investigator for the software of the petascale Blue Waters system and co-director of the Intel and Microsoft-funded Universal Parallel Computing Research Center .

(See a Problem?)

Marc Snir's Published Works

Number of citations in a given year to any of this author's works

Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author

Published Works

MPI: The Complete Reference (1996) (2781)
Accepted for publication (1999) (2354)
Report No (2005) (1366)
The International Exascale Software Project roadmap (2011) (735)
The NYU Ultracomputer—Designing an MIMD Shared Memory Parallel Computer (1983) (624)
The Performance of Multistage Interconnection Networks for Multiprocessors (1983) (569)
MPI - The Complete Reference: Volume 1, The MPI Core (1998) (491)
Efficient and correct execution of parallel programs that share memory (1988) (414)
Toward Exascale Resilience (2009) (393)
Addressing failures in exascale computing (2014) (386)
Mpi - The Complete Reference: Volume 2, the Mpi Extensions (1998) (290)
Toward Exascale Resilience: 2014 update (2014) (282)
Blue Gene: A vision for protein science using a petaflop supercomputer (2001) (280)
A model for hierarchical memory (1987) (265)
Communication Complexity of PRAMs (1990) (262)
The power of parallel prefix (1985) (252)
A Complexity Theory of Efficient Parallel Algorithms (1990) (235)
Generalized communicators in the Message Passing Interface (1996) (221)
Generic topology mapping strategies for large-scale parallel architectures (2011) (215)
Parallel programming must be deterministic by default (2009) (208)
Hierarchical memory with block transfer (1987) (200)
Probabilities over rich languages, testing and randomness (1982) (192)
MPI-2: Extending the Message-Passing Interface (1996) (188)
GETTING UP TO SPEED THE FUTURE OF SUPERCOMPUTING (2004) (183)
On communication latency in PRAM computations (1989) (176)
Mpi---the complete reference: volume 1 (1998) (172)
A message passing standard for MPP and workstations (1996) (161)
SP2 System Architecture (1999) (158)
Big data, but are we ready? (2011) (153)
Overview of the MPI-IO Parallel I/O Interface (1996) (151)
Random walks on weighted graphs and applications to on-line algorithms (1993) (143)
Computing on an anonymous ring (1985) (142)
Memory versus randomization in on-line algorithms (1994) (133)
CCL: a portable and tunable collective communication library for scalable parallel computers (1994) (132)
Fault prediction under the microscope: A closer look into HPC systems (2012) (128)
Some Exact Complexity Results for Straight-Line Computations over Semirings (1982) (126)
A Unified Theory of Interconnection Network Structure (1986) (121)
Java programming for high-performance numerical computing (2000) (117)
Uncoordinated Checkpointing Without Domino Effect for Send-Deterministic MPI Applications (2011) (117)
Efficient synchronization of multiprocessors with shared memory (1988) (116)
Computing on an anonymous ring (1988) (116)
On Parallel Searching (2011) (115)
An Introduction to the MPI Standard (1995) (114)
Gluon: a communication-optimizing substrate for distributed heterogeneous graph analytics (2018) (111)
Depth-Size Trade-Offs for Parallel Prefix Computation (1986) (110)
Taming parallel I/O complexity with auto-tuning (2013) (109)
Lower Bounds on Probabilistic Linear Decision Trees (1985) (107)
The Distribution of Waiting Times in Clocked Multistage Interconnection Networks (1988) (105)
Damaris: How to Efficiently Leverage Multicore Parallelism to Achieve Scalable, Jitter-free I/O (2012) (103)
From Trace Generation to Visualization: A Performance Framework for Distributed Parallel Systems (2000) (91)
Mpi-io: a parallel file i/o interface for mpi (1995) (90)
Performance modeling for systematic performance tuning (2011) (89)
Efficient synchronization of multiprocessors with shared memory (1986) (86)
Scheduling the I/O of HPC Applications Under Congestion (2015) (85)
The NYU ultracomputer—designing a MIMD, shared-memory parallel machine (2018) (84)
Parallel File Systems for the IBM SP Computers (1995) (83)
The Communication Software and Parallel Environment of the IBM SP2 (1995) (80)
NUMA-aware shared-memory collective communication for MPI (2013) (79)
Argobots: A Lightweight Low-Level Threading and Tasking Framework (2018) (76)
Ghost Cell Pattern (2010) (75)
Issues related to MIMD shared-memory computers: the NYU ultracomputer approach (1985) (69)
The NYU Ultracomputer—designing a MIMD, shared-memory parallel machine (Extended Abstract) (1982) (65)
The IBM External User Interface for Scalable Parallel Systems (1994) (64)
On Communication Determinism in Parallel HPC Applications (2010) (61)
Memory Versus Randomization in On-line Algorithms (Extended Abstract) (1989) (60)
Better Computing on the Anonymous Ring (1988) (59)
The MPI core (1998) (58)
Using visualization tools to understand concurrency (1992) (58)
Automatic tuning matrix multiplication performance on graphics hardware (2005) (57)
A Framework for Measuring Supercomputer Productivity (2004) (56)
A Note on N-Body Computations with Cutoffs (2004) (54)
Enabling MPI interoperability through flexible communication endpoints (2013) (53)
Optimizing Array Reference Checking in Java Programs (1998) (53)
Programming for Exascale Computers (2013) (47)
FlipIt: An LLVM Based Fault Injector for HPC (2014) (45)
HydEE: Failure Containment without Event Logging for Large Scale Send-Deterministic MPI Applications (2012) (44)
Enabling communication concurrency through flexible MPI endpoints (2014) (43)
ESoftCheck: Removal of Non-vital Checks for Fault Tolerance (2009) (43)
LOGAIDER: A Tool for Mining Potential Correlations of HPC Log Events (2017) (42)
Failure prediction for HPC systems and applications (2013) (41)
Applications of Ramsey's theorem to decision tree complexity (1985) (40)
Improving Strong-Scaling of CNN Training by Exploiting Finer-Grained Parallelism (2019) (40)
Channel and filter parallelism for large-scale CNN training (2019) (38)
Demonstrating the scalability of a molecular dynamics application on a Petaflop computer (2001) (38)
Damaris: Addressing Performance Variability in Data Management for Post-Petascale Simulations (2016) (36)
Random walks on weighted graphs, and applications to on-line algorithms (1990) (36)
Exploring the feasibility of lossy compression for PDE simulations (2019) (36)
Optimizing the Barnes-Hut algorithm in UPC (2011) (35)
Improving parallel I/O autotuning with performance modeling (2014) (35)
MPI programming environment for IBM SP1/SP2 (1995) (34)
On the Theory of Spatial and Temporal Locality (2005) (34)
Ubiquitous Parallel Computing from Berkeley, Illinois, and Stanford (2010) (34)
Transformation for class immutability (2011) (34)
An Efficient Implementation of MPI (1994) (33)
Exascale Operating Systems and Runtime Software Report (2012) (31)
Efficient parallel algorithms for graph problems (1990) (31)
Reducing Waste in Extreme Scale Systems through Introspective Analysis (2016) (28)
Designing Efficient, Scalable, and Portable Collective Communication Libraries (1993) (27)
On the depth complexity of formulas (1979) (27)
Techniques for Parallel Manipulation of Sparse Matrices (1989) (27)
Aluminum: An Asynchronous, GPU-Aware Communication Library Optimized for Large-Scale Training of Deep Neural Networks on HPC Systems (2018) (27)
Calling Names on Nameless Networks (1994) (26)
Optimizing I/O Performance of HPC Applications with Autotuning (2019) (26)
Towards millions of communicating threads (2016) (26)
VLSI COMBINING NETWORK FOR THE NYU ULTRACOMPUTER. (1985) (25)
TLB consistency on highly-parallel shared-memory multiprocessors (2018) (25)
Pattern-driven parallel I/O tuning (2015) (25)
Improved MPI collectives for MPI processes in shared address spaces (2014) (24)
Exascale Research: Preparing for the Post-Moore Era (2011) (23)
Distributed Monitoring and Management of Exascale Systems in the Argo Project (2015) (22)
Exploring Properties and Correlations of Fatal Events in a Large-Scale HPC System (2019) (22)
Circuit partitioning with size and connection constraints (1983) (22)
On the distribution of delays in buffered multistage interconnection networks for uniform and nonuni (1984) (21)
Towards a More Complete Understanding of SDC Propagation (2017) (20)
A Near-Memory Processor for Vector, Streaming and Bit Manipulation Workloads (2005) (20)
The NYU ultracomputer: An MIMD shared memory parallel computer (1984) (20)
Message proxies for efficient, protected communication on SMP clusters (1997) (19)
Demonstrating the Scalability of a Molecular Dynamics Application on a Petaflops Computer (2002) (19)
MPI-F: An Efficient Implementation of MPI on IBM-SP1 (1994) (18)
MiniAMR - A miniapp for Adaptive Mesh Refinement (2016) (18)
Automatic datatype generation and optimization (2012) (18)
Gluon-Async: A Bulk-Asynchronous System for Distributed and Heterogeneous Graph Analytics (2019) (18)
PRISM: an integrated architecture for scalable shared memory (1998) (17)
Doing Moore with Less - Leapfrogging Moore's Law with Inexactness for Supercomputing (2016) (16)
Characterizing and Understanding HPC Job Failures Over The 2K-Day Life of IBM BlueGene/Q System (2019) (16)
A Lightweight Communication Runtime for Distributed Graph Analytics (2018) (16)
Predicting HPC parallel program performance based on LLVM compiler (2017) (15)
Recorder 2.0: Efficient Parallel I/O Tracing and Analysis (2020) (13)
Operating System Support for High Performance Parallel I/O Systems (1994) (13)
A Complexity Theory of Efficient Parallel Algorithms (Extended Abstract) (1988) (13)
The covering problem of complete uniform hypergraphs (1979) (13)
Cost-Performance Tradeoffs for Interconnection Networks (1992) (13)
Bringing the HPC Programmer ’ s IDE into the 21 st Century through Refactoring (2010) (12)
Overcoming the power wall by exploiting inexactness and emerging COTS architectural features: Trading precision for improving application quality (2016) (11)
MPI on IBM SP1/SP2: current status and future directions (1994) (11)
Computer and information science and engineering (2011) (11)
On parallel searching (Extended Abstract) (1982) (11)
2002 International Parallel and Distributed Processing Symposium (2003) (10)
Automatic Generation of I/O Kernels for HPC Applications (2014) (10)
OPTIMAL INTERCONNECTION NETWORKS FOR PARALLEL PROCESSORS: THE IMPORTANCE OF BEING SQUARE. (1987) (10)
Automatic generation of benchmarks for I/O-intensive parallel applications (2019) (10)
The importance of being square (1984) (10)
Writing Parallel Libraries with MPI - Common Practice, Issues, and Extensions (2011) (10)
Dynamic Model-Driven Parallel I/O Performance Tuning (2015) (10)
Software Abstractions and Methodologies for HPC Simulation Codes on Future Architectures (2013) (9)
What Are the Top Ten Most Influential Parallel and Distributed Processing Concepts of the Past Millenium? (2001) (9)
Proving Lower Bounds for Linar Decision Trees (1981) (9)
Efficient software checking for fault tolerance (2008) (9)
Damaris: Leveraging Multicore Parallelism to Mask I/O Jitter (2011) (9)
A Transformation to Convert Packing Code to Compact Datatypes for Efficient Zero-Copy Data Transfer (2011) (9)
Java for numerically intensive computing: from flops to gigaflops (1999) (8)
Establishing a master's degree programme in bioinformatics: challenges and opportunities. (2005) (8)
Randomized routing with shorter paths (1993) (8)
Programming Patterns for Architecture-Level Software Optimizations on Frequent Pattern Mining (2007) (8)
Understanding the Propagation of Error Due to a Silent Data Corruption in a Sparse Matrix Vector Multiply (2015) (8)
Comparisons between Linear Functions can Help (1982) (8)
Scalable Parallel Computers and Scalable Parallel Codes: From Theory to Practice (1992) (8)
Design of a Multithreaded Barnes-Hut Algorithm for Multicore Clusters (2015) (8)
The Future of Supercomputing (2014) (8)
On the Size Complexity of Monotone Formulas (1980) (7)
Navigating the Blue Waters : Online Failure Prediction in the Petascale Era (2013) (7)
Refactoring for Immutability (2010) (7)
Techniques for Efficient Software Checking (2007) (7)
Neural Network Based Silent Error Detector (2018) (7)
Towards a more fault resilient multigrid solver (2015) (7)
A New Locality Metric and Case Studies for HPCS Benchmarks (2005) (6)
Efficient Parallel Algorithms for Graph Models (1986) (6)
Viewpoint: Computer and Information Science and Engineering: One Discipline, Many Specialties (2011) (6)
GENERALIZED FOSTER’S IDENTITIES (2009) (6)
PPL: an abstract runtime system for hybrid parallel programming (2015) (5)
Scalable parallel computing: the IBM 9076 scalable POWERparallel 1 (1993) (5)
Parallel I/O: Getting ready for prime time (1995) (5)
Parallel Programming Language 1 (PPL1) V0.9 - Draft (2006) (5)
Cost-bandwidth tradeoffs for communication networks (1989) (5)
Proceedings of the ACM 1999 Conference on Java Grande, JAVA '99, San Francisco, CA, USA, June 12-14, 1999 (1999) (5)
Improving the trust in results of numerical simulations and scientific data analytics (2015) (4)
Performance engineering: a must for petascale and beyond (2011) (4)
A General Space-filling Curve Algorithm for Partitioning 2D Meshes (2015) (3)
Pilgrim: Scalable and (near) lossless MPI Tracing (2021) (3)
Eliminating contention bottlenecks in multithreaded MPI (2017) (3)
Shared memory programming on distributed memory systems (2009) (3)
Size-depth Trade-Offs for Monotone Arithmetic Circuits (1991) (3)
Issues and directions in scalable parallel computing (1993) (3)
Understanding and Finding Crash-Consistency Bugs in Parallel File Systems (2020) (3)
A Brief Description of the NMP ISA and Benchmarks (2005) (3)
Applications of Ramsey's Theorem to Decision Trees Complexity (Preliminary Version) (1984) (3)
Exact Balancing is Not Always Good (1986) (3)
Exascale and Beyond: Configuring, Reasoning, Scaling. Report of the 2011 Workshop on Architectures II: Exascale and Beyond, held August 8-10, Albuquerque, N.M. (2011) (3)
Space-filling curves for Partitioning Adaptively Refined Meshes (2015) (2)
Analyzing the Performance of a Sparse Matrix Vector Multiply for Extreme Scale Computers (2015) (2)
On communication determinism in parallel HPC applications ( Invited Paper ) (2010) (2)
Fault Management Workshop Final Report, August 13, 2012 (2012) (2)
Guest Editorial: Special Issue on Network and Parallel Computing for Emerging Architectures and Applications (2019) (2)
Distributed-Memory Multiprocessor (2011) (2)
A Direct Approach to the Parallel Evaluation of Rational Expressions with a Small Number of Processors (1977) (2)
The informal guide to ACM fellow nominations (2017) (2)
Rewriting Logic (2011) (2)
Parallel Computation Models — Some Useful Questions (1989) (2)
Reduce and Scan (2011) (2)
Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming (1999) (2)
SOME RESULTS ON PACKET-SWITCHING NETWORKS FOR MULTIPROCESSING (EXTENDED ABSTRACT). (1982) (2)
What should we expect from parallel language standards ? Discussion (1992) (1)
Distinguished Members Advice to Members Seeking ACM Distinction (2010) (1)
On the Need for a Consortium of Capability Centers (2009) (1)
Best Papers from the 2002 International Parallel and Distributed Processing Symposium (2003) (1)
REPORT OF THE 2014 Programming Models & Environments Summit (2016) (1)
A Language for Deterministic-by-Default Parallel Programming ⋆ (2010) (1)
Comparing archival policies for Blue Waters (2011) (1)
Near-Lossless MPI Tracing and Proxy Application Autogeneration (2023) (1)
Aspects of the parallel program execution: work, time and the current state (1991) (1)
Programming Models for High-Performance Computing (2013) (1)
Bolstering U.S. Supercomputing (2005) (1)
PRISM-a design for scalable shared memory (1997) (1)
Verifying IO Synchronization from MPI Traces (2021) (1)
Technical perspective: The future of MPI (2018) (1)
The Ultraswitch: A Vlsi Network Node for Parallel Processing (2015) (1)
PI Programming Environment for IBM SPl/SP2 (1995) (1)
File System Semantics Requirements of HPC Applications (2021) (1)
Scalable parallel systems: past, present and future (from an IBM perspective) (1996) (1)
Improved MPI collectives for MPI processes in shared address spaces (2014) (0)
1 N-Body Pattern Language (0)
Performance Engineering : A Must for Petaflops and Beyond Extended (2011) (0)
Universal parallel computing research center at Illinois (2009) (0)
On the Complexity of Simplifying Quadratic Forms (1981) (0)
Making Parallel Programming Easy: Research Contributions from Illinois (2013) (0)
Bridging the Performance Gap between Manual and Automatic Compilers with Intent-based Compilation (2015) (0)
Computational Power of low-resource distributed systems: • Jukka Suomela: "Survey of Local Algorithms", Journal ACM Computing Surveys, 2013. (2013) (0)
Space-filling Curves for Domain Decomposition in Scientific Simulations (2014) (0)
COMMUNICAI ’ ION COMPLEXITY OF PRAMS (2002) (0)
First IEEE International Workshop on High-Performance Storage (HPS) (2020) (0)
Efficient and Correct Programs that Share DENNIS SHASHA Execution of Parallel Memory (1999) (0)
Overcoming the Power Wall by Exploiting Application Inexactness and Emerging COTS Architectural Features : Trading Precision for Improving Application Quality (2016) (0)
Guest Editorial: Special Issue on Network and Parallel Computing for Emerging Architectures and Applications (2019) (0)
FROM DETECTION TO OPTIMIZATION: IMPACT OF SILENT ERRORS ON SCIENTIFIC APPLICATIONS (2016) (0)
Reducing Waste in Large Scale Systems through Introspective Analysis (2015) (0)
Comm Engine Sync Thread SCSP Queue Comm Thread Request Container Threading Model Sync Container Worker Worker . . . Worker Shepherd Shepherd (2015) (0)
Computer architectures and programming models for scalable parallel computing (abstract) (1993) (0)
Held in Conjunction with the Eleventh International Symposium on High Performance Computer Architecture Program Committee: Proceedings of the Second Workshop on Productivity and Performance in High-end Computing (pphec-05) Keynote Address Mitigating the Risks Faced by Large-scale Computational Scien (2005) (0)
MPI: The Complete Reference [Book Review] (1997) (0)
Session details: Session 5 (2013) (0)
CIRCUIT PARTITIONING WITH SIZE AND CONNECTION CONSTRAINTS (EXTENDED ABSTRACT). (1982) (0)
Advice to members seeking ACM distinction (2010) (0)
סבוך העומק של נוסחאות (Depth complexity of formulas.) (1979) (0)
For a Massive Number of Massively Parallel Machines: What are the Target Applications, Who are the Target Users, and What New R&D is Needed to Hit the Target? (1996) (0)
FULT (2018) (0)
Memory versus randomization in on-line aigoritiims by P . (2009) (0)
356 IEEE TRANSACTIONS ON PARALLEL AND DlSTRlB (0)
Guest editor's foreword special issue on parallel computing (1996) (0)
New York University (NYU) Ultracomputer -- A General-Purpose Parallel Processor (1982) (0)
Scalable Shared Memory and Shared Memory Clustering (1998) (0)
3rd Workshop on Many-Task Computing on Grids and Supercomputers, MTAGS@SC 2010, New Orleans, Louisiana, USA, November 15, 2010 (2010) (0)
PGAS with Lightweight Threads and the Barnes-Hut Algorithm (2014) (0)
Programming models for extreme-scale computing (2013) (0)
Designing a Scalable Parallel System: the IBM SP2 (1995) (0)
Nicke-C extensions for programming on distributed-memory machines (1992) (0)
Predicting HPC parallel program performance based on LLVM compiler (2016) (0)
FULT: Fast User-Level Thread Scheduling Using Bit-Vectors (2018) (0)
ScaDL 2019 Keynote Talk (2019) (0)
Enabling Architectures for Large-Scale Applications (Presentation). (2010) (0)
Log parser FCE SCE TCE Output Engine Format template Key fields template Log Filter Layout template Query Condition User Interface Layer Analysis Engine Layer Log Parsing Layer Log Data Layer Key module Control info (2017) (0)
Pinpointing Crash-Consistency Bugs in the HPC I/O Stack: A Cross-layer Approach (2021) (0)

This paper list is powered by the following services:

Other Resources About Marc Snir

en.wikipedia.org

What Schools Are Affiliated With Marc Snir?

Marc Snir is affiliated with the following schools:

Marc Snir's Academic­Influence.com Rankings

Why Is Marc Snir Influential?

Marc Snir's Published Works

Published Works

Other Resources About Marc Snir

What Schools Are Affiliated With Marc Snir?

Marc Snir's AcademicInfluence.com Rankings