Saman Amarasinghe
#157,427
Most Influential Person Now
Saman Amarasinghe's AcademicInfluence.com Rankings
Saman Amarasingheengineering Degrees
Engineering
#6628
World Rank
#7964
Historical Rank
Electrical Engineering
#2001
World Rank
#2104
Historical Rank

Saman Amarasinghecomputer-science Degrees
Computer Science
#8748
World Rank
#9198
Historical Rank
Parallel Computing
#47
World Rank
#49
Historical Rank
Database
#5748
World Rank
#5964
Historical Rank

Download Badge
Engineering Computer Science
Saman Amarasinghe's Degrees
- PhD Electrical Engineering and Computer Science Stanford University
- Masters Electrical Engineering and Computer Science Stanford University
- Bachelors Electrical Engineering and Computer Science Stanford University
Why Is Saman Amarasinghe Influential?
(Suggest an Edit or Addition)Saman Amarasinghe's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- StreamIt: A Language for Streaming Applications (2002) (1281)
- The Raw Microprocessor: A Computational Fabric for Software Circuits and General-Purpose Programs (2002) (1021)
- Halide: a language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines (2013) (1010)
- Baring It All to Software: Raw Machines (1997) (678)
- Maximizing Multiprocessor Performance with the SUIF Compiler (1996) (644)
- Secure Execution via Program Shepherding (2002) (639)
- SUIF: an infrastructure for research on parallelizing and optimizing compilers (1994) (627)
- Exploiting coarse-grained task, data, and pipeline parallelism in stream programs (2006) (596)
- An infrastructure for adaptive dynamic optimization (2003) (565)
- OpenTuner: An extensible framework for program autotuning (2014) (479)
- Evaluation of the Raw microprocessor: an exposed-wire-delay architecture for ILP and streams (2004) (458)
- PetaBricks: a language and compiler for algorithmic choice (2009) (419)
- Automatically patching errors in deployed software (2009) (418)
- Kendo: efficient deterministic multithreading in software (2009) (405)
- Exploiting superword level parallelism with multimedia instruction sets (2000) (404)
- A stream compiler for communication-exposed architectures (2002) (373)
- Meta optimization: improving compiler heuristics with machine learning (2003) (309)
- Decoupling algorithms from schedules for easy optimization of image processing pipelines (2012) (272)
- Communication optimization and code generation for distributed memory machines (1993) (271)
- Space-time scheduling of instruction-level parallelism on a raw machine (1998) (271)
- Bidwidth analysis with application to silicon compilation (2000) (258)
- The tensor algebra compiler (2017) (255)
- Data and computation transformations for multiprocessors (1995) (254)
- DAWG: A Defense Against Cache Timing Attacks in Speculative Execution Processors (2018) (210)
- Array-data flow analysis and its use in array privatization (1993) (210)
- A Practical Approach to Exploiting Coarse-Grained Pipeline Parallelism in C Programs (2007) (199)
- Predicting unroll factors using supervised classification (2005) (197)
- The suif compiler for scalable parallel machines (1995) (189)
- Detecting Coarse - Grain Parallelism Using an Interprocedural Parallelizing Compiler (1995) (180)
- An empirical characterization of stream programs and its implications for language and compiler design (2010) (173)
- Tiramisu: A Polyhedral Compiler for Expressing Fast and Portable Code (2018) (168)
- Digital microfluidics using soft lithography. (2006) (167)
- Scalar operand networks: on-chip interconnect for ILP in partitioned architectures (2003) (163)
- Abstraction layers for scalable microfluidic biocomputing (2008) (148)
- Design and implementation of a dynamic optimization framework for windows (2000) (138)
- Language and compiler support for auto-tuning variable-accuracy algorithms (2011) (137)
- Parallelizing applications into silicon (1999) (128)
- GraphIt: a high-performance graph DSL (2018) (122)
- Transparent dynamic instrumentation (2012) (119)
- Weld : A Common Runtime for High Performance Data Analytics (2016) (108)
- Maps: a compiler-managed memory system for Raw machines (1999) (106)
- Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks (2018) (104)
- Autotuning algorithmic choice for input sensitivity (2015) (103)
- An Overview of a Compiler for Scalable Parallel Machines (1993) (97)
- Making caches work for graph analytics (2016) (93)
- Cache aware optimization of stream programs (2005) (90)
- Portable performance on heterogeneous architectures (2013) (87)
- Increasing and detecting memory address congruence (2002) (81)
- StreamIt: A Compiler for Streaming Applications ⁄ (2002) (80)
- The SUIF Compiler System: a Parallelizing and Optimizing Research Compiler (1994) (79)
- Format abstraction for sparse tensor algebra compilers (2018) (78)
- Dynamic cache contention detection in multi-threaded applications (2011) (77)
- Scalar operand networks (2005) (77)
- Halide: decoupling algorithms from schedules for high-performance image processing (2017) (77)
- A unified framework for schedule and storage optimization (2001) (76)
- An Overview of the SUIF Compiler for Scalable Parallel Machines (1995) (76)
- Language and Compiler Design for Streaming Applications (2004) (75)
- A 16-issue multiple-program-counter microprocessor with point-to-point scalar operand network (2003) (71)
- Interprocedural parallelization analysis in SUIF (2005) (70)
- Evaluating End-to-End Optimization for Data Analytics Applications in Weld (2018) (68)
- Interprocedural Analysis for Parallelization (1995) (66)
- Phased scheduling of stream programs (2003) (65)
- Umbra: efficient and scalable memory shadowing (2010) (64)
- Teleport messaging for distributed stream programs (2005) (63)
- Computer-aided design for microfluidic chips based on multilayer soft lithography (2009) (61)
- A lightweight streaming layer for multicore execution (2008) (59)
- Convergent scheduling (2002) (58)
- Linear analysis and optimization of stream programs (2003) (58)
- How to Do a Million Watchpoints: Efficient Debugging Using Dynamic Instrumentation (2008) (56)
- Dynamic native optimization of interpreters (2003) (56)
- Parallelizing Compiler Techniques Based on Linear Inequalities (1997) (56)
- The three pillars of machine programming (2018) (55)
- Simit: A Language for Physical Simulation (2016) (54)
- Siblingrivalry: online autotuning through local competitions (2012) (53)
- Evaluation of IVR data collection UIs for untrained rural users (2010) (52)
- MPEG-2 decoding in a stream programming language (2006) (49)
- Multiprocessors from a software perspective (1996) (47)
- FlexCache: A Framework for Flexible Compiler Generated Data Caching (2000) (45)
- Searching the World Wide Web in Low-Connectivity Communities (2002) (43)
- Maintaining consistency and bounding capacity of software code caches (2005) (43)
- Hot Pages: Software Caching for Raw Microprocessors (1999) (42)
- Dynamic expressivity with static optimization for streaming languages (2013) (42)
- Softspec: Software-based Speculative Parallelism (2000) (42)
- The Raw Compiler Project (1999) (41)
- Tensor Algebra Compilation with Workspaces (2019) (39)
- An overview of the suif compiler system (1990) (39)
- Optimizing indirect memory references with milk (2016) (38)
- Detection of false sharing using machine learning (2013) (37)
- A Common Runtime for High Performance Data Analysis (2017) (37)
- Optimizing stream programs using linear state space analysis (2005) (35)
- Autotuning multigrid with PetaBricks (2009) (33)
- Genetic Programming Applied to Compiler Heuristic Optimization (2003) (32)
- Compiler Support for Scalable and Efficient Memory Systems (2001) (32)
- Optimizing ordered graph algorithms with GraphIt (2019) (31)
- Helium: lifting high-performance stencil kernels from stripped x86 binaries to halide DSL code (2015) (31)
- Taco: A tool to generate tensor algebra kernels (2017) (31)
- A sparse iteration space transformation framework for sparse tensor algebra (2020) (31)
- Efficient pipelining of nested loops: unroll-and-squash (2002) (31)
- Distributed Halide (2016) (30)
- An Audio Wiki for Publishing User-Generated Content in the Developing World (2008) (29)
- Exploiting vector parallelism in software pipelined loops (2005) (26)
- Automating the construction of compiler heuristics using machine learning (2006) (25)
- Automatic generation of efficient sparse tensor format conversion routines (2020) (25)
- Memory bank disambiguation using modulo unrolling for Raw machines (1998) (25)
- Ubiquitous memory introspection (2007) (24)
- Aikido: accelerating shared data dynamic analyses (2012) (24)
- goSLP: globally optimized superword level parallelism framework (2018) (23)
- Multicore Performance Optimization Using Partner Cores (2011) (23)
- Strength Reduction of Integer Division and Modulo Operations (2001) (22)
- The Raw Processor: A Composeable 32-Bit Fabric for Embedded and General Purpose Computing (2001) (22)
- General-purpose operating systems, such as Linux, (2007) (21)
- Techniques for Increasing and Detecting Memory Alignment (2001) (19)
- Efficient memory shadowing for 64-bit architectures (2010) (19)
- GraphIt - A High-Performance DSL for Graph Analytics (2018) (19)
- Unified compilation techniques for shared and distributed address space machines (1995) (19)
- Interprocedural Parallelization Analysis: A Case Study (1995) (19)
- Interprocedural analysis for parallelization: design and experience (1995) (18)
- Weld: Rethinking the Interface Between Data-Intensive Applications (2017) (18)
- BHive: A Benchmark Suite and Measurement Framework for Validating x86-64 Basic Block Performance Models (2019) (18)
- Compilation of sparse array programming models (2021) (18)
- VeGen: a vectorizer generator for SIMD and beyond (2021) (17)
- A Productive Programming Environment for Stream Computing (2005) (17)
- StreamJIT: a commensal compiler for high-performance stream programming (2014) (17)
- A Common Backend for Hardware Acceleration on FPGA (2017) (16)
- Phased Computation Graphs in the Polyhedral Model (2002) (15)
- Multicores from the Compiler's Perspective: A Blessing or a Curse? (2005) (15)
- A step towards unifying schedule and storage optimization (2007) (15)
- A Unified Backend for Targeting FPGAs from DSLs (2018) (14)
- TIRAMISU: A Polyhedral Compiler for Dense and Sparse Deep Learning (2020) (13)
- Hyperparameter Tuning in Bandit-Based Adaptive Operator Selection (2012) (13)
- Gigabit IP Routing on Raw (2002) (12)
- A variational study of two-nucleon systems with lattice QCD (2021) (12)
- Cimple: instruction and memory level parallelism: a DSL for uncovering ILP and MLP (2018) (12)
- Seq: A High-Performance Language for Bioinformatics (2019) (12)
- SUDS: Primitive Mechanisms for Memory Dependence Speculation (1999) (12)
- Gloss: Seamless Live Reconfiguration and Reoptimization of Stream Programs (2018) (12)
- PetaBricks: a language and compiler based on autotuning (2011) (11)
- Optimizing Cache Performance for Graph Analytics (2016) (11)
- Abstraction Layers for Scalable Microfluidic Biocomputers (2006) (11)
- A common machine language for grid-based architectures (2002) (11)
- Providing Web search capability for low-connectivity communities (2002) (10)
- An efficient evolutionary algorithm for solving incrementally structured problems (2011) (10)
- Simit (2016) (9)
- (How) can programmers conquer the multicore menace? (2008) (9)
- Manipulating lossless video in the compressed domain (2009) (8)
- Programmable Microfluidics (2004) (8)
- Compiling Graph Applications for GPU s with GraphIt (2021) (7)
- Flexcache: a framework for compiler generated data caching (2001) (7)
- A Python-based programming language for high-performance computational genomics (2021) (7)
- Revec: program rejuvenation through revectorization (2019) (6)
- Execution Model Enforcement Via Program Shepherding (2003) (6)
- Tiramisu: A Code Optimization Framework for High Performance Systems (2018) (6)
- Hot compilers for future hot chips (1995) (6)
- Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of parallel programming (2013) (6)
- BuildIt: A Type-Based Multi-stage Programming Framework for Code Generation in C++ (2021) (5)
- Adapting Convergent Scheduling Using Machine-Learning (2003) (5)
- Sparse Tensor Algebra Optimizations with Workspaces (2018) (5)
- The Multiprocessor as a General-Purpose Processor: A Software Perspective (1996) (5)
- Autoscheduling for sparse tensor algebra with an asymptotic cost model (2022) (5)
- Tiled Multicore Processors (2009) (5)
- Interprocedural Parallelization Analysis: Preliminary Results (1995) (4)
- All you need is superword-level parallelism: systematic control-flow vectorization with SLP (2022) (4)
- An Asymptotic Cost Model for Autoscheduling Sparse Tensor Programs (2021) (4)
- How to build scalable on-chip ILP networks for a decentralized architecture (2000) (4)
- Mapping Stream Programs into the Compressed Domain (2007) (4)
- Unified Sparse Formats for Tensor Algebra Compilers (2018) (3)
- Scalar Operand Networks: Design, Implementation, and Analysis (2004) (3)
- A Unified Iteration Space Transformation Framework for Sparse and Dense Tensor Algebra (2019) (3)
- Abstraction Layers for Scalable Microfluidic Biocomputers (Extended Version) (2006) (3)
- Reliable Computation on Unreliable Hardware : Can We Have Our Digital Cake and Eat It ? (2014) (3)
- Self-defending software: Automatically patching security vulnerabilities (2009) (3)
- Dynamic Sparse Tensor Algebra Compilation (2021) (2)
- Learning automatic schedulers with projective reparameterization (2019) (2)
- Proceedings of the ACM SIGPLAN 2008 Conference on Programming Language Design and Implementation, Tucson, AZ, USA, June 7-13, 2008 (2008) (2)
- High-Productivity Stream Programming For High-Performance Systems (2005) (2)
- Hitting the Sweet Spot for Streaming Languages : Dynamic Expressivity with Static Optimization (2012) (2)
- ZettaBricks: A Language Compiler and Runtime System for Anyscale Computing (2015) (2)
- 6.172 Performance Engineering of Software Systems, Fall 2009 (2009) (2)
- Cimple: Instruction and Memory Level Parallelism (2018) (2)
- PROGRAM SYNTHESIS FOR SCIENTIFIC COMPUTING (2021) (2)
- Interprocedural analysis for parallelization: preliminary results (1995) (2)
- The Three Pillars of Machine-Based Programming (2018) (2)
- Sparse Tensor Transpositions (2020) (2)
- Software orchestration of instruction level parallelism on tiled processor architectures (2005) (1)
- Ubiquitous Memory Introspection (Preliminary Manuscript) (2006) (1)
- Domain-Specific Language Abstractions for Compression (2021) (1)
- Tiramisu : A Polyhedral Compiler with A Scheduling Language for Targeting High Performance Systems Riyadh (2018) (1)
- Predicting Unroll Factors Using Nearest Neighbors (2004) (1)
- A Python-based optimization framework for high-performance genomics (2020) (1)
- High-bandwidth packet switching on the raw general-purpose architecture (2003) (1)
- SALSA: A Domain Specific Architecture for Sequence Alignment (2020) (1)
- MPEG-2 in a Stream Programming Language (2005) (1)
- Execution model via program shepherding (2003) (1)
- Outfoxing the Mammoths (2010) (1)
- An Attempt to Generate Code for Symmetric Tensor Computations (2021) (1)
- RawNet: Network Processing on the Raw Processor (2002) (1)
- Ubiquitous Stream Programming to Facilitate the Migration to Multicore Architectures (1)
- FlexCa he : A Framework for FlexibleCompiler Generated Data Ca hingCsaba (2000) (1)
- Domain Specific Language for Exascale (2018) (0)
- A Deep Learning Model for Loop Interchange (2023) (0)
- Technical Report about Tiramisu: a Three-Layered Abstraction for Hiding Hardware Complexity from DSL Compilers (2018) (0)
- Bit-Packing Optimization for StreamIt (2003) (0)
- Weld : Rethinking the Interface Between Data-Intensive Libraries (2017) (0)
- Achieving High Performance on Digital AlphaServerswith the SUIF (1999) (0)
- Compiler 2.0: Using Machine Learning to Modernize Compiler Technology (2020) (0)
- Automatic Generation of Sparse Tensor Kernels with Workspaces (2018) (0)
- Exploiting Vector Parallelism in Software Pipelined LoopsTopology Control (2005) (0)
- An Audio Wiki for Building Local Repositories of Knowledge in the Developing World (2016) (0)
- GrAPL 2020 Keynote Speaker The GraphIt Universal Graph Framework: Achieving HighPerformance across Algorithms, Graph Types, and Architectures (2020) (0)
- Compilers Without Borders (2016) (0)
- Chapter 1: Interprocedural Parallelization Analysis: a Case Study (1995) (0)
- Codon: A Compiler for High-Performance Pythonic Applications and DSLs (2023) (0)
- The TEK Search Engine (2016) (0)
- The sparse tensor algebra compiler (keynote) (2019) (0)
- Selective Vectorization for Short-Vector Instructions (2009) (0)
- Defying the Speed of Light : Wire-Exposed Architectures and Spatially-Aware Compilers (2003) (0)
- Proceedings of the Second Workshop on Optimizing Stencil Computations (2014) (0)
- Efficient Memory Shadowing for 64-bit Architectures Citation (2010) (0)
- Scalar Queue Conversion: Dynamic Single Assignment for Concurrent Scheduling (2003) (0)
- Proceedings of the 2008 ACM SIGPLAN conference on Programming language design and implementation (2008) (0)
- PriorityGraph: A Unified Programming Model for Optimizing Ordered Graph Algorithms (2019) (0)
- Interprocedural Analysis in SUIF (2007) (0)
- Defying the speed of light:: a spatially-aware compiler for wire-exposed architectures (2002) (0)
- 6.035 Computer Language Engineering (SMA 5502), Fall 2002 (2002) (0)
- StreamIt: A Language and Compiler for Communication-Exposed Architectures (2003) (0)
- Compiler Support for Structured Data (2023) (0)
- Stream Multicore Processors (2007) (0)
- D2X: An eXtensible conteXtual Debugger for Modern DSLs (2023) (0)
- Helium : lifting high-performance stencil kernels from stripped x 86 binaries to halide DSL code Citation (2015) (0)
- WACO: Learning Workload-Aware Co-optimization of the Format and Schedule of a Sparse Tensor Program (2023) (0)
- Cache Optimizations for Stream Programs by Janis Sermulips (2005) (0)
- The Raw Fabric: A Technology for Rapid Embedded System Customization (2004) (0)
- Autotuning multigrid with PetaBricks Citation (2012) (0)
- Proceedings of the 29th ACM SIGPLAN Conference on Programming Language Design and Implementation (2008) (0)
- Compilation Techniques for Graph Algorithms on GPUs (2020) (0)
- AEDAM: Whole Program Adaptive Error Detection and Mitigation—Year 3 Report (2019) (0)
- Computer Science and Artificial Intelligence Laboratory MPEG-2 in a Stream Programming Language (2005) (0)
- Geneti Programming Applied to (2007) (0)
- Autotuning algorithmic choice for input sensitivity Citation (2015) (0)
- Compliation Techniques for Graphs Algorithms on GPUs (2020) (0)
- Hot Pages : Software Caching for Raw Microprocessors Csaba (1999) (0)
- PCA RAW FABRIC: Architectural Prototyping, Demonstration and Evaluation (2007) (0)
- WOSC 2014: second workshop on optimizing stencil computations (2014) (0)
This paper list is powered by the following services:
What Schools Are Affiliated With Saman Amarasinghe?
Saman Amarasinghe is affiliated with the following schools: