Saman Amarasinghe

Saman Amarasinghe's AcademicInfluence.com Rankings

Engineering: World Rank #6628, Historical Rank #7964
Electrical Engineering: World Rank #2001, Historical Rank #2104

Computer Science: World Rank #8748, Historical Rank #9198
Parallel Computing: World Rank #47, Historical Rank #49
Database: World Rank #5748, Historical Rank #5964

Saman Amarasinghe's Degrees

PhD Electrical Engineering and Computer Science Stanford University
Masters Electrical Engineering and Computer Science Stanford University
Bachelors Electrical Engineering and Computer Science Cornell University

Why Is Saman Amarasinghe Influential?

Saman Amarasinghe's Published Works

[Chart: number of citations in a given year to any of this author's works]

[Chart: total number of citations to the works this author published in a given year, highlighting the publication of the author's most important work(s)]

Published Works

StreamIt: A Language for Streaming Applications (2002) (1281)
The Raw Microprocessor: A Computational Fabric for Software Circuits and General-Purpose Programs (2002) (1021)
Halide: a language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines (2013) (1010)
Baring It All to Software: Raw Machines (1997) (678)
Maximizing Multiprocessor Performance with the SUIF Compiler (1996) (644)
Secure Execution via Program Shepherding (2002) (639)
SUIF: an infrastructure for research on parallelizing and optimizing compilers (1994) (627)
Exploiting coarse-grained task, data, and pipeline parallelism in stream programs (2006) (596)
An infrastructure for adaptive dynamic optimization (2003) (565)
OpenTuner: An extensible framework for program autotuning (2014) (479)
Evaluation of the Raw microprocessor: an exposed-wire-delay architecture for ILP and streams (2004) (458)
PetaBricks: a language and compiler for algorithmic choice (2009) (419)
Automatically patching errors in deployed software (2009) (418)
Kendo: efficient deterministic multithreading in software (2009) (405)
Exploiting superword level parallelism with multimedia instruction sets (2000) (404)
A stream compiler for communication-exposed architectures (2002) (373)
Meta optimization: improving compiler heuristics with machine learning (2003) (309)
Decoupling algorithms from schedules for easy optimization of image processing pipelines (2012) (272)
Communication optimization and code generation for distributed memory machines (1993) (271)
Space-time scheduling of instruction-level parallelism on a raw machine (1998) (271)
Bitwidth analysis with application to silicon compilation (2000) (258)
The tensor algebra compiler (2017) (255)
Data and computation transformations for multiprocessors (1995) (254)
DAWG: A Defense Against Cache Timing Attacks in Speculative Execution Processors (2018) (210)
Array-data flow analysis and its use in array privatization (1993) (210)
A Practical Approach to Exploiting Coarse-Grained Pipeline Parallelism in C Programs (2007) (199)
Predicting unroll factors using supervised classification (2005) (197)
The SUIF compiler for scalable parallel machines (1995) (189)
Detecting Coarse-Grain Parallelism Using an Interprocedural Parallelizing Compiler (1995) (180)
An empirical characterization of stream programs and its implications for language and compiler design (2010) (173)
Tiramisu: A Polyhedral Compiler for Expressing Fast and Portable Code (2018) (168)
Digital microfluidics using soft lithography (2006) (167)
Scalar operand networks: on-chip interconnect for ILP in partitioned architectures (2003) (163)
Abstraction layers for scalable microfluidic biocomputing (2008) (148)
Design and implementation of a dynamic optimization framework for windows (2000) (138)
Language and compiler support for auto-tuning variable-accuracy algorithms (2011) (137)
Parallelizing applications into silicon (1999) (128)
GraphIt: a high-performance graph DSL (2018) (122)
Transparent dynamic instrumentation (2012) (119)
Weld: A Common Runtime for High Performance Data Analytics (2016) (108)
Maps: a compiler-managed memory system for Raw machines (1999) (106)
Ithemal: Accurate, Portable and Fast Basic Block Throughput Estimation using Deep Neural Networks (2018) (104)
Autotuning algorithmic choice for input sensitivity (2015) (103)
An Overview of a Compiler for Scalable Parallel Machines (1993) (97)
Making caches work for graph analytics (2016) (93)
Cache aware optimization of stream programs (2005) (90)
Portable performance on heterogeneous architectures (2013) (87)
Increasing and detecting memory address congruence (2002) (81)
StreamIt: A Compiler for Streaming Applications (2002) (80)
The SUIF Compiler System: a Parallelizing and Optimizing Research Compiler (1994) (79)
Format abstraction for sparse tensor algebra compilers (2018) (78)
Dynamic cache contention detection in multi-threaded applications (2011) (77)
Scalar operand networks (2005) (77)
Halide: decoupling algorithms from schedules for high-performance image processing (2017) (77)
A unified framework for schedule and storage optimization (2001) (76)
An Overview of the SUIF Compiler for Scalable Parallel Machines (1995) (76)
Language and Compiler Design for Streaming Applications (2004) (75)
A 16-issue multiple-program-counter microprocessor with point-to-point scalar operand network (2003) (71)
Interprocedural parallelization analysis in SUIF (2005) (70)
Evaluating End-to-End Optimization for Data Analytics Applications in Weld (2018) (68)
Interprocedural Analysis for Parallelization (1995) (66)
Phased scheduling of stream programs (2003) (65)
Umbra: efficient and scalable memory shadowing (2010) (64)
Teleport messaging for distributed stream programs (2005) (63)
Computer-aided design for microfluidic chips based on multilayer soft lithography (2009) (61)
A lightweight streaming layer for multicore execution (2008) (59)
Convergent scheduling (2002) (58)
Linear analysis and optimization of stream programs (2003) (58)
How to Do a Million Watchpoints: Efficient Debugging Using Dynamic Instrumentation (2008) (56)
Dynamic native optimization of interpreters (2003) (56)
Parallelizing Compiler Techniques Based on Linear Inequalities (1997) (56)
The three pillars of machine programming (2018) (55)
Simit: A Language for Physical Simulation (2016) (54)
Siblingrivalry: online autotuning through local competitions (2012) (53)
Evaluation of IVR data collection UIs for untrained rural users (2010) (52)
MPEG-2 decoding in a stream programming language (2006) (49)
Multiprocessors from a software perspective (1996) (47)
FlexCache: A Framework for Flexible Compiler Generated Data Caching (2000) (45)
Searching the World Wide Web in Low-Connectivity Communities (2002) (43)
Maintaining consistency and bounding capacity of software code caches (2005) (43)
Hot Pages: Software Caching for Raw Microprocessors (1999) (42)
Dynamic expressivity with static optimization for streaming languages (2013) (42)
Softspec: Software-based Speculative Parallelism (2000) (42)
The Raw Compiler Project (1999) (41)
Tensor Algebra Compilation with Workspaces (2019) (39)
An overview of the SUIF compiler system (1990) (39)
Optimizing indirect memory references with milk (2016) (38)
Detection of false sharing using machine learning (2013) (37)
A Common Runtime for High Performance Data Analysis (2017) (37)
Optimizing stream programs using linear state space analysis (2005) (35)
Autotuning multigrid with PetaBricks (2009) (33)
Genetic Programming Applied to Compiler Heuristic Optimization (2003) (32)
Compiler Support for Scalable and Efficient Memory Systems (2001) (32)
Optimizing ordered graph algorithms with GraphIt (2019) (31)
Helium: lifting high-performance stencil kernels from stripped x86 binaries to halide DSL code (2015) (31)
Taco: A tool to generate tensor algebra kernels (2017) (31)
A sparse iteration space transformation framework for sparse tensor algebra (2020) (31)
Efficient pipelining of nested loops: unroll-and-squash (2002) (31)
Distributed Halide (2016) (30)
An Audio Wiki for Publishing User-Generated Content in the Developing World (2008) (29)
Exploiting vector parallelism in software pipelined loops (2005) (26)
Automating the construction of compiler heuristics using machine learning (2006) (25)
Automatic generation of efficient sparse tensor format conversion routines (2020) (25)
Memory bank disambiguation using modulo unrolling for Raw machines (1998) (25)
Ubiquitous memory introspection (2007) (24)
Aikido: accelerating shared data dynamic analyses (2012) (24)
goSLP: globally optimized superword level parallelism framework (2018) (23)
Multicore Performance Optimization Using Partner Cores (2011) (23)
Strength Reduction of Integer Division and Modulo Operations (2001) (22)
The Raw Processor: A Composeable 32-Bit Fabric for Embedded and General Purpose Computing (2001) (22)
General-purpose operating systems, such as Linux, (2007) (21)
Techniques for Increasing and Detecting Memory Alignment (2001) (19)
Efficient memory shadowing for 64-bit architectures (2010) (19)
GraphIt - A High-Performance DSL for Graph Analytics (2018) (19)
Unified compilation techniques for shared and distributed address space machines (1995) (19)
Interprocedural Parallelization Analysis: A Case Study (1995) (19)
Interprocedural analysis for parallelization: design and experience (1995) (18)
Weld: Rethinking the Interface Between Data-Intensive Applications (2017) (18)
BHive: A Benchmark Suite and Measurement Framework for Validating x86-64 Basic Block Performance Models (2019) (18)
Compilation of sparse array programming models (2021) (18)
VeGen: a vectorizer generator for SIMD and beyond (2021) (17)
A Productive Programming Environment for Stream Computing (2005) (17)
StreamJIT: a commensal compiler for high-performance stream programming (2014) (17)
A Common Backend for Hardware Acceleration on FPGA (2017) (16)
Phased Computation Graphs in the Polyhedral Model (2002) (15)
Multicores from the Compiler's Perspective: A Blessing or a Curse? (2005) (15)
A step towards unifying schedule and storage optimization (2007) (15)
A Unified Backend for Targeting FPGAs from DSLs (2018) (14)
TIRAMISU: A Polyhedral Compiler for Dense and Sparse Deep Learning (2020) (13)
Hyperparameter Tuning in Bandit-Based Adaptive Operator Selection (2012) (13)
Gigabit IP Routing on Raw (2002) (12)
A variational study of two-nucleon systems with lattice QCD (2021) (12)
Cimple: instruction and memory level parallelism: a DSL for uncovering ILP and MLP (2018) (12)
Seq: A High-Performance Language for Bioinformatics (2019) (12)
SUDS: Primitive Mechanisms for Memory Dependence Speculation (1999) (12)
Gloss: Seamless Live Reconfiguration and Reoptimization of Stream Programs (2018) (12)
PetaBricks: a language and compiler based on autotuning (2011) (11)
Optimizing Cache Performance for Graph Analytics (2016) (11)
Abstraction Layers for Scalable Microfluidic Biocomputers (2006) (11)
A common machine language for grid-based architectures (2002) (11)
Providing Web search capability for low-connectivity communities (2002) (10)
An efficient evolutionary algorithm for solving incrementally structured problems (2011) (10)
Simit (2016) (9)
(How) can programmers conquer the multicore menace? (2008) (9)
Manipulating lossless video in the compressed domain (2009) (8)
Programmable Microfluidics (2004) (8)
Compiling Graph Applications for GPUs with GraphIt (2021) (7)
Flexcache: a framework for compiler generated data caching (2001) (7)
A Python-based programming language for high-performance computational genomics (2021) (7)
Revec: program rejuvenation through revectorization (2019) (6)
Execution Model Enforcement Via Program Shepherding (2003) (6)
Tiramisu: A Code Optimization Framework for High Performance Systems (2018) (6)
Hot compilers for future hot chips (1995) (6)
Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of parallel programming (2013) (6)
BuildIt: A Type-Based Multi-stage Programming Framework for Code Generation in C++ (2021) (5)
Adapting Convergent Scheduling Using Machine-Learning (2003) (5)
Sparse Tensor Algebra Optimizations with Workspaces (2018) (5)
The Multiprocessor as a General-Purpose Processor: A Software Perspective (1996) (5)
Autoscheduling for sparse tensor algebra with an asymptotic cost model (2022) (5)
Tiled Multicore Processors (2009) (5)
Interprocedural Parallelization Analysis: Preliminary Results (1995) (4)
All you need is superword-level parallelism: systematic control-flow vectorization with SLP (2022) (4)
An Asymptotic Cost Model for Autoscheduling Sparse Tensor Programs (2021) (4)
How to build scalable on-chip ILP networks for a decentralized architecture (2000) (4)
Mapping Stream Programs into the Compressed Domain (2007) (4)
Unified Sparse Formats for Tensor Algebra Compilers (2018) (3)
Scalar Operand Networks: Design, Implementation, and Analysis (2004) (3)
A Unified Iteration Space Transformation Framework for Sparse and Dense Tensor Algebra (2019) (3)
Abstraction Layers for Scalable Microfluidic Biocomputers (Extended Version) (2006) (3)
Reliable Computation on Unreliable Hardware: Can We Have Our Digital Cake and Eat It? (2014) (3)
Self-defending software: Automatically patching security vulnerabilities (2009) (3)
Dynamic Sparse Tensor Algebra Compilation (2021) (2)
Learning automatic schedulers with projective reparameterization (2019) (2)
Proceedings of the ACM SIGPLAN 2008 Conference on Programming Language Design and Implementation, Tucson, AZ, USA, June 7-13, 2008 (2008) (2)
High-Productivity Stream Programming For High-Performance Systems (2005) (2)
Hitting the Sweet Spot for Streaming Languages: Dynamic Expressivity with Static Optimization (2012) (2)
ZettaBricks: A Language Compiler and Runtime System for Anyscale Computing (2015) (2)
6.172 Performance Engineering of Software Systems, Fall 2009 (2009) (2)
Cimple: Instruction and Memory Level Parallelism (2018) (2)
PROGRAM SYNTHESIS FOR SCIENTIFIC COMPUTING (2021) (2)
Interprocedural analysis for parallelization: preliminary results (1995) (2)
The Three Pillars of Machine-Based Programming (2018) (2)
Sparse Tensor Transpositions (2020) (2)
Software orchestration of instruction level parallelism on tiled processor architectures (2005) (1)
Ubiquitous Memory Introspection (Preliminary Manuscript) (2006) (1)
Domain-Specific Language Abstractions for Compression (2021) (1)
Tiramisu: A Polyhedral Compiler with A Scheduling Language for Targeting High Performance Systems (2018) (1)
Predicting Unroll Factors Using Nearest Neighbors (2004) (1)
A Python-based optimization framework for high-performance genomics (2020) (1)
High-bandwidth packet switching on the raw general-purpose architecture (2003) (1)
SALSA: A Domain Specific Architecture for Sequence Alignment (2020) (1)
MPEG-2 in a Stream Programming Language (2005) (1)
Execution model via program shepherding (2003) (1)
Outfoxing the Mammoths (2010) (1)
An Attempt to Generate Code for Symmetric Tensor Computations (2021) (1)
RawNet: Network Processing on the Raw Processor (2002) (1)
Ubiquitous Stream Programming to Facilitate the Migration to Multicore Architectures (1)
FlexCache: A Framework for Flexible Compiler Generated Data Caching (2000) (1)
Domain Specific Language for Exascale (2018) (0)
A Deep Learning Model for Loop Interchange (2023) (0)
Technical Report about Tiramisu: a Three-Layered Abstraction for Hiding Hardware Complexity from DSL Compilers (2018) (0)
Bit-Packing Optimization for StreamIt (2003) (0)
Weld: Rethinking the Interface Between Data-Intensive Libraries (2017) (0)
Achieving High Performance on Digital AlphaServers with the SUIF (1999) (0)
Compiler 2.0: Using Machine Learning to Modernize Compiler Technology (2020) (0)
Automatic Generation of Sparse Tensor Kernels with Workspaces (2018) (0)
Exploiting Vector Parallelism in Software Pipelined Loops (2005) (0)
An Audio Wiki for Building Local Repositories of Knowledge in the Developing World (2016) (0)
GrAPL 2020 Keynote: The GraphIt Universal Graph Framework: Achieving High Performance across Algorithms, Graph Types, and Architectures (2020) (0)
Compilers Without Borders (2016) (0)
Chapter 1: Interprocedural Parallelization Analysis: a Case Study (1995) (0)
Codon: A Compiler for High-Performance Pythonic Applications and DSLs (2023) (0)
The TEK Search Engine (2016) (0)
The sparse tensor algebra compiler (keynote) (2019) (0)
Selective Vectorization for Short-Vector Instructions (2009) (0)
Defying the Speed of Light: Wire-Exposed Architectures and Spatially-Aware Compilers (2003) (0)
Proceedings of the Second Workshop on Optimizing Stencil Computations (2014) (0)
Efficient Memory Shadowing for 64-bit Architectures (2010) (0)
Scalar Queue Conversion: Dynamic Single Assignment for Concurrent Scheduling (2003) (0)
Proceedings of the 2008 ACM SIGPLAN conference on Programming language design and implementation (2008) (0)
PriorityGraph: A Unified Programming Model for Optimizing Ordered Graph Algorithms (2019) (0)
Interprocedural Analysis in SUIF (2007) (0)
Defying the speed of light: a spatially-aware compiler for wire-exposed architectures (2002) (0)
6.035 Computer Language Engineering (SMA 5502), Fall 2002 (2002) (0)
StreamIt: A Language and Compiler for Communication-Exposed Architectures (2003) (0)
Compiler Support for Structured Data (2023) (0)
Stream Multicore Processors (2007) (0)
D2X: An eXtensible conteXtual Debugger for Modern DSLs (2023) (0)
Helium: lifting high-performance stencil kernels from stripped x86 binaries to halide DSL code (2015) (0)
WACO: Learning Workload-Aware Co-optimization of the Format and Schedule of a Sparse Tensor Program (2023) (0)
Cache Optimizations for Stream Programs by Janis Sermulips (2005) (0)
The Raw Fabric: A Technology for Rapid Embedded System Customization (2004) (0)
Autotuning multigrid with PetaBricks (2012) (0)
Proceedings of the 29th ACM SIGPLAN Conference on Programming Language Design and Implementation (2008) (0)
Compilation Techniques for Graph Algorithms on GPUs (2020) (0)
AEDAM: Whole Program Adaptive Error Detection and Mitigation—Year 3 Report (2019) (0)
MPEG-2 in a Stream Programming Language (2005) (0)
Genetic Programming Applied to (2007) (0)
Autotuning algorithmic choice for input sensitivity (2015) (0)
Compilation Techniques for Graph Algorithms on GPUs (2020) (0)
Hot Pages: Software Caching for Raw Microprocessors (1999) (0)
PCA RAW FABRIC: Architectural Prototyping, Demonstration and Evaluation (2007) (0)
WOSC 2014: second workshop on optimizing stencil computations (2014) (0)

What Schools Are Affiliated With Saman Amarasinghe?

Saman Amarasinghe is affiliated with the following schools:

Massachusetts Institute of Technology