Malcolm Slaney
#26,601
Most Influential Person Now
American electrical engineer
Malcolm Slaney's AcademicInfluence.com Rankings
Malcolm Slaneyengineering Degrees
Engineering
#1522
World Rank
#2228
Historical Rank
Electrical Engineering
#432
World Rank
#481
Historical Rank
Applied Physics
#1910
World Rank
#1944
Historical Rank
Download Badge
Engineering
Malcolm Slaney's Degrees
- PhD Electrical Engineering Stanford University
- Masters Electrical Engineering Stanford University
- Bachelors Electrical Engineering Stanford University
Why Is Malcolm Slaney Influential?
(Suggest an Edit or Addition)According to Wikipedia, Malcolm Slaney is an American electrical engineer, whose research has focused on machine perception and multimedia analysis. He is a Fellow of the IEEE for "contributions to perceptual signal processing and tomographic imaging". He is a consulting professor at the Stanford University Center for Computer Research in Music and Acoustics and an affiliate faculty member in the Electrical Engineering Department at the University of Washington.
Malcolm Slaney's Published Works
Published Works
- Principles of computerized tomographic imaging (2001) (3772)
- CNN architectures for large-scale audio classification (2016) (1652)
- Construction and evaluation of a robust multifeature speech/music discriminator (1997) (1019)
- Video Rewrite: driving visual speech with audio (1997) (755)
- Content-Based Music Information Retrieval: Current Directions and Future Challenges (2008) (668)
- Limitations of Imaging with First-Order Diffraction Tomography (1984) (593)
- An Efficient Implementation of the Patterson-Holdsworth Auditory Filter Bank (1997) (546)
- Attentional Selection in a Cocktail Party Environment Can Be Decoded from Single-Trial EEG. (2015) (527)
- Locality-Sensitive Hashing for Finding Nearest Neighbors [Lecture Notes] (2008) (305)
- Collaborative Filtering and the Missing at Random Assumption (2007) (282)
- Discrimination of speech from nonspeech based on multiscale spectro-temporal Modulations (2006) (255)
- Locality-Sensitive Hashing for Finding Nearest Neighbors (2008) (236)
- MSR Identity Toolbox v1.0: A MATLAB Toolbox for Speaker Recognition Research (2013) (235)
- A perceptual pitch detector (1990) (173)
- FaceSync: A Linear Operator for Measuring Synchronization of Video Facial Images and Audio Tracks (2000) (165)
- On the importance of time—a temporal representation of sound (1993) (160)
- Resolving tag ambiguity (2008) (155)
- Acoustic Chord Transcription and Key Extraction From Audio Using Key-Dependent HMMs Trained on Synthesized Audio (2008) (148)
- Semantic-audio retrieval (2002) (123)
- Image retrieval on large-scale image databases (2007) (109)
- Analysis of Minimum Distances in High-Dimensional Musical Spaces (2008) (107)
- Auditory model inversion for sound separation (1994) (101)
- Decoding the auditory brain with canonical component analysis (2018) (101)
- Baby Ears: a recognition system for affective vocalizations (1998) (101)
- Automatic audio morphing (1996) (100)
- Learning a Metric for Music Similarity (2008) (95)
- Learning Sparse Feature Representations for Music Annotation and Retrieval (2012) (89)
- BabyEars: A recognition system for affective vocalizations (2003) (85)
- A Classification-Based Polyphonic Piano Transcription Approach Using Learned Feature Representations (2011) (84)
- Mixtures of probability experts for audio retrieval and indexing (2002) (71)
- The Importance of Sequences in Musical Similarity (2006) (71)
- Web-Scale Multimedia Analysis: Does Content Matter? (2011) (70)
- PLSA on Large Scale Image Databases (2007) (70)
- A Comparison of Regularization Methods in Forward and Backward Models for Auditory Attention Decoding (2018) (67)
- Tell Me a Story (2012) (63)
- Optimal Parameters for Locality-Sensitive Hashing (2012) (63)
- Speech discrimination based on multiscale spectro-temporal modulations (2004) (59)
- Song Intersection by Approximate Nearest Neighbor Search (2006) (59)
- A critique of pure audition (1998) (54)
- The Quest for Ecological Validity in Hearing Science: What It Is, Why It Matters, and How to Advance It (2020) (54)
- PERCEPTUAL DISTANCE IN TIMBRE SPACE (2005) (53)
- A Unified System for Chord Transcription and Key Extraction Using Hidden Markov Models (2007) (52)
- Recommender Systems, Missing Data and Statistical Model Estimation (2011) (51)
- The thirteen colors of timbre (2005) (51)
- Fast Recognition of Remixed Music Audio (2007) (51)
- MACH1: nonuniform time-scale modification of speech (1998) (50)
- Artificial neural network features for speaker diarization (2014) (49)
- 3. Algorithms for Reconstruction with Nondiffracting Sources (2001) (48)
- during monaural and dichotic listening Neural coding of continuous speech in auditory cortex (2012) (47)
- Automatic Chord Recognition from Audio Using a HMM with Supervised Learning (2006) (45)
- Reliable tags using image similarity: mining specificity and expertise from large-scale multimedia databases (2009) (45)
- Correlograms and the Separation of Sounds (1990) (44)
- Similarity Based on Rating Data (2007) (41)
- Diffraction Tomography (1983) (38)
- Solving Demodulation as an Optimization Problem (2010) (35)
- Measuring playlist diversity for recommendation systems (2006) (34)
- Automatic chord recognition from audio using a supervised HMM trained with audio-from-symbolic data (2006) (32)
- Computational Models of Auditory Function (2001) (31)
- Low-power audio classification for ubiquitous sensor networks (2004) (31)
- Imaging with Diffraction Tomography (1985) (29)
- Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers (2017) (29)
- Being Literate with Large Document Collections: Observational Studies and Cost Structure Tradeoffs (2006) (28)
- Eye Gaze for Spoken Language Understanding in Multi-modal Conversational Interactions (2014) (27)
- Highly Accurate Mandarin Tone Classification In The Absence of Pitch Information (2014) (27)
- Improving the noise-robustness of mel-frequency cepstral coefficients for speech processing (2006) (27)
- Video rewrite: visual speech synthesis from video (1997) (27)
- A Study of Multimodal Addressee Detection in Human-Human-Computer Interaction (2015) (25)
- A non-negative framework for joint modeling of spectral structure and temporal dynamics in sound mixtures (2010) (25)
- Continuous visual vocabulary modelsfor pLSA-based scene recognition (2008) (24)
- Towards better performance with heterogeneous training data in acoustic modeling using deep neural networks (2014) (24)
- Speaker-independent vowel recognition: spectrograms versus cochleagrams (1990) (22)
- Multimedia edges: finding hierarchy in all dimensions (2001) (22)
- 7. Algebraic Reconstruction Algorithms (2001) (22)
- Pattern playback from 1950 to 1995 (1995) (20)
- Predicting success from music sales data: a statistical and adaptive approach (2006) (19)
- Auditory Measures for the Next Billion Users. (2020) (18)
- Comparing Local Feature Descriptors in pLSA-Based Image Models (2008) (18)
- Rapid Ocular Responses Are Modulated by Bottom-up-Driven Auditory Salience (2019) (18)
- Introduction to the special section on the 20th anniversary of the ACM international conference on multimedia (2013) (17)
- Using gaze patterns to study and predict reading struggles due to distraction (2011) (16)
- Precision-Recall Is Wrong for Multimedia (2011) (16)
- Analysis of Tomography Images of Bonded Fibre Networks to Measure Distributions of Fibre Segment Length and Fibre Orientation * * (2006) (15)
- COMPUTATIONAL MODEL OF THE LATERALISATION OF CLICKS AND THEIR ECHOES (1998) (15)
- The Relation of Eye Gaze and Face Pose: Potential Impact on Speech Recognition (2014) (15)
- Modeling Multitasking Users (2003) (14)
- The History and Future of CASA (2005) (14)
- Computer Vision for Human–Machine Interaction: Probabilistic Models of Verbal and Body Gestures (1998) (13)
- Image classification using the web graph (2010) (13)
- Characteristic contours of syllabic-level units in laughter (2013) (13)
- Gaze-enhanced speech recognition (2014) (13)
- Data driven suppression rule for speech enhancement (2013) (12)
- Connecting Deep Neural Networks to Physical, Perceptual, and Electrophysiological Auditory Signals (2018) (12)
- Unsupervised image ranking (2009) (12)
- Periodicity detection and localization using spike timing from the AER EAR (2009) (12)
- Hierarchical segmentation using latent semantic indexing in scale space (2001) (12)
- A timbre space for speech (2005) (11)
- A statistical model of timbre perception (2006) (11)
- Hierarchical Segmentation: Finding Changes in a Text Signal (2001) (11)
- A Comparison of Temporal Response Function Estimation Methods for Auditory Attention Decoding (2018) (10)
- Auditory stimulus-response modeling with a match-mismatch task (2020) (10)
- Multimodal addressee detection in multiparty dialogue systems (2015) (10)
- Simulation of One ’ s Own Voice in a Two-parameter Model (2014) (9)
- Connecting correlograms to neurophysiology and psychoacoustics (1997) (9)
- Pattern Playback in the 90s (1994) (9)
- Imaging with Higher Order Diffraction Tomography (1985) (8)
- A model of attention-driven scene analysis (2012) (8)
- Interactive signal processing documents (1990) (8)
- Content-BasedMusic Information Retrieval : Current Directions and Future Challenges (2008) (7)
- 6. Tomographic Imaging with Diffracting Sources (2001) (7)
- Measuring Information Understanding in Large Document Collections (2005) (7)
- Song Intersection by Approximate Nearest Neighbour Retrieval (2006) (6)
- The information content of demodulated speech (2010) (6)
- FastMPEG: time-scale modification of bit-compressed audio information (2001) (6)
- MACH 1 FOR NONUNIFORM TIME-SCALE MODIFICATION OF SPEECH : THEORY , TECHNIQUE , AND COMPARISONS (1998) (6)
- Pitch-gesture modeling using subband autocorrelation change detection (2013) (5)
- Deep Canonical Correlation Analysis For Decoding The Auditory Brain (2020) (5)
- A bipartite graph model for associating images and text (2006) (5)
- Using audio-visual information to understand speaker activity: Tracking active speakers on and off screen (2018) (5)
- Disentangling speech from surroundings in a neural audio codec (2022) (4)
- Probabilistic features for connecting eye gaze to spoken language understanding (2015) (4)
- Introduction to the Special Issue on Music Information Retrieval (2008) (4)
- QBT-Extended: An Annotated Dataset of Melodically Contoured Tapped Queries (2013) (4)
- A Cost Structure Analysis of Manual and Computer-supported Sensemaking Behavior (2005) (4)
- Determining the Euclidean Distance Between Two Steady State Sounds (2006) (4)
- Microwave Imaging with First Order Diffraction Tomography (2003) (4)
- 5. Aliasing Artifacts and Noise in CT Images (2001) (4)
- Towards mobile gaze-directed beamforming: a novel neuro-technology for hearing loss (2018) (3)
- Academia Meets Industry at the Multimedia Grand Challenge (2011) (3)
- VHP: Vibrotactile Haptics Platform for On-body Applications (2021) (3)
- Audio and Acoustic Signal Processing [In the Spotlight] (2011) (3)
- Visual representations of speech—A computer model based on correlation (1990) (3)
- Processing web-scale multimedia data (2010) (3)
- Decoding the auditory brain with canonical component analysis (2017) (3)
- Reconciliation of human and machine speech recognition performance (2009) (3)
- Measuring the Tools and Behaviors of Sensemaking (2004) (3)
- 4. Measurement of Projection Data—The Nondiffracting Case (2001) (2)
- Varying Time Constants and Gain Adaptation in Feature Extraction for Speech Processing (2007) (2)
- Decoding Auditory Attention (in Real Time) with EEG (2013) (2)
- Web-Scale Multimedia Processing and Applications [Scanning the Issue] (2012) (2)
- Temporal events in all dimensions and scales (2001) (2)
- Editorial: Special Section on Statistical and Perceptual Audio Processing (2006) (2)
- Neural Architecture Search for Energy Efficient Always-on Audio Models (2022) (1)
- Pay Attention, Please: Attention at the Telluride Neuromorphic Cognition Workshop (2012) (1)
- Rapid ocular responses are a robust marker for bottom-up driven auditory salience (2018) (1)
- : A RECOGNITION SYS'TEM FOR AFFECTIV (1998) (1)
- S6b.3 A Perceptual Pitch Detector (1990) (1)
- Understanding the Semantics of Media (2003) (1)
- Telluride Decoding Toolbox (2015) (1)
- Neural architecture search for energy-efficient always-on audio machine learning (2023) (1)
- Decoding Speech Sound Source Direction from Electroencephalography Data (2016) (1)
- Locality-Sensitive Hashing: Finding a Needle in a Haystack (2008) (1)
- 8. Reflection Tomography (2001) (1)
- Don't Click Here (2012) (1)
- The influence of pitch and noise on the discriminability of filterbank features (2014) (1)
- Eye gaze for understanding conversational speech (2014) (1)
- On the Estimation of Porosity in Composites by Oblique Angle Illumination (1983) (1)
- Auditory Attention : From Saliency to Models ( and Applications ) (2017) (0)
- IMAGING WITH DIFFRACTION TOMOGRAPHY (ACOUSTICS, MICROWAVE, OPTICS) (1985) (0)
- Analytic Worksheets: A Framework to Support Human Analysis of Large Streaming Data Volumes (2005) (0)
- Auditory Representation and Sound Separation (1991) (0)
- Scalable Audio-Content Analysis (2010) (0)
- Audio analysis for consumer and other industrial applications (2012) (0)
- BabyEars : A recognition system for affective vocalizations q (2002) (0)
- The story of AudioSapiana (2007) (0)
- Identifying authoritative sources of multimedia content: mining specificity and expertise from large-scale multimedia databases (2011) (0)
- Chapter 13 The History and Future of CASA (2004) (0)
- Multi-Channel Speech Denoising for Machine Ears (2022) (0)
- Automatic Audio Morp'hing (2004) (0)
- Optimal Parameters for Locality-Sensitive Hashing An algorithm is described that optimizes parameters for nearest-neighbor retrieval in web-scale search, at minimum computational cost. (2012) (0)
- Audiovisual Speech Processing: Image-based facial synthesis (2012) (0)
- Multimodal retrieval and ranking: more than waveforms (2010) (0)
- Plsa on Large Scale Image Databases Plsa on Large Scale Image Databases (2006) (0)
- Disentangling Speech from Surroundings with Neural Embeddings (2023) (0)
- 15 Probabilistic ~ 1 odels of Verbal and Body Gestures (2009) (0)
- 2. Signal Processing Fundamentals (2001) (0)
- Modification of Audible and Visual Speech (1999) (0)
- Identifying Authoritative Sources of Multimedia Content (2011) (0)
- Social network visualizations of streaming data: Design and use considerations (2005) (0)
- Acoustic differentiation of affective vocalizations to infants by mothers and fathers (1998) (0)
- VARYINGTIMECONSTANTSAND GAINADAPTATIONINFEATUREEXTRACTIONFOR SPEECHPROCESSING (2007) (0)
- Locality Sensitive Hashing for Large Music Databases (2008) (0)
- Editorial: Bio-inspired Audio Processing, Models and Systems (2019) (0)
- PLSAON LARGE SCALEIMAGEDATABASES (2007) (0)
- EYE GAZE FOR SPEECH RECOGNITION AND UNDERSTANDING (2014) (0)
- BACKPROJECTION WEIGHT DURING THE CT IMAGE RECONSTRUCTION (2017) (0)
This paper list is powered by the following services:
Other Resources About Malcolm Slaney
What Schools Are Affiliated With Malcolm Slaney?
Malcolm Slaney is affiliated with the following schools: