Daniel Patrick Whittlesey Ellis

Daniel Patrick Whittlesey Ellis's AcademicInfluence.com Rankings

Engineering

#3474

World Rank

#4561

Historical Rank

Electrical Engineering

#787

World Rank

#855

Historical Rank

engineering Degrees

Daniel Patrick Whittlesey Ellis

Computer Science

#4491

World Rank

#4737

Historical Rank

Computational Linguistics

#445

World Rank

#452

Historical Rank

Machine Learning

#897

World Rank

#909

Historical Rank

Database

#1704

World Rank

#1786

Historical Rank

computer-science Degrees

Download Badge

Engineering
Computer Science

Daniel Patrick Whittlesey Ellis's Degrees

PhD Electrical Engineering Stanford University
Masters Electrical Engineering Stanford University
Bachelors Electrical Engineering Stanford University

Why Is Daniel Patrick Whittlesey Ellis Influential?

(Suggest an Edit or Addition)

(See a Problem?)

Daniel Patrick Whittlesey Ellis's Published Works

Number of citations in a given year to any of this author's works

Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author

Published Works

Audio Set: An ontology and human-labeled dataset for audio events (2017) (1812)
CNN architectures for large-scale audio classification (2016) (1652)
librosa: Audio and Music Signal Analysis in Python (2015) (1538)
The Million Song Dataset (2011) (1226)
Tandem connectionist feature extraction for conventional HMM systems (2000) (819)
The ICSI Meeting Corpus (2003) (768)
Prediction-driven computational auditory scene analysis (1996) (429)
Speech and Audio Signal Processing - Processing and Perception of Speech and Music, Second Edition (1999) (401)
Identifying `Cover Songs' with Chroma Features and Dynamic Programming Beat Tracking (2007) (396)
A Large-Scale Evaluation of Acoustic and Subjective Music-Similarity Measures (2004) (388)
MIR_EVAL: A Transparent Implementation of Common MIR Metrics (2014) (387)
Song-Level Features and Support Vector Machines for Music Classification (2005) (341)
Model-Based Expectation-Maximization Source Separation and Localization (2010) (311)
Consumer video understanding: a benchmark database and an evaluation of human and machine performance (2011) (290)
A Discriminative Model for Polyphonic Piano Transcription (2007) (270)
Feed-Forward Networks with Attention Can Solve Some Long-Term Memory Problems (2015) (258)
Beat Tracking with Dynamic Programming (2006) (258)
Signal Processing for Music Analysis (2011) (249)
Chord segmentation and recognition using EM-trained hidden markov models (2003) (228)
Melody Transcription From Music Audio: Approaches and Evaluation (2007) (224)
Melody Extraction from Polyphonic Music Signals: Approaches, applications, and challenges (2014) (204)
The auditory organization of speech and other sources in listeners and computational models (2001) (203)
The Quest for Ground Truth in Musical Artist Similarity (2002) (196)
Computational Analysis of Sound Scenes and Events (2017) (193)
Decoding speech in the presence of other sources (2005) (189)
A Web-Based Game for Collecting Music Metadata (2008) (187)
Locating singing voice segments within music signals (2001) (183)
Speech and Audio Signal Processing (2011) (176)
Beat Tracking by Dynamic Programming (2007) (172)
INSIGHTS INTO SPOKEN LANGUAGE GLEANED FROM PHONETIC TRANSCRIPTION OF THE SWITCHBOARD CORPUS (1996) (168)
The Meeting Project at ICSI (2001) (164)
Ground-truth transcriptions of real music from force-aligned MIDI syntheses (2003) (159)
Support vector machine active learning for music retrieval (2006) (151)
Laughter Detection in Meetings (2004) (147)
Kodak's consumer video benchmark data set: concept definition and annotation (2007) (139)
Spectral vs. spectro-temporal features for acoustic event detection (2011) (139)
Large-scale multimodal semantic concept detection for consumer video (2007) (135)
Speech/music discrimination based on posterior probability features (1999) (133)
USING VOICE SEGMENTS TO IMPROVE ARTIST CLASSIFICATION OF MUSIC (2002) (130)
Classifying Music Audio with Timbral and Chroma Features (2007) (130)
Minimal-impact audio-based personal archives (2004) (124)
Frequency-domain linear prediction for temporal features (2003) (117)
Multispeaker speech activity detection for the ICSI meeting recorder (2001) (116)
Multiple-Instance Learning for Music Information Retrieval (2008) (116)
Unsupervised Learning of Semantic Audio Representations (2017) (116)
General-purpose Tagging of Freesound Audio with AudioSet Labels: Task Description, Dataset, and Baseline (2018) (115)
Active Learning for Interactive Multimedia Retrieval (2008) (111)
Columbia-UCF TRECVID2010 Multimedia Event Detection: Combining Multiple Modalities, Contextual Concepts, and Temporal Matching (2010) (111)
Pushing the envelope - aside [speech recognition] (2005) (107)
Audio-Based Semantic Concept Classification for Consumer Video (2010) (103)
Large-Scale Cover Song Recognition Using the 2D Fourier Transform Magnitude (2012) (100)
An EM Algorithm for Localizing Multiple Sound Sources in Reverberant Environments (2006) (97)
Tandem acoustic modeling in large-vocabulary recognition (2001) (96)
Feature extraction using non-linear transformation for robust speech recognition on the Aurora database (2000) (95)
Model-Based Monaural Source Separation Using a Vector-Quantized Phase-Vocoder Representation (2006) (91)
Autoregressive Modeling of Temporal Envelopes (2007) (90)
Automatic Record Reviews (2004) (88)
Large-scale cover song recognition using hashed chroma landmarks (2011) (88)
Anchor space for classification and similarity measurement of music (2003) (84)
Speech separation using speaker-adapted eigenvoice speech models (2010) (81)
Using knowledge to organize sound: The prediction-driven approach to computational auditory scene analysis and its application to speech/nonspeech mixtures (1999) (81)
Chord Recognition and Segmentation Using EM-trained Hidden Markov Models (2003) (80)
Noise Robust Pitch Tracking by Subband Autocorrelation Classification (2012) (79)
The million song dataset challenge (2012) (78)
Learning Sound Event Classifiers from Web Audio with Noisy Labels (2019) (78)
Evaluation of Distance Measures Between Gaussian Mixture Models of MFCCs (2007) (76)
Analyzing Song Structure with Spectral Clustering (2014) (75)
Transcribing Multi-Instrument Polyphonic Music With Hierarchical Eigeninstruments (2011) (74)
Audio tagging with noisy labels and minimal supervision (2019) (72)
Size matters: an empirical study of neural network training for large vocabulary continuous speech recognition (1999) (72)
Multi-stream speech recognition: ready for prime time? (1999) (71)
Connectionist speech recognition of Broadcast News (2002) (70)
Multiband audio modeling for single-channel acoustic source separation (2004) (70)
Short-term audio-visual atoms for generic video concept classification (2009) (68)
Model-Based Scene Analysis (2005) (66)
Cover song detection: From high scores to general classification (2010) (64)
Exploring Low Cost Laser Sensors to Identify Flying Insect Species (2015) (63)
Classification-based melody transcription (2006) (63)
Content-Aware Collaborative Music Recommendation Using Pre-trained Neural Networks (2015) (62)
A Classification Approach to Melody Transcription (2005) (62)
Multimodal Segmentation of Lifelog Data (2007) (61)
LP-TRAP: linear predictive temporal patterns (2004) (60)
Pitch-based emphasis detection for characterization of meeting recordings (2003) (58)
Using mutual information to design feature combinations (2000) (56)
Fingerprinting to Identify Repeated Sound Events in Long-Duration Personal Audio Recordings (2007) (56)
Using Broad Phonetic Group Experts for Improved Speech Recognition (2007) (56)
Toward Evaluation Techniques for Music Similarity (2003) (55)
Sound texture modelling with linear prediction in both time and frequency domains (2003) (52)
Real-time CSound: Software Synthesis with Sensing and Control (1990) (51)
Detecting Alarm Sounds (2001) (51)
Multi-channel source separation by factorial HMMs (2003) (51)
A simple correlation-based model of intelligibility for nonlinear speech enhancement and separation (2009) (50)
All for one: feature combination for highly channel-degraded speech activity detection (2013) (50)
Improving Universal Sound Separation Using Sound Classification (2019) (50)
Speech enhancement by sparse, low-rank, and dictionary spectrogram decomposition (2013) (49)
A computer implementation of psychoacoustic grouping rules (1993) (49)
Dialect and Accent Recognition Using Phonetic-Segmentation Supervectors (2011) (49)
Estimating single-channel source separation masks: relevance vector machine classifiers vs. pitch-based masking (2006) (48)
CONNECTIONIST FEATURE EXTRACTION FOR CONVENTIONAL HMM SYSTEMS (1999) (47)
Inharmonic speech reveals the role of harmonicity in the cocktail party problem (2018) (47)
Classifying soundtracks with audio texture features (2011) (47)
Audio fingerprinting to identify multiple videos of an event (2010) (47)
Detecting sound events in basketball video archive (2001) (46)
Mid-level representations for Computational Auditory Scene Analysis (1995) (46)
What’s all the Fuss about Free Universal Sound Separation Data? (2020) (45)
AUDIO MUSIC MOOD CLASSIFICATION USING SUPPORT VECTOR MACHINE (2007) (45)
Applying Machine Learning and Audio Analysis Techniques to Insect Recognition in Intelligent Traps (2013) (45)
Audio information access from meeting rooms (2003) (45)
Speech separation in humans and machines (2005) (44)
Learning to segment songs with ordinal linear discriminant analysis (2014) (42)
Quantitative Analysis of a Common Audio Similarity Measure (2009) (40)
Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds (2020) (40)
Investigations into tandem acoustic modeling for the Aurora task (2001) (39)
EM Localization and Separation using Interaural Level and Phase Cues (2007) (38)
Cross-correlation of beat-synchronous representations for music similarity (2008) (38)
Accessing Minimal-Impact Personal Audio Archives (2006) (37)
The 2007 LabROSA Cover Song Detection System (2007) (36)
Speaker turn segmentation based on between-channel differences (2004) (36)
IBM Research and Columbia University TRECVID-2011 Multimedia Event Detection (MED) System (2011) (36)
Evaluating Source Separation Algorithms With Reverberant Speech (2010) (35)
A Probabilistic Subspace Model for Multi-instrument Polyphonic Transcription (2010) (35)
SONG-LEVEL FEATURES AND SVMS FOR MUSIC CLASSIFICATION (2005) (35)
PREDICTION-DRIVEN COMPUTATIONAL AUDITORY SCENE ANALYSIS FOR DENSE SOUND MIXTURES (1996) (35)
Structured Prediction Models for Chord Transcription of Music Audio (2009) (35)
Eigenrhythms: Drum pattern basis sets for classification and generation (2004) (35)
Features for segmenting and classifying long-duration recordings of "personal" audio (2004) (34)
Identifying "Cover Songs" with Beat-Synchronous Chroma Features (2006) (34)
Call detection and extraction using Bayesian inference (2006) (34)
Optimizing DTW-based audio-to-MIDI alignment and matching (2016) (33)
An Overview of the SPRACH System for the Transcription of Broadcast News (1999) (33)
Large-Scale Content-Based Matching of MIDI and Audio Files (2015) (33)
Extracting information from music audio (2006) (33)
Decoding speech in the presence of other sound sources (2000) (33)
Selection, parameter estimation, and discriminative training of hidden Markov models for general audio modeling (2003) (32)
AVA-Speech: A Densely Labeled Dataset of Speech Activity in Movies (2018) (31)
Learning the meaning of music (2005) (31)
STREAM COMBINATION BEFORE AND/OR AFTER THE ACOUSTIC MODEL (1999) (30)
A tempo-insensitive distance measure for cover song identification based on chroma features (2008) (30)
Leveraging repetition for improved automatic lyric transcription in popular music (2014) (29)
The Benefit of Temporally-Strong Labels in Audio Event Classification (2021) (29)
The weft: a representation for periodic sounds (1997) (29)
Speech feature smoothing for robust ASR (2005) (29)
Detecting local semantic concepts in environmental sounds using Markov model based clustering (2010) (28)
Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019) (2018) (28)
Large-scale audio event discovery in one million YouTube videos (2017) (27)
Eavesdropping on the Arctic: Automated bioacoustics reveal dynamics in songbird breeding phenology (2018) (27)
Beta Process Sparse Nonnegative Matrix Factorization for Music (2013) (26)
Automatically Extracting Performance Data from Recordings of Trained Singers (2011) (26)
Extracting Ground-Truth Information from MIDI Files: A MIDIfesto (2016) (26)
Using acoustic condition clustering to improve acoustic change detection on broadcast news (2000) (26)
LABROSA'S AUDIO MUSIC SIMILARITY AND CLASSIFICATION SUBMISSIONS (2007) (25)
Multi-voice polyphonic music transcription using eigeninstruments (2009) (25)
Monaural Speech Separation using Source-Adapted Models (2007) (24)
Voice activity detection in personal audio recordings using autocorrelogram compensation (2006) (24)
Clustering Beat-Chroma Patterns in a Large Music Database (2010) (24)
A Quantitative Comparison of Different Approaches for Melody Extraction from Polyphonic Audio Recordings (2006) (23)
Echoprint: An Open Music Identification Service (2011) (22)
Combining localization cues and source model constraints for binaural source separation (2011) (22)
Coincidence, Categorization, and Consolidation: Learning to Recognize Sounds with Minimal Supervision (2019) (22)
IBM Research and Columbia University TRECVID-2012 Multimedia Event Detection (MED), Multimedia Event Recounting (MER), and Semantic Indexing (SIN) Systems (2012) (22)
Datasets and Evaluation (2018) (21)
A probability model for interaural phase difference (2006) (21)
Data-driven voice source waveform analysis and synthesis (2012) (21)
Soundtrack classification by transient events (2011) (20)
Improving MIDI-audio alignment with acoustic features (2009) (20)
Evaluating Speech Separation Systems (2005) (20)
Codebook-based Scalable Music Tagging with Poisson Matrix Factorization (2014) (19)
Source separation based on binaural cues and source model constraints (2008) (19)
Towards single-channel unsupervised source separation of speech mixtures: the layered harmonics/formants separation-tracking model (2004) (19)
PLP2: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns (2004) (19)
The Echo Nest Musical Fingerprint (2010) (18)
Combined speech and speaker recognition with speaker-adapted connectionist models (1999) (18)
Deformable Spectrograms (2005) (18)
Improving Generalization for Classification-Based Polyphonic Piano Transcription (2007) (17)
Computational Auditory Scene Analysis: Principles, Practice and Applications (1999) (17)
Addressing Missing Labels in Large-Scale Sound Event Recognition Using a Teacher-Student Framework With Loss Masking (2020) (17)
Multi-channel source separation by beamforming trained with factorial HMMs (2003) (16)
An Introduction to Signal Processing for Speech (2010) (16)
Stylization of pitch with syllable-based linear segments (2008) (16)
Micbots: Collecting large realistic datasets for speech and audio research using mobile robots (2015) (15)
A Perceptual Representation of Sound for Auditory Signal Separation (1992) (15)
Introduction to sound scene and event analysis (2018) (15)
Using mutual information to design class-specific phone recognizers (2003) (15)
Better beat tracking through robust onset aggregation (2014) (14)
Making a scene: alignment of complete sets of clips based on pairwise audio match (2012) (14)
Detecting music in ambient audio by long-window autocorrelation (2008) (14)
Improved recognition by combining different features and different systems (2000) (14)
A Wavelet Based Sinusoid Model of Sound for Auditory Signal Separation (1991) (14)
Hierarchic models of hearing for sound separation and reconstruction (1993) (13)
Pruning subsequence search with attention-based embedding (2016) (13)
Finding similar acoustic events using matching pursuit and locality-sensitive hashing (2009) (13)
The 2010 LabROSA Chord Recognition System (2010) (12)
Error visualization for tandem acoustic modeling on the Aurora task (2002) (12)
Audio-visual atoms for generic video concept classification (2010) (12)
A Video Compression-Based Approach to Measure Music Structural Similarity (2013) (12)
Clap detection and discrimination for rhythm therapy (2005) (11)
Handling Asynchrony in Audio-Score Alignment (2009) (11)
Inharmonic speech: a tool for the study of speech perception and separation (2012) (11)
Estimating the Number of Marine Mammals Using Recordings of Clicks from One Microphone (2006) (11)
A variational EM algorithm for learning eigenvoice parameters in mixed signals (2009) (11)
Large-Scale Weakly-Supervised Content Embeddings for Music Recommendation and Tagging (2020) (10)
Towards Learning Semantic Audio Representations from Unlabeled Data (2017) (10)
Reducing errors by increasing the error rate: MLP Acoustic Modeling for Broadcast News Transcription (1999) (10)
Subband autocorrelation features for video soundtrack classification (2013) (9)
A Chroma-based Tempo-insensitive Distance Measure for Cover Song Identification (2007) (9)
A perceptual representation of sound for source separation. (1992) (9)
IBM Research and Columbia University TRECVID-2013 Multimedia Event Detection (MED), Multimedia Event Recounting (MER), Surveillance Event Detection (SED), and Semantic Indexing (SIN) Systems (2013) (9)
Solo Voice Detection Via Optimal Cancellation (2007) (9)
Voice source waveform analysis and synthesis using principal component analysis and Gaussian mixture modelling (2009) (8)
MuLan: A Joint Embedding of Music Audio and Natural Language (2022) (8)
Music-Content-Adaptive Robust Principal Component Analysis for a Semantically Consistent Separation of Foreground and Background in Music Audio Signals (2014) (8)
Preliminary intelligibility tests of a monaural speech segregation system (2008) (7)
Hidden Markov Model Based Speech Activity Detection for the ICSI Meeting Project (2001) (7)
THE HYDRA SYSTEM OF UNSTRUCTURED COVER SONG DETECTION (2009) (7)
Data-driven articulatory inversion incorporating articulator priors (2008) (7)
Pushing the Envelope – Aside : Beyond the Spectral Envelope as the Fundamental Representation for Speech Recognition (2008) (7)
Signal Processing Magazine E-Newsletter: Inside Out (2007) (7)
IMPROVING GENERALIZATION FOR POLYPHONIC PIANO TRANSCRIPTION (2007) (7)
Self-Supervised Learning from Automatically Separated Sound Scenes (2021) (7)
Modeling nonlinear circuits with linearized dynamical models via kernel regression (2013) (7)
Learning auditory models of machine voices (2005) (6)
Modelling Sound Dynamics Using Deformable Spectrograms: Segmenting the Spectrogram into Smooth Regions (2006) (6)
Introduction to the Special Issue on Music Signal Processing (2011) (6)
4 Model-Based Scene Analysis (2006) (6)
Phone Recognition for Mixed Speech Signals : Comparison of Human Auditory Cortex and Machine Performance (2015) (5)
Combining bottom-up and top-down constraints to achieve robust ASR: The multisource decoder (2001) (5)
A perceptual representation of audio for co-channel source separation (1991) (5)
Midlevel representations for computational auditory scene analysis: the Weft element (1998) (5)
Timescale Modification and Wavelet Representations (1992) (5)
Speech enhancement by low-rank and convolutive dictionary spectrogram decomposition (2014) (5)
Accessing Minimal-Impact Personal Audio (2006) (4)
Detecting proximity from personal audio recordings (2014) (4)
Barefoot multimedia, or, All is not what it seems, Moriarty (1994) (4)
Introduction to the Special Issue on Music Information Retrieval (2008) (4)
Recognition and Organization of Speech and Audio (2001) (4)
Speech decoloration based on the product-of-filters model (2014) (4)
Detailed graphical models for source separation and missing data interpolation in audio (2004) (4)
Simultaneous Speech and Speaker Recognition Using Hybrid Architecture (1999) (3)
Guided harmonic sinusoid estimation in a multi-pitch environment (2009) (3)
The Ideal Interaural Parameter Mask: A bound on binaural separation systems (2009) (3)
THE 2009 LABROSA PRETRAINED AUDIO CHORD RECOGNITION SYSTEM (2009) (3)
Evaluating music sequence models through missing data (2011) (3)
Temporal integration as a consequence of multi-source decoding (2002) (3)
Automatic analysis of heart sounds using speech recognition techniques (2005) (3)
Proceedings of the 2012 ACM international workshop on Audio and multimedia methods for large-scale video analysis (2012) (3)
Pattern Recognition Applied to Music Signals (2003) (3)
SELECTION , PARAMETER ESTIMATION AND DISCRIMINATIVE TRAINING OF HIDDEN MARKOV MODELS FOR GENERIC ACOUSTIC MODELING (3)
A History and Overview of Machine Listening (2010) (3)
Audio signal recognition for speech, music, and environmental sounds (2003) (3)
Cover Song ID with Beat-Synchronous Chroma Features (2006) (3)
Underconstrained stochastic representations for top-down computational auditory scene analysis (1995) (3)
The Cepstrum as a Spectral Analyzer (2011) (2)
PLP-squared: autoregressive modeling of auditory-like 2-d spectro-temporal patterns (2004) (2)
Brief History of Automatic Speech Recognition (2011) (2)
Introduction to the special issue on the recognition and organization of real-world sound (2004) (2)
A simulation of vowel segregation based on across-channel glottal-pulse synchrony (1994) (2)
Reproducing Pitch Experiments in “ Measuring the Evolution of Contemporary Western Popular Music ” (2013) (2)
The Auditory System as a Filter Bank (2011) (2)
Editorial: Special Section on Statistical and Perceptual Audio Processing (2006) (2)
Automatic Speech Recognition (2011) (2)
SPEECH RECOGNITION AS A COMPONENT IN COMPUTATIONAL AUDITORY SCENE ANALYSIS (2)
Modeling the auditory organization of speech - a summary and some comments (1998) (2)
Blind MVA Speech Feature Processing on Aurora 2.0 (2004) (2)
AMVA'12: ACM international workshop on audio and multimedia methods for large-scale video analysis (2012) (1)
The Multimedia Lexicon : Automatic object and structure discovery in audio-video-text content (2000) (1)
Computational Auditory Scene Analysis (2005) (1)
Human Speech Recognition (2011) (1)
Musical Instrument Acoustics (2011) (1)
Chapter 1 EVALUATING SPEECH SEPARATION SYSTEMS (2004) (1)
Scene Analysis for Speech and Audio Recognition (2003) (1)
Semantic Audio Analysis (2003) (1)
Content-adaptive speech enhancement by a sparsely-activated dictionary plus low rank decomposition (2014) (1)
Proceedings of the 1999 DARPA Broadcast News Workshop (1999) (1)
Direct processing of mpeg audio using companding and BFP techniques (2011) (1)
An overview of digital audio (1997) (1)
A comparison of pitch extraction methodologies for dolphin vocalization (2008) (1)
Feature Extraction for ASR (2011) (1)
Speech Analysis and Synthesis Overview (2011) (1)
THE AUDITORY ORGANIZATION OF SPEECH IN LISTENERS AND MACHINES (1998) (1)
Analysis of Everyday Sounds (2007) (1)
Estimating timing and channel distortion across related signals (2014) (1)
Description and analysis of novelties introduced in DCASE Task 4 2022 on the baseline system (2022) (1)
Linear prediction of temporal envelopes for speech and audio applications (2007) (1)
Using Learned Source Models to Organize Sound Mixtures (2006) (0)
Digital Filters and Discrete Fourier Transform (2011) (0)
Machine Listening : Sound organization for multimedia understanding (2001) (0)
Modeling Meeting Turns (2003) (0)
Some Aspects of Computer Music Synthesis (2011) (0)
Current work at ICSI (1999) (0)
Computers, Robotics, and the Human Brain (2008) (0)
On the Other Side (2000) (0)
Active Learning for Interactive Multimedia Retrieval Algorithms that employ feedback from users to guide the search process can provide relatively rapid and efficient results from large multimedia data collections. (2008) (0)
What Can We Learn from Large Music Databases (2004) (0)
8. Pattern Classification (2011) (0)
DRAFT On Machine Perception of Sound Ph (2016) (0)
Handling Speech in the Wild (2012) (0)
Pushing Up Hypotheses Using Context-Dependent Links (2007) (0)
Sound texture modelling with linear prediction in both time and frequency domains (2003) (0)
Sound Analysis Research at LabROSA (2005) (0)
Tandem acoustic modeling: Neural nets for mainstream ASR? (2000) (0)
Building a Binaural Source Separator (2006) (0)
Lecture 3: Perception (2013) (0)
Free Universal Sound Separation Dataset (2020) (0)
Medium‐Rate and High‐Rate Vocoders (2011) (0)
Music Audio Research at LabROSA (2010) (0)
Audio Information Extraction (2002) (0)
Modeling the Auditory Component of Speech (2012) (0)
Columbia: Recent + Future (2004) (0)
EXTRACTING FROM MUSIC AUDIO Information includes individual notes, tempo, beat, and other musical properties, along with listener preferences based on how the listener experiences music. (2006) (0)
Sound content analysis for indexing and understanding (2000) (0)
Automatic Prosody Labeling Final Project Report for EE 6820-Spring 05 Professor : (2005) (0)
Auditory Scene Analysis: phenomena, theories and computational models (1998) (0)
Jan 2000 European Trip report: THISL and RESPITE (2000) (0)
Speech Recognition technology from the ICSI Realization Group (1998) (0)
Mapping Meetings: Columbia's Plans (2001) (0)
The 2008 LabROSA Supervised Chord Recognition System (2008) (0)
Speech‐Recognition Overview (2011) (0)
Fourth International Workshop on Computation Advances in Multisensor Adaptive Processing (CAMSAP) (2011) (0)
Session details: Special oral session 3: multi-modal music information retrieval (2010) (0)
Sequential Organization from an Ecological Perspective (2009) (0)
An overview of Speech Recognition research at ICSI (1999) (0)
6. Digital Signal Processing (2011) (0)
Extracting and Using Music Audio Information (2007) (0)
Separating Speech from Speech Noise Annual Report 2006 (2007) (0)
EXTRACTING GROUND TRUTH INFORMATION FROM MIDI FILES: (2016) (0)
General Soundtrack Analysis (2002) (0)
Recognizing and Classifying Environmental Sounds (2012) (0)
Perceptually-Inspired Music Audio Analysis (2012) (0)
RESPITE: Tandem and multistream research (2000) (0)
Learning, Using, and Adapting Models in Scene Analysis (2009) (0)
Tandem modeling investigations (2001) (0)
Computer-implemented methods and systems for modeling and recognition of speech (2010) (0)
Airplane Noise Detection based on Hidden Markov Model Classification (2002) (0)
Joint Audio-Visual Signatures for Web Video Analysis (2010) (0)
ITR : Listen and Learn – Artificial Intelligence in Auditory Environments (2003) (0)
Deterministic Sequence Recognition for ASR (2011) (0)
Statistical Sequence Recognition (2011) (0)
Data-Driven Music Audio Understanding IIS-0713334 Annual Report 2008 (2008) (0)
Searching and Describing Audio Databases (2005) (0)
Chapter 1 Introduction to Sound Scene and Event Analysis (2019) (0)
Some aspects of the ICSI 1998 Broadcast News effort (1998) (0)
Statistical Model Training (2011) (0)
Nonlinear mapping for feature extraction in automatic speech recognition (2009) (0)
Learning and Scene Analysis (2004) (0)
39. Source Separation (2011) (0)
ICSI /ThisL status report (1997) (0)
MUSIC CLASSIFICATION USING SUPPORT VECTOR MACHINES WITH COVARIANCE AND MODULATION FEATURES (2007) (0)
Linguistic Categories for Speech Recognition (2011) (0)
Dataset Balancing Can Hurt Model Performance (2023) (0)
13. Room Acoustics (2011) (0)
17. Speech Perception (2011) (0)
The Listening Machine : 1 st Annual Report (2004) (0)
LabROSA's audio music similarity and classification systems (2007) (0)
Using Speech Models for Separation (2009) (0)
Music Signal Analysis (2011) (0)
41. Speaker Verification (2011) (0)
Enhancing a model of auditory information processing to exhibit accommodation (1992) (0)
VQ Source Models: Perceptual and Phase Issues (2006) (0)
Workshop summary: Sparse methods for music audio (2009) (0)
Eurospeech, RESPITE and THISL (1999) (0)
European projects update (1999) (0)
Using speech models for separation in monaural and binaural contexts. (2010) (0)
Perceptual Audio Coding (2011) (0)
MIREX 2005: What did we learn? (2005) (0)
30. Speech Synthesis (2011) (0)
38. Music Retrieval (2011) (0)
Integrating CASA with other approaches (2004) (0)
Sound, Mixtures, and Learning: LabROSA Overview (2003) (0)
Speech Separation for Recognition and Enhancement (2011) (0)
Kelly Dobson Digital + Media Department Chair Rhode Island School of Design ( RISD ) CRA Conference (2012) (0)
Content-based analysis and indexing for speech, sound and multimedia (2000) (0)
Environmental Sound Recognition and Classification (2011) (0)
Segmenting and Classifying Long-Duration Recordings of "Personal Audio" (2004) (0)
librosa: 0.4.0 release candidate 2 (2015) (0)
EARS Novel Approaches: New Features, New Units (2003) (0)
Characterizing Musical Correlates of Large-Scale Discovery Behavior (2019) (0)
Sound, Mixtures, and Learning (2002) (0)
Computational Models of Auditory Organization (2001) (0)
Music processing above and below the fundamental frequency (2004) (0)
Speech Separation: Evaluation (2004) (0)
Frequency-Domain Linear Prediction (FDLP) Features (2003) (0)
Auditory Scene Analysis in Humans and Machines (2006) (0)
Transforming Spontaneous to Read Speech (2005) (0)
Acoustic Tube Modeling of Speech Production (2011) (0)
Machine Recognition of Sounds in Mixtures (2003) (0)
NSF-CAREER : The Listening Machine IIS-0238301 2003 – 2008 Final Report (2009) (0)
Mining for the Meaning of Music (2008) (0)
Speaker Turns from Between-Channel Differences (2004) (0)
Ideas for Next-Generation ASR (2003) (0)
Using Source Models in Speech Separation (2007) (0)
Automatically segmenting and clustering minimal-impact personal audio archives (2006) (0)
Review of SPRACH/Thisl meetings Cambridge UK, 1998sep03/04 (1998) (0)
General chair's introduction (2011) (0)
The State of Music at LabROSA (2013) (0)
Speech interfaces: A survey and some current projects (2000) (0)
Synthetic Audio: A Brief History (2011) (0)
Sound, Mixtures, and Learning: A Perspective on CASA (2003) (0)
Evaluation set DCASE 2021 task 4 (for submissions) (2021) (0)
RESPITE progress report (1999) (0)
Synthesis and Coding (2011) (0)
Audio Life Logs : Spotting Events , Integrating Information , and Protecting Privacy (2005) (0)
The Challenge of Communicating Computational Research (2013) (0)
Minimal-Impact Personal Audio Archives (2006) (0)
Multimedia Applications of Audio Recognition (2004) (0)
Sound Organization by Source Models in Humans and Machines (2006) (0)
On Communicating Computational Research (2013) (0)
Personal Audio Archives (2006) (0)
Searching for Similar Phrases in Music Audio (2007) (0)
Elen E4810 Digital Signal Processing Final Solutions (2011) (0)
Meeting Recorder: Audio Processing (2002) (0)
The Listening Machine : Sound Source Organization for Multimedia Understanding (2002) (0)
TeachWare: Audio Resources [Best of the Web] (2008) (0)
Searching for Speech in Personal Audio (2005) (0)
Learning the sound of musical instruments (0)
Data-Driven Music Audio Understanding (2006) (0)
Modeling Music Similarity : Automatic Prediction of Subjective Preference (2002) (0)
Some Projects in Real-World Sound Analysis (2009) (0)
Modeling Music Similarity : Signal-based Models of Subjective Preference (2004) (0)
Separating Speech from Speech Noise to Improve Intelligibility (2005) (0)
NSF-CAREER: The Listening Machine Annual Report 2004 (2005) (0)
Automatic audio analysis for content description and indexing (1998) (0)
SPRACH/ThisL review (1998) (0)
Melody Extraction from Polyphonic Music Signals Melody Extraction from Polyphonic Music Signals (2014) (0)
On the importance of illusions for artificial listeners (1997) (0)
Speech Recognition at ICSI: Broadcast News and beyond (1998) (0)
Using the Soundtrack to Classify Videos (2011) (0)
THISL progress report - 1999sep (1999) (0)
NSF-CAREER: The Listening Machine IIS-0238301 Annual Report 2007 (2008) (0)
Model-Based Separation in Humans and Machines (2006) (0)
Music Information Retrieval for Jazz (2012) (0)
Inharmonic speech reveals the role of harmonicity in the cocktail party problem (2018) (0)
THISL progress report (1999) (0)
Five Ways to Present Results 3 . Sharing Code 4 . Conclusions On Communicating Computational Research (2013) (0)
Extracting Information from Sound (2011) (0)
Broadcast News: Features and acoustic modelling (1998) (0)
USING PRE-TRAINED NEURAL NETWORKS (2015) (0)
Mining Large-Scale Music Data Sets (2012) (0)
Speech Recognition and Understanding (2011) (0)
Eigenrhythms: Drum Track Bases (2004) (0)
Discriminant Acoustic Probability Estimation (2011) (0)
Speaker Identification E6820 Spring '08 Final Project Report (2008) (0)
Enhancing the Intelligibility of Speech in Speech Noise (2005) (0)
On Neural Phone Recognition of Mixed-Source ECoG Signals (2019) (0)
Low‐Rate Vocoders (2011) (0)
1 SPEECH IS AT LEAST 4-DIMENSIONAL : RECEPTIVE FIELDS IN TIME-FREQUENCY (1996) (0)
The Big Picture The State of Music at LabROSA (2014) (0)
Audio and Music Research at LabROSA (2004) (0)
A cross‐correlation model of common‐period grouping (1993) (0)
Augmenting and Exploiting Auditory Perception for Complex Scene Analysis (2013) (0)
Using Sound Source Models to Organize Mixtures (2007) (0)
Acoustic Model Training: Further Topics (2011) (0)
Beat-Synchronous Chroma Representations for Music Analysis (2007) (0)
Current Music Research at LabROSA (2008) (0)
42. Speaker Diarization (2011) (0)

This paper list is powered by the following services:

What Schools Are Affiliated With Daniel Patrick Whittlesey Ellis?

Daniel Patrick Whittlesey Ellis is affiliated with the following schools:

Columbia University

Daniel Patrick Whittlesey Ellis's Academic­Influence.com Rankings

Daniel Patrick Whittlesey Ellis's Degrees

Why Is Daniel Patrick Whittlesey Ellis Influential?

Daniel Patrick Whittlesey Ellis's Published Works

Published Works

What Schools Are Affiliated With Daniel Patrick Whittlesey Ellis?

Daniel Patrick Whittlesey Ellis's AcademicInfluence.com Rankings