Bhiksha Raj
#140,137
Most Influential Person Now
Bhiksha Raj's AcademicInfluence.com Rankings
Bhiksha Rajengineering Degrees
Engineering
#5195
World Rank
#6440
Historical Rank
Electrical Engineering
#1421
World Rank
#1515
Historical Rank

Bhiksha Rajcomputer-science Degrees
Computer Science
#6674
World Rank
#7034
Historical Rank
Algorithms
#242
World Rank
#245
Historical Rank
Machine Learning
#2245
World Rank
#2273
Historical Rank
Database
#3757
World Rank
#3911
Historical Rank

Download Badge
Engineering Computer Science
Bhiksha Raj's Degrees
- PhD Electrical and Computer Engineering Carnegie Mellon University
- Masters Electrical and Computer Engineering Carnegie Mellon University
Why Is Bhiksha Raj Influential?
(Suggest an Edit or Addition)Bhiksha Raj's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- SphereFace: Deep Hypersphere Embedding for Face Recognition (2017) (2177)
- Sphinx-4: a flexible open source framework for speech recognition (2004) (533)
- A vector Taylor series approach for environment-independent speech recognition (1996) (483)
- DCASE2017 Challenge Setup: Tasks, Datasets and Baseline System (2017) (421)
- A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research (2016) (306)
- Beyond Gaussian Pyramid: Multi-skip Feature Stacking for action recognition (2014) (287)
- Speech denoising using nonnegative matrix factorization with priors (2008) (277)
- Supervised and Semi-supervised Separation of Sounds from Single-Channel Mixtures (2007) (267)
- Reconstruction of missing features for robust speech recognition (2004) (259)
- Missing-feature approaches in speech recognition (2005) (256)
- THE CMU SPHINX-4 SPEECH RECOGNITION SYSTEM (2001) (189)
- On the Origin of Deep Learning (2017) (185)
- Greedy sparsity-constrained optimization (2011) (184)
- A Probabilistic Latent Variable Model for Acoustic Modeling (2006) (180)
- A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition (2004) (173)
- Multiparty Differential Privacy via Aggregation of Locally Trained Classifiers (2010) (160)
- Audio Event Detection using Weakly Labeled Data (2016) (151)
- Probabilistic Latent Variable Models as Nonnegative Factorizations (2008) (145)
- Likelihood-maximizing beamforming for robust hands-free speech recognition (2004) (139)
- Non-negative Hidden Markov Modeling of Audio with Application to Source Separation (2010) (134)
- Design of the CMU sphinx-4 decoder (2003) (132)
- Soft Mask Methods for Single-Channel Speaker Separation (2007) (129)
- Non-negative matrix factorization based compensation of music for automatic speech recognition (2010) (129)
- Microphone Array Processing for Distant Speech Recognition: From Close-Talking Microphones to Far-Field Sensors (2012) (121)
- The 1996 Hub-4 Sphinx-3 System (1997) (107)
- One-handed gesture recognition using ultrasonic Doppler sonar (2009) (105)
- Regularized non-negative matrix factorization with temporal dependencies for speech denoising (2008) (98)
- Techniques for Noise Robustness in Automatic Speech Recognition (2012) (91)
- Sparse and shift-invariant feature extraction from non-negative data (2008) (91)
- Speech in Noisy Environments: robust automatic segmentation, feature extraction, and hypothesis combination (2001) (90)
- Automatic generation of subword units for speech recognition systems (2002) (87)
- Sound Event Detection in the DCASE 2017 Challenge (2019) (85)
- Sparse Overcomplete Latent Variable Decomposition of Counts Data (2007) (84)
- A Sparse Non-Parametric Approach for Single Channel Separation of Known Sounds (2009) (78)
- Privacy-Preserving Speaker Verification and Identification Using Gaussian Mixture Models (2013) (78)
- Voice Impersonation Using Generative Adversarial Networks (2018) (78)
- The 1997 CMU Sphinx-3 English Broadcast News Transcription System (1997) (77)
- Acoustic Doppler sonar for gait recogination (2007) (75)
- Data-driven environmental compensation for speech recognition: A unified approach (1998) (74)
- Active-Set Newton Algorithm for Overcomplete Non-Negative Representations of Audio (2013) (70)
- Compositional Models for Audio Processing: Uncovering the structure of sound mixtures (2015) (69)
- COMPENSATION FOR ENVIRONMENTAL DEGRADATION IN AUTOMATIC SPEECH RECOGNITION (1999) (64)
- Unsupervised Learning of Acoustic Unit Descriptors for Audio Content Representation and Classification (2011) (61)
- Shift-Invariant Probabilistic Latent Component Analysis (2007) (58)
- Audio event detection from acoustic unit occurrence patterns (2012) (57)
- A boosting approach for confidence scoring (2001) (56)
- The effects of background music on speech recognition accuracy (1997) (55)
- Multivariate-Gaussian-based cepstral normalization for robust speech recognition (1995) (54)
- Ultrasonic Doppler Sensing in HCI (2012) (53)
- Quantization-based language model compression (2001) (52)
- Microphone array processing for distant speech recognition: Towards real-world deployment (2012) (52)
- A Closer Look at Weak Label Learning for Audio Events (2018) (52)
- Multi-channel source separation by factorial HMMs (2003) (51)
- Privacy-preserving speech processing: cryptographic and string-matching frameworks show promise (2013) (51)
- Gammatone sub-band magnitude-domain dereverberation for ASR (2011) (51)
- Measuring prevalence of other-oriented transactive contributions using an automated measure of speech style accommodation (2013) (50)
- Bandwidth expansion of narrowband speech using non-negative matrix factorization (2005) (50)
- Swara Histogram Based Structural Analysis And Identification Of Indian Classical Ragas (2013) (48)
- Deep CNN Framework for Audio Event Recognition using Weakly Labeled Web Data (2017) (47)
- Missing data imputation for spectral audio signals (2009) (47)
- Sparse Overcomplete Decomposition for Single Channel Speaker Separation (2007) (46)
- On tracking noise with linear dynamical system models (2004) (46)
- Privacy preserving probabilistic inference with Hidden Markov Models (2011) (45)
- Automatic clustering and generation of contextual questions for tied states in hidden Markov models (1999) (45)
- Signal separation for robust speech recognition based on phase difference information obtained in the frequency domain (2009) (44)
- Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording (2016) (44)
- Disjoint Mapping Network for Cross-modal Matching of Voices and Faces (2018) (44)
- Channel selection based on multichannel cross-correlation coefficients for distant speech recognition (2011) (43)
- Ultrasonic Doppler Sensor for Voice Activity Detection (2007) (41)
- Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery (2017) (40)
- Phoneme-Dependent NMF for Speech Enhancement in Monaural Mixtures (2011) (39)
- Face Reconstruction from Voice using Generative Adversarial Networks (2019) (39)
- Missing Data Imputation for Time-Frequency Representations of Audio Signals (2011) (39)
- A hierarchical system for word discovery exploiting DTW-based initialization (2013) (39)
- CMU-Informedia @ TRECVID 2013 Multimedia Event Detection (2013) (38)
- Efficient autism spectrum disorder prediction with eye movement: A machine learning framework (2015) (38)
- A minimum mean squared error estimator for single channel speaker separation (2004) (38)
- Latent Dirichlet Decomposition for Single Channel Speaker Separation (2006) (36)
- Scale independent raga identification using chromagram patterns and swara based features (2013) (36)
- Audio event and scene recognition: A unified approach using strongly and weakly labeled data (2016) (35)
- Joint sparsity models for wideband array processing (2011) (35)
- Privacy-preserving speaker verification as password matching (2012) (34)
- Exploring the Best Loss Function for DNN-Based Low-latency Speech Enhancement with Temporal Convolutional Networks (2020) (34)
- Speech recognizer-based microphone array processing for robust hands-free speech recognition (2002) (34)
- Cross Modal Audio Search and Retrieval with Joint Embeddings Based on Text and Audio (2019) (34)
- Contrast and Order Representations for Video Self-supervised Learning (2021) (33)
- Classifier-based mask estimation for missing feature methods of robust speech recognition (2000) (33)
- Cepstral compensation by polynomial approximation for environment-independent speech recognition (1996) (33)
- Is normalization indispensable for training deep neural network? (2020) (33)
- Reconstruction of damaged spectrographic features for robust speech recognition (2000) (32)
- Tracking noise via dynamical systems with a continuum of states (2003) (32)
- Informedia@TrecVID 2014: MED and MER (2014) (32)
- Inference of missing spectrographic features for robust speech recognition (1998) (31)
- Ultrasonic Doppler sensor for speaker recognition (2008) (30)
- A Survey: Time Travel in Deep Learning Space: An Introduction to Deep Learning Models and How Deep Learning Models Evolved from the Initial Ideas (2015) (30)
- Complex recurrent neural networks for denoising speech signals (2015) (29)
- Environmental Noise Embeddings for Robust Speech Recognition (2016) (28)
- Content-Based Representations of Audio Using Siamese Neural Networks (2017) (27)
- A Comparison Between Spoken Queries and Menu-Based Interfaces for In-car Digital Music Selection (2005) (27)
- Viral Spread via Entertainment and Voice-Messaging Among Telephone Users in India (2016) (25)
- Synthesizing speech from Doppler signals (2010) (24)
- Unsupervised word segmentation from noisy input (2013) (24)
- Ultrasonic sensing for robust speech recognition (2010) (24)
- HEAR: Holistic Evaluation of Audio Representations (2022) (22)
- Informedia e-lamp @ TRECVID 2012 multimedia event detection and recounting MED and MER (2012) (22)
- Weakly supervised scalable audio content analysis (2016) (22)
- Privacy Preserving Speaker Verification Using Adapted GMMs (2011) (22)
- Latent-variable decomposition based dereverberation of monaural and multi-channel signals (2010) (22)
- Learning-Based Auditory Encoding for Robust Speech Recognition (2010) (22)
- Recognizing speech from simultaneous speakers (2005) (21)
- A Unifying Analysis of Projected Gradient Descent for $ell_p$-constrained Least Squares (2011) (21)
- Iterative Bayesian word segmentation for unsupervised vocabulary discovery from phoneme lattices (2014) (21)
- Active-set newton algorithm for non-negative sparse coding of audio (2014) (21)
- Signal and Feature Compensa-tion Methods for Robust Speech Recognition (2002) (21)
- Content-based Video Indexing and Retrieval Using Corr-LDA (2016) (21)
- Analysis-by-synthesis features for speech recognition (2008) (21)
- THE 1999 CMU 10X REAL TIME BROADCAST NEWS TRANSCRIPTION SYSTEM (1999) (21)
- RECOGNITION OF CONTINUOUS BROADCAST NEWS WITH MULTIPLE UNKNOWN SPEAKERS AND ENVIRONMENTS (1995) (20)
- Robust 1-bit Compressive Sensing via Gradient Support Pursuit (2013) (20)
- Soft mask estimation for single channel speaker separation (2004) (20)
- Reducing communication overhead in distributed learning by an order of magnitude (almost) (2015) (20)
- The Right to Talk: An Audio-Visual Transformer Approach (2021) (19)
- Unsupervised Structure Discovery for Semantic Analysis of Audio (2012) (19)
- Calibration of microphone arrays for improved speech recognition (2001) (19)
- Lossless compression of language model structure and word identifiers (2003) (19)
- An Unsupervised Dynamic Bayesian Network Approach to Measuring Speech Style Accommodation (2012) (19)
- Structured redefinition of sound units by merging and splitting for improved speech recognition (2000) (19)
- Speech-recognizer-based filter optimization for microphone array processing (2003) (19)
- A Comparative Study Of Indian And Western Music Forms (2013) (18)
- Classification in Likelihood Spaces (2004) (18)
- Classifier-based non-linear projection for adaptive endpointing of continuous speech (2003) (18)
- Sound event classification using ontology-based neural networks (2018) (17)
- Spectrographic seam patterns for discriminative word spotting (2012) (17)
- An approach for self-training audio event detectors using web data (2016) (17)
- The MERL SpokenQuery information retrieval system a system for retrieving pertinent documents from a spoken query (2002) (17)
- Privacy-Preserving Speaker Authentication (2012) (17)
- Example-Driven Bandwidth Expansion (2007) (17)
- The automatic assessment of knowledge integration processes in project teams (2011) (17)
- Unsupervised hierarchical structure induction for deeper semantic analysis of audio (2013) (17)
- Secure Modular Hashing (2015) (17)
- Large Margin Gaussian Mixture Models with Differential Privacy (2012) (17)
- On the Appropriateness of Complex-Valued Neural Networks for Speech Enhancement (2016) (16)
- Binary Sparse Coding of Convolutive Mixtures for Sound Localization and Separation via Spatialization (2016) (16)
- An iterative least-squares technique for dereverberation (2011) (16)
- Supervised monaural source separation based on autoencoders (2017) (16)
- Robust Speech Recognition: The case for restoring missing features (2001) (16)
- Rapid development of public health education systems in low-literacy multilingual environments: combating ebola through voice messaging (2015) (16)
- The Incredible Shrinking Neural Network: New Perspectives on Learning Representations Through The Lens of Pruning (2017) (16)
- Probabilistic Factorization of Non-negative Data with Entropic Co-occurrence Constraints (2009) (15)
- FoolHD: Fooling Speaker Identification by Highly Imperceptible Adversarial Disturbances (2020) (15)
- FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning (2022) (15)
- Short-term analysis for estimating physical parameters of speakers (2016) (15)
- The relationship of voice onset time and Voice Offset Time to physical age (2016) (15)
- Automatic generation of phone sets and lexical transcriptions (2000) (14)
- Learning Model-Based Sparsity via Projected Gradient Descent (2012) (14)
- Secure binary embeddings of front-end factor analysis for privacy preserving speaker verification (2013) (14)
- Classification with free energy at raised temperatures (2003) (14)
- A Speech-in List-out Approach to Spoken User Interfaces (2004) (13)
- An FFT-Based Companding Front End for Noise-Robust Automatic Speech Recognition (2007) (13)
- SphereFace2: Binary Classification is All You Need for Deep Face Recognition (2021) (13)
- SPECTROGRAM DIMENSIONALITY REDUCTION WITH INDEPENDENCE CONSTRAINTS (2010) (13)
- Be Careful What You Backpropagate: A Case For Linear Output Activations & Gradient Boosting (2017) (13)
- Doppler based speed estimation of vehicles using passive sensor (2013) (13)
- Formant manipulations in voice disguise by mimicry (2016) (12)
- SphereFace Revived: Unifying Hyperspherical Face Recognition (2021) (12)
- Forensic anthropometry from voice: An articulatory-phonetic approach (2016) (12)
- Model Compensation and Matched Condition Methods for Robust Speech Recognition (2002) (12)
- Compositional models for audio processing (2014) (12)
- Privacy-preserving speaker verification using garbled GMMS (2014) (12)
- Recognizing talking faces from acoustic Doppler reflections (2008) (12)
- Optimization of the DET curve in speaker verification (2012) (11)
- Sensor and Data Systems, Audio-Assisted Cameras and Acoustic Doppler Sensors (2007) (11)
- Continuous Feature Adaptation for Non-Native Speech Recognition (2007) (11)
- Classifier Risk Estimation under Limited Labeling Resources (2016) (11)
- On the Origin of Deep Learning On the Origin of Deep Learning (2017) (11)
- Unsupervised Fusion Weight Learning in Multiple Classifier Systems (2015) (11)
- ADAPTATION AND COMPENSATION : APPROACHES TO MICR OPHONE AND SPEAKER INDEPENDENCE IN AUTOMATIC SPEECH RECOGNITION (1996) (11)
- Hide and Speak: Towards Deep Neural Networks for Speech Steganography (2019) (11)
- Learning Sound Events From Webly Labeled Data (2018) (10)
- Event detection in short duration audio using Gaussian Mixture Model and Random Forest Classifier (2013) (10)
- Microphone Array Post-filter based on Spatially-Correlated Noise Measurements for Distant Speech Recognition (2012) (10)
- Handcrafted Local Features are Convolutional Neural Networks (2015) (10)
- Detection and Evaluation of Human and Machine Generated Speech in Spoofing Attacks on Automatic Speaker Verification Systems (2020) (10)
- When to Interrupt: A Comparative Analysis of Interruption Timings Within Collaborative Communication Tasks (2017) (10)
- Human Behaviour Recognition Using Wifi Channel State Information (2019) (10)
- The Markov selection model for concurrent speech recognition (2010) (10)
- A joint decoding algorithm for multiple-example-based addition of words to a pronunciation lexicon (2009) (10)
- Detecting Psychological Distress in Adults Through Transcriptions of Clinical Interviews (2016) (9)
- Privacy Preserving Speech Processing (2013) (9)
- Speaker tracking with spherical microphone arrays (2013) (9)
- Discriminatively trained dependency language modeling for conversational speech recognition (2013) (9)
- Maximum kurtosis beamforming with a subspace filter for distant speech recognition (2011) (9)
- Mining Multimodal Repositories for Speech Affecting Diseases (2018) (9)
- HEAR 2021: Holistic Evaluation of Audio Representations (2022) (9)
- Unsupervised Word Discovery from Phonetic Input Using Nested Pitman-Yor Language Modeling (2013) (9)
- DCASE 2017 Task 1: Acoustic Scene Classification Using Shift-Invariant Kernels and Random Features (2018) (9)
- A unified approach for robust speech recognition (1995) (9)
- Features and Kernels for Audio Event Recognition (2016) (9)
- Discovering sound concepts and acoustic relations in text (2016) (8)
- Self-Supervised 3D Face Reconstruction via Conditional Estimation (2021) (8)
- Creating a linguistic plausibility dataset with non-expert annotators (2010) (8)
- Detecting sound objects in audio recordings (2014) (8)
- AudioSentibank: Large-scale Semantic Ontology of Acoustic Concepts for Audio Content Analysis (2016) (8)
- Inferring room semantics using acoustic monitoring (2017) (8)
- Deriving vocal tract shapes from electromagnetic articulograph data via geometric adaptation and matching (2009) (8)
- AN ACOUSTIC DOPPLER-BASED FRONT END FOR HANDS FREE SPOKEN USER INTERFACES (2006) (8)
- Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection (2020) (7)
- Hide and Speak: Deep Neural Networks for Speech Steganography (2019) (7)
- The REVERB Challenge: A Benchmark Task for Reverberation-Robust ASR Techniques (2017) (7)
- NELS - Never-Ending Learner of Sounds (2018) (7)
- Bandwidth Expansionwith a pólya URN Model (2007) (7)
- Spokenquery: an alternate approach to chosing items with speech (2004) (7)
- Privacy Preserving Protocols for Eigenvector Computation (2010) (7)
- Logsum Using Garbled Circuits (2015) (7)
- Privacy-Preserving Important Passage Retrieval (2014) (7)
- Factorization With Temporal Dependencies for Speech Denoising (2008) (7)
- Acoustic Scene Classification Using Discrete Random Hashing for Laplacian Kernel Machines (2018) (6)
- Deriving Compact Feature Representations Via Annealed Contraction (2020) (6)
- Reconstructing faces from voices (2019) (6)
- Improving weakly supervised sound event detection with self-supervised auxiliary tasks (2021) (6)
- Efficient Protocols for Principal Eigenvector Computation over Private Data (2011) (6)
- Sherlock: A Crowd-sourced System For Automatic Tagging Of Indoor Floor Plans (2020) (6)
- Querying Depression Vlogs (2018) (6)
- Towards fusion of feature extraction and acoustic model training: a top down process for robust speech recognition (2009) (6)
- Privacy-Preserving Multi-Document Summarization (2015) (6)
- Large Margin Multiclass Gaussian Classification with Differential Privacy (2010) (6)
- Domain adduced state tying for cross-domain acoustic modelling (1999) (6)
- A companding front end for noise-robust automatic speech recognition (2005) (6)
- Sequential Randomized Smoothing for Adversarially Robust Speech Recognition (2021) (6)
- Privacy-preserving speaker verification using secure binary embeddings (2014) (6)
- Speaker verification using Secure Binary Embeddings (2013) (6)
- Topic and Prosodic Modeling for Interruption Management in Multi-User Multitasking Communication Interactions (2017) (5)
- Learning contextual relevance of audio segments using discriminative models over AUD sequences (2011) (5)
- Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session (2023) (5)
- CEPSTRAL COMPENSATION USING STATISTICAL LINEARIZATION (2000) (5)
- Privacy-preserving Query-by-Example Speech Search (2015) (5)
- Comparison of width-wise and length-wise language model compression (2001) (5)
- USB: A Unified Semi-supervised Learning Benchmark for Classification (2022) (5)
- Signal and Feature Compensation Methods for Robust Speech Recognition (2018) (5)
- AudioPairBank: towards a large-scale tag-pair-based audio content analysis (2016) (5)
- On the pragmatism of using binary classifiers over data intensive neural network classifiers for detection of COVID-19 from voice (2022) (5)
- Neural Regression Trees (2018) (5)
- SEPARATING A FOREGROUND SINGER FROM (2006) (5)
- A Comparative Analysis of Human-Mediated and System-Mediated Interruptions for Multi-user, Multitasking Interactions (2017) (5)
- Microphone array processing for distant speech recognition: Spherical arrays (2012) (5)
- The Basics of Automatic Speech Recognition (2012) (5)
- Speech Separation by Humans and Machines (2004) (5)
- Exploiting Temporal Sequence Structure for Semantic Analysis of Multimedia (2012) (5)
- The in-the-Wild Speech Medical Corpus (2021) (5)
- Probabilistic Latent Variable Model for Sparse Decompositions of Non-negative Data (2009) (4)
- Online Video Instance Segmentation via Robust Context Fusion (2022) (4)
- Speech-Based UI Design for the Automobile (2008) (4)
- Optimizing Neural Network Embeddings Using a Pair-Wise Loss for Text-Independent Speaker Verification (2019) (4)
- Privacy Preserving Spam Filtering (2011) (4)
- Time Signal Classification Using Random Convolutional Features (2019) (4)
- Identifying Actions for Sound Event Classification (2021) (4)
- TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement (2023) (4)
- Predicting Idea Co-Construction in Speech Data using Insights from Sociolinguistics (2012) (4)
- Nonlinear Semi-Parametric Models for Survival Analysis (2019) (4)
- A two factor transformation for speaker verification through ℓ1 comparison (2017) (4)
- Audio Content Based Geotagging in Multimedia (2016) (4)
- Semantic Indexing (2014) (4)
- Privacy preserving Distance computation using somewhat-trusted third parties (2016) (4)
- R^2VOS: Robust Referring Video Object Segmentation via Relational Multimodal Cycle Consistency (2022) (4)
- A Corrective Learning Approach for Text-Independent Speaker Verification (2018) (4)
- Feature compensation with secondary sensor measurements for robust speech recognition (2005) (4)
- Multimedia Event Detection and Recounting (2013) (3)
- Structured sparse coding for microphone array location calibration (2012) (3)
- In-the-Wild End-to-End Detection of Speech Affecting Diseases (2019) (3)
- AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation (2022) (3)
- A paired test for recognizer selection with untranscribed data (2011) (3)
- Improving Speech Enhancement through Fine-Grained Speech Characteristics (2022) (3)
- Detecting gender differences in perception of emotion in crowdsourced data (2019) (3)
- An integrated approach to improve speech recognition rate for non-native speakers (2006) (3)
- Recent improvements of ASR models in the face of adversarial attacks (2022) (3)
- Analysing Speech for Clinical Applications (2018) (3)
- Point3D: tracking actions as moving points with 3D CNNs (2022) (3)
- Proceeding of the 1 st International Workshop on Privacy-Preserving IR : When Information Retrieval Meets Privacy and Security ( PIR 2014 ) (2014) (3)
- APPROACHES TO ENVIRONMENT COMPENSATION IN AUTOMATIC SPEECH RECOGNITION (1995) (3)
- Panoramic Video Salient Object Detection with Ambisonic Audio Guidance (2022) (3)
- USB: A Unified Semi-supervised Learning Benchmark (2022) (3)
- Optimization of the DET curve in speaker verification under noisy conditions (2013) (3)
- A Comparison of Latent Variable Models For Conversation Analysis (2011) (3)
- The Problem of Robustness in Automatic Speech Recognition (2012) (3)
- Efficient Integration of Multi-channel Information for Speaker-independent Speech Separation (2020) (3)
- Towards End-to-End Private Automatic Speaker Recognition (2022) (3)
- Editorial: Special Section on Statistical and Perceptual Audio Processing (2006) (2)
- Mask Proxy Loss for Text-Independent Speaker Recognition (2020) (2)
- Word Particles Applied to Information Retrieval (2009) (2)
- Voice driven applications in non-stationary and chaotic environment (2007) (2)
- VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning (2022) (2)
- Exploring Optimal DNN Architecture for End-to-End Beamformers Based on Time-frequency References (2020) (2)
- Block-wise incremental adaptation algorithm for maximum kurtosis beamforming (2011) (2)
- A Paradigm for Limited Vocabulary Speech Recognition Based on Redundant Spectro-Temporal Feature Sets (2011) (2)
- Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction (2022) (2)
- The use of sense in unsupervised training of acoustic models for ASR systems (2010) (2)
- Hierarchical Routing Mixture of Experts (2019) (2)
- Investigation on effectiveness of mid-level feature representation for semantic boundary detection in news video (2003) (2)
- DISTANT MULTI-SPEAKER VOICE ACTIVITY DETECTION USING RELATIVE ENERGY RATIO (2011) (2)
- Framework for Evaluation of Sound Event Detection in Web Videos (2017) (2)
- On the combination of voice prompt suppression with maximum kurtosis beamforming (2011) (2)
- W-Net BF: DNN-based Beamformer Using Joint Training Approach (2019) (2)
- Controlled AutoEncoders to Generate Faces from Voices (2021) (2)
- Attacking a privacy preserving music matching algorithm (2012) (2)
- A novel ranking method for multiple classifier systems (2015) (2)
- Non-Determinism in Neural Networks for Adversarial Robustness (2019) (2)
- Spectrogram dimensionality reductionwith independence constraints (2010) (2)
- Post-masking: a hybrid approach to array processing for speech recognition (2014) (2)
- Ungrounded independent non-negative factor analysis (2010) (1)
- Ensemble approach in speaker verification (2013) (1)
- Informedia at TRECVID2014: MED and MER, Semantic Indexing, Surveillance Event Detection (2014) (1)
- SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning (2023) (1)
- BLOCK-SPARSE BASIS SETS FOR IMPROVED AUDIO CONTENT ESTIMATION (2013) (1)
- Speech Analytics for Medical Applications (2018) (1)
- Higher-order Network for Action Recognition (2018) (1)
- On the implementation of a secure musical database matching (2011) (1)
- APPLYING RECURRENT NEURAL NETWORK TO ARABIC NAMED ENTITY RECOGNITION (2016) (1)
- An Embarrassingly Simple Baseline for Imbalanced Semi-Supervised Learning (2022) (1)
- A Boosting Approach for Confidence Scoring (2021) (1)
- Plagiarism Detection in Polyphonic Music using Monaural Signal Separation (2015) (1)
- A Comparison of Prosody Modification using Instants of Significant Excitation and Mel-Cepstral Vocoder (2011) (1)
- Automatic In-the-wild Dataset Annotation with Deep Generalized Multiple Instance Learning (2020) (1)
- The phonetic bases of vocal expressed emotion: natural versus acted (2019) (1)
- FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding (2023) (1)
- Adaptation of SVM for MIL for inferring the polarity of movies and movie reviews (2016) (1)
- Compensation for speech recognition in degraded acoustical environments (1996) (1)
- Informedia @ TRECVID 2014 (2014) (1)
- A hybrid physical and statistical dynamic articulatory framework incorporating analysis-by-synthesis for improved phone classification (2010) (1)
- Shadowing as peer experiential learning for faculty instructional development strategy: A case study on a computer science course (2021) (1)
- An information filter for voice prompt suppression (2011) (1)
- Exploiting Non-Linear Redundancy for Neural Model Compression (2020) (1)
- Optimal Strategies for Matching and Retrieval Problems by Comparing Covariates (2018) (1)
- Cross-utterance context for multimodal video transcription (2022) (1)
- Masked Proxy Loss for Text-Independent Speaker Verification (2020) (1)
- Language identification using spectro-temporal patch features (2012) (1)
- Artificial Creative Intelligence: Breaking the Imitation Barrier (2020) (1)
- Reconstructing Noise-Corrupted Spectrographic Components for Robust Speech Recognition (2011) (1)
- Constant Random Perturbations Provide Adversarial Robustness with Minimal Effect on Accuracy (2021) (1)
- SPEAKER VERIFICATION USING SECURE BINARY EMBEDDINGS JosPortˆ (2013) (1)
- Missing-Feature Approaches in Speech Recognition [ Improving recognition accuracy in noise by using partial spectrographic information ] (2009) (1)
- Ble Mixture Gaussian Distributions with the following Means and Variances: 4.2. Performance of Ratz on Speech Recogni- Tion in Noise 5. Summary Acknowledgements -0.2 0.2 0.4 0.6 0.8 1.0 1.2 (0)
- Improving Perceptual Quality, Intelligibility, and Acoustics on VoIP Platforms (2023) (0)
- Semi-supervised context-aware discovery of unknown audio concepts (2013) (0)
- APPROACHES TO MICROPHONE INDEPENDENCE IN AUTOMATIC SPEECH RECOGNITION (2003) (0)
- A Multipath Sparse Beamfroming Method (2013) (0)
- Improving sound event detection with ontologies (2023) (0)
- Privacy Preserving Biometric Identity Verification (2017) (0)
- Interactive Evaluation of Classifiers Under Limited Resources (2018) (0)
- 1 . 1 Motion SIFT ( MoSIFT ) Feature (2012) (0)
- Probabilistic deduction of symbol mappings for extension of lexicons (2007) (0)
- Quantization-basedLanguageModel Compression (2001) (0)
- Synthesizing speech from surface electromyography and acoustic Doppler sonar. (2010) (0)
- Improving headphone spatialization for stereo music (2015) (0)
- XNOR-FORMER: Learning Accurate Approximations in Long Speech Transformers (2022) (0)
- A Boosting Appr oach for Confidence Scoring (2001) (0)
- N ov 2 01 6 AUDIO CONTENT BASED GEOTAGGING IN MULTIMEDIA (2018) (0)
- Technical Program for the 2005 IEEE Workshop on Automatic Speech Recognition and Understanding - ASRU2005 (2005) (0)
- Privacy-preserving Automatic Speaker Diarization (2022) (0)
- Speech Recognizer Based Maximum Likelihood Beamforming (2005) (0)
- Surveillance Event Detection ( SED ) Discriminative Features and Interactive Feedback Utilization (2012) (0)
- Speech Communication: Preface (2011) (0)
- THE INCREDIBLE SHRINKING NEURAL NETWORK: NEW PERSPECTIVES (2016) (0)
- Training image classifiers using Semi-Weak Label Data (2021) (0)
- CLASSIFICATION USING ONTOLOGY-BASED NEURAL NETWORKS (2018) (0)
- Approach to Learning Generalized Audio Representation Through Batch Embedding Covariance Regularization and Constant-Q Transforms (2023) (0)
- Bear the Query in Mind: Visual Grounding with Query-conditioned Convolution (2022) (0)
- Robust speech recognition using missing features (2003) (0)
- MAXIMUM KURTOSIS BEAMFORMING (2011) (0)
- Towards Adversarial Robustness Via Compact Feature Representations (2021) (0)
- There is more than one kind of robustness: Fooling Whisper with adversarial examples (2022) (0)
- Unifying the Discrete and Continuous Emotion labels for Speech Emotion Recognition (2022) (0)
- Automatic assessment of student “reasoning” processes in face-to-face interactions using speech data (2010) (0)
- Locality constrained transitive distance clustering on speech data (2015) (0)
- Audition for multimedia computing (2017) (0)
- Understanding Political Polarisation using Language Models: A dataset and method (2023) (0)
- A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research (2016) (0)
- Not all broken defenses are equal: The dead angles of adversarial accuracy (2022) (0)
- Text, Speech, and Dialogue: 21st International Conference, TSD 2018, Brno, Czech Republic, September 11-14, 2018, Proceedings (2018) (0)
- Inferring missing spectral data. (2008) (0)
- Measuring prevalence of other-oriented transactive contributions using an automated measure of speech style accommodation (2013) (0)
- An Approach to Ontological Learning from Weak Labels (2023) (0)
- Watch What You Pretrain For: Targeted, Transferable Adversarial Examples on Self-Supervised Speech Recognition models (2022) (0)
- Two New Techniques for Natural Spoken User Interfaces (2006) (0)
- Discriminative Dictionary Learning for Autism Spectrum Disorder Identification (2021) (0)
- Voice biometrics: privacy in paralinguistic and extralinguistic tasks for health applications (2021) (0)
- Scalable Audio-Content Analysis (2010) (0)
- Crowdsourced Video Subtitling with Adaptation Based on User-Corrected Lattices (2016) (0)
- How many perturbations break this model? Evaluating robustness beyond adversarial accuracy (2022) (0)
- PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement (2023) (0)
- Discovery of temporal patterns in continuous nonrandom sound sequences. (2008) (0)
- Properties and Applications of Ultrasonic Doppler Sensing in Human-Computer Interaction (2012) (0)
- AudioPairBank: towards a large-scale tag-pair-based audio content analysis (2018) (0)
- a ) Integrated I / O Environment 5 . 1 . 1 VOICE-A Voice Oriented Interactive Computing Environment (2014) (0)
- EXPANSION OF NARROWBAND S TRIX FACTORIZA (0)
- Describing emotions with acoustic property prompts for speech emotion recognition (2022) (0)
- SPEECH RECOGNITION RATE FOR NON-NATIVE SPEAKERS (2021) (0)
- Learnable Higher-order Representation for Action Recognition (2019) (0)
- Optimal Strategies For Comparing Covariates To Solve Matching Problems (2021) (0)
- Positional Encoding for Capturing Modality Specific Cadence for Emotion Detection (2022) (0)
- Ontological Learning from Weak Labels (2022) (0)
- Preface (2011) (0)
This paper list is powered by the following services:
What Schools Are Affiliated With Bhiksha Raj?
Bhiksha Raj is affiliated with the following schools: