Bhiksha Raj

Bhiksha Raj's AcademicInfluence.com Rankings

Bhiksha Raj

Engineering

#5195

World Rank

#6440

Historical Rank

Electrical Engineering

#1421

World Rank

#1515

Historical Rank

engineering Degrees

Bhiksha Raj

Computer Science

#6674

World Rank

#7034

Historical Rank

Algorithms

#242

World Rank

#245

Historical Rank

Machine Learning

#2245

World Rank

#2273

Historical Rank

Database

#3757

World Rank

#3911

Historical Rank

computer-science Degrees

Download Badge

Engineering
Computer Science

Bhiksha Raj's Degrees

PhD Electrical and Computer Engineering Carnegie Mellon University
Masters Electrical and Computer Engineering Carnegie Mellon University

Why Is Bhiksha Raj Influential?

(Suggest an Edit or Addition)

(See a Problem?)

Bhiksha Raj's Published Works

Number of citations in a given year to any of this author's works

Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author

Published Works

SphereFace: Deep Hypersphere Embedding for Face Recognition (2017) (2177)
Sphinx-4: a flexible open source framework for speech recognition (2004) (533)
A vector Taylor series approach for environment-independent speech recognition (1996) (483)
DCASE2017 Challenge Setup: Tasks, Datasets and Baseline System (2017) (421)
A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research (2016) (306)
Beyond Gaussian Pyramid: Multi-skip Feature Stacking for action recognition (2014) (287)
Speech denoising using nonnegative matrix factorization with priors (2008) (277)
Supervised and Semi-supervised Separation of Sounds from Single-Channel Mixtures (2007) (267)
Reconstruction of missing features for robust speech recognition (2004) (259)
Missing-feature approaches in speech recognition (2005) (256)
THE CMU SPHINX-4 SPEECH RECOGNITION SYSTEM (2001) (189)
On the Origin of Deep Learning (2017) (185)
Greedy sparsity-constrained optimization (2011) (184)
A Probabilistic Latent Variable Model for Acoustic Modeling (2006) (180)
A Bayesian classifier for spectrographic mask estimation for missing feature speech recognition (2004) (173)
Multiparty Differential Privacy via Aggregation of Locally Trained Classifiers (2010) (160)
Audio Event Detection using Weakly Labeled Data (2016) (151)
Probabilistic Latent Variable Models as Nonnegative Factorizations (2008) (145)
Likelihood-maximizing beamforming for robust hands-free speech recognition (2004) (139)
Non-negative Hidden Markov Modeling of Audio with Application to Source Separation (2010) (134)
Design of the CMU sphinx-4 decoder (2003) (132)
Soft Mask Methods for Single-Channel Speaker Separation (2007) (129)
Non-negative matrix factorization based compensation of music for automatic speech recognition (2010) (129)
Microphone Array Processing for Distant Speech Recognition: From Close-Talking Microphones to Far-Field Sensors (2012) (121)
The 1996 Hub-4 Sphinx-3 System (1997) (107)
One-handed gesture recognition using ultrasonic Doppler sonar (2009) (105)
Regularized non-negative matrix factorization with temporal dependencies for speech denoising (2008) (98)
Techniques for Noise Robustness in Automatic Speech Recognition (2012) (91)
Sparse and shift-invariant feature extraction from non-negative data (2008) (91)
Speech in Noisy Environments: robust automatic segmentation, feature extraction, and hypothesis combination (2001) (90)
Automatic generation of subword units for speech recognition systems (2002) (87)
Sound Event Detection in the DCASE 2017 Challenge (2019) (85)
Sparse Overcomplete Latent Variable Decomposition of Counts Data (2007) (84)
A Sparse Non-Parametric Approach for Single Channel Separation of Known Sounds (2009) (78)
Privacy-Preserving Speaker Verification and Identification Using Gaussian Mixture Models (2013) (78)
Voice Impersonation Using Generative Adversarial Networks (2018) (78)
The 1997 CMU Sphinx-3 English Broadcast News Transcription System (1997) (77)
Acoustic Doppler sonar for gait recogination (2007) (75)
Data-driven environmental compensation for speech recognition: A unified approach (1998) (74)
Active-Set Newton Algorithm for Overcomplete Non-Negative Representations of Audio (2013) (70)
Compositional Models for Audio Processing: Uncovering the structure of sound mixtures (2015) (69)
COMPENSATION FOR ENVIRONMENTAL DEGRADATION IN AUTOMATIC SPEECH RECOGNITION (1999) (64)
Unsupervised Learning of Acoustic Unit Descriptors for Audio Content Representation and Classification (2011) (61)
Shift-Invariant Probabilistic Latent Component Analysis (2007) (58)
Audio event detection from acoustic unit occurrence patterns (2012) (57)
A boosting approach for confidence scoring (2001) (56)
The effects of background music on speech recognition accuracy (1997) (55)
Multivariate-Gaussian-based cepstral normalization for robust speech recognition (1995) (54)
Ultrasonic Doppler Sensing in HCI (2012) (53)
Quantization-based language model compression (2001) (52)
Microphone array processing for distant speech recognition: Towards real-world deployment (2012) (52)
A Closer Look at Weak Label Learning for Audio Events (2018) (52)
Multi-channel source separation by factorial HMMs (2003) (51)
Privacy-preserving speech processing: cryptographic and string-matching frameworks show promise (2013) (51)
Gammatone sub-band magnitude-domain dereverberation for ASR (2011) (51)
Measuring prevalence of other-oriented transactive contributions using an automated measure of speech style accommodation (2013) (50)
Bandwidth expansion of narrowband speech using non-negative matrix factorization (2005) (50)
Swara Histogram Based Structural Analysis And Identification Of Indian Classical Ragas (2013) (48)
Deep CNN Framework for Audio Event Recognition using Weakly Labeled Web Data (2017) (47)
Missing data imputation for spectral audio signals (2009) (47)
Sparse Overcomplete Decomposition for Single Channel Speaker Separation (2007) (46)
On tracking noise with linear dynamical system models (2004) (46)
Privacy preserving probabilistic inference with Hidden Markov Models (2011) (45)
Automatic clustering and generation of contextual questions for tied states in hidden Markov models (1999) (45)
Signal separation for robust speech recognition based on phase difference information obtained in the frequency domain (2009) (44)
Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording (2016) (44)
Disjoint Mapping Network for Cross-modal Matching of Voices and Faces (2018) (44)
Channel selection based on multichannel cross-correlation coefficients for distant speech recognition (2011) (43)
Ultrasonic Doppler Sensor for Voice Activity Detection (2007) (41)
Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery (2017) (40)
Phoneme-Dependent NMF for Speech Enhancement in Monaural Mixtures (2011) (39)
Face Reconstruction from Voice using Generative Adversarial Networks (2019) (39)
Missing Data Imputation for Time-Frequency Representations of Audio Signals (2011) (39)
A hierarchical system for word discovery exploiting DTW-based initialization (2013) (39)
CMU-Informedia @ TRECVID 2013 Multimedia Event Detection (2013) (38)
Efficient autism spectrum disorder prediction with eye movement: A machine learning framework (2015) (38)
A minimum mean squared error estimator for single channel speaker separation (2004) (38)
Latent Dirichlet Decomposition for Single Channel Speaker Separation (2006) (36)
Scale independent raga identification using chromagram patterns and swara based features (2013) (36)
Audio event and scene recognition: A unified approach using strongly and weakly labeled data (2016) (35)
Joint sparsity models for wideband array processing (2011) (35)
Privacy-preserving speaker verification as password matching (2012) (34)
Exploring the Best Loss Function for DNN-Based Low-latency Speech Enhancement with Temporal Convolutional Networks (2020) (34)
Speech recognizer-based microphone array processing for robust hands-free speech recognition (2002) (34)
Cross Modal Audio Search and Retrieval with Joint Embeddings Based on Text and Audio (2019) (34)
Contrast and Order Representations for Video Self-supervised Learning (2021) (33)
Classifier-based mask estimation for missing feature methods of robust speech recognition (2000) (33)
Cepstral compensation by polynomial approximation for environment-independent speech recognition (1996) (33)
Is normalization indispensable for training deep neural network? (2020) (33)
Reconstruction of damaged spectrographic features for robust speech recognition (2000) (32)
Tracking noise via dynamical systems with a continuum of states (2003) (32)
Informedia@TrecVID 2014: MED and MER (2014) (32)
Inference of missing spectrographic features for robust speech recognition (1998) (31)
Ultrasonic Doppler sensor for speaker recognition (2008) (30)
A Survey: Time Travel in Deep Learning Space: An Introduction to Deep Learning Models and How Deep Learning Models Evolved from the Initial Ideas (2015) (30)
Complex recurrent neural networks for denoising speech signals (2015) (29)
Environmental Noise Embeddings for Robust Speech Recognition (2016) (28)
Content-Based Representations of Audio Using Siamese Neural Networks (2017) (27)
A Comparison Between Spoken Queries and Menu-Based Interfaces for In-car Digital Music Selection (2005) (27)
Viral Spread via Entertainment and Voice-Messaging Among Telephone Users in India (2016) (25)
Synthesizing speech from Doppler signals (2010) (24)
Unsupervised word segmentation from noisy input (2013) (24)
Ultrasonic sensing for robust speech recognition (2010) (24)
HEAR: Holistic Evaluation of Audio Representations (2022) (22)
Informedia e-lamp @ TRECVID 2012 multimedia event detection and recounting MED and MER (2012) (22)
Weakly supervised scalable audio content analysis (2016) (22)
Privacy Preserving Speaker Verification Using Adapted GMMs (2011) (22)
Latent-variable decomposition based dereverberation of monaural and multi-channel signals (2010) (22)
Learning-Based Auditory Encoding for Robust Speech Recognition (2010) (22)
Recognizing speech from simultaneous speakers (2005) (21)
A Unifying Analysis of Projected Gradient Descent for $ell_p$-constrained Least Squares (2011) (21)
Iterative Bayesian word segmentation for unsupervised vocabulary discovery from phoneme lattices (2014) (21)
Active-set newton algorithm for non-negative sparse coding of audio (2014) (21)
Signal and Feature Compensa-tion Methods for Robust Speech Recognition (2002) (21)
Content-based Video Indexing and Retrieval Using Corr-LDA (2016) (21)
Analysis-by-synthesis features for speech recognition (2008) (21)
THE 1999 CMU 10X REAL TIME BROADCAST NEWS TRANSCRIPTION SYSTEM (1999) (21)
RECOGNITION OF CONTINUOUS BROADCAST NEWS WITH MULTIPLE UNKNOWN SPEAKERS AND ENVIRONMENTS (1995) (20)
Robust 1-bit Compressive Sensing via Gradient Support Pursuit (2013) (20)
Soft mask estimation for single channel speaker separation (2004) (20)
Reducing communication overhead in distributed learning by an order of magnitude (almost) (2015) (20)
The Right to Talk: An Audio-Visual Transformer Approach (2021) (19)
Unsupervised Structure Discovery for Semantic Analysis of Audio (2012) (19)
Calibration of microphone arrays for improved speech recognition (2001) (19)
Lossless compression of language model structure and word identifiers (2003) (19)
An Unsupervised Dynamic Bayesian Network Approach to Measuring Speech Style Accommodation (2012) (19)
Structured redefinition of sound units by merging and splitting for improved speech recognition (2000) (19)
Speech-recognizer-based filter optimization for microphone array processing (2003) (19)
A Comparative Study Of Indian And Western Music Forms (2013) (18)
Classification in Likelihood Spaces (2004) (18)
Classifier-based non-linear projection for adaptive endpointing of continuous speech (2003) (18)
Sound event classification using ontology-based neural networks (2018) (17)
Spectrographic seam patterns for discriminative word spotting (2012) (17)
An approach for self-training audio event detectors using web data (2016) (17)
The MERL SpokenQuery information retrieval system a system for retrieving pertinent documents from a spoken query (2002) (17)
Privacy-Preserving Speaker Authentication (2012) (17)
Example-Driven Bandwidth Expansion (2007) (17)
The automatic assessment of knowledge integration processes in project teams (2011) (17)
Unsupervised hierarchical structure induction for deeper semantic analysis of audio (2013) (17)
Secure Modular Hashing (2015) (17)
Large Margin Gaussian Mixture Models with Differential Privacy (2012) (17)
On the Appropriateness of Complex-Valued Neural Networks for Speech Enhancement (2016) (16)
Binary Sparse Coding of Convolutive Mixtures for Sound Localization and Separation via Spatialization (2016) (16)
An iterative least-squares technique for dereverberation (2011) (16)
Supervised monaural source separation based on autoencoders (2017) (16)
Robust Speech Recognition: The case for restoring missing features (2001) (16)
Rapid development of public health education systems in low-literacy multilingual environments: combating ebola through voice messaging (2015) (16)
The Incredible Shrinking Neural Network: New Perspectives on Learning Representations Through The Lens of Pruning (2017) (16)
Probabilistic Factorization of Non-negative Data with Entropic Co-occurrence Constraints (2009) (15)
FoolHD: Fooling Speaker Identification by Highly Imperceptible Adversarial Disturbances (2020) (15)
FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning (2022) (15)
Short-term analysis for estimating physical parameters of speakers (2016) (15)
The relationship of voice onset time and Voice Offset Time to physical age (2016) (15)
Automatic generation of phone sets and lexical transcriptions (2000) (14)
Learning Model-Based Sparsity via Projected Gradient Descent (2012) (14)
Secure binary embeddings of front-end factor analysis for privacy preserving speaker verification (2013) (14)
Classification with free energy at raised temperatures (2003) (14)
A Speech-in List-out Approach to Spoken User Interfaces (2004) (13)
An FFT-Based Companding Front End for Noise-Robust Automatic Speech Recognition (2007) (13)
SphereFace2: Binary Classification is All You Need for Deep Face Recognition (2021) (13)
SPECTROGRAM DIMENSIONALITY REDUCTION WITH INDEPENDENCE CONSTRAINTS (2010) (13)
Be Careful What You Backpropagate: A Case For Linear Output Activations & Gradient Boosting (2017) (13)
Doppler based speed estimation of vehicles using passive sensor (2013) (13)
Formant manipulations in voice disguise by mimicry (2016) (12)
SphereFace Revived: Unifying Hyperspherical Face Recognition (2021) (12)
Forensic anthropometry from voice: An articulatory-phonetic approach (2016) (12)
Model Compensation and Matched Condition Methods for Robust Speech Recognition (2002) (12)
Compositional models for audio processing (2014) (12)
Privacy-preserving speaker verification using garbled GMMS (2014) (12)
Recognizing talking faces from acoustic Doppler reflections (2008) (12)
Optimization of the DET curve in speaker verification (2012) (11)
Sensor and Data Systems, Audio-Assisted Cameras and Acoustic Doppler Sensors (2007) (11)
Continuous Feature Adaptation for Non-Native Speech Recognition (2007) (11)
Classifier Risk Estimation under Limited Labeling Resources (2016) (11)
On the Origin of Deep Learning On the Origin of Deep Learning (2017) (11)
Unsupervised Fusion Weight Learning in Multiple Classifier Systems (2015) (11)
ADAPTATION AND COMPENSATION : APPROACHES TO MICR OPHONE AND SPEAKER INDEPENDENCE IN AUTOMATIC SPEECH RECOGNITION (1996) (11)
Hide and Speak: Towards Deep Neural Networks for Speech Steganography (2019) (11)
Learning Sound Events From Webly Labeled Data (2018) (10)
Event detection in short duration audio using Gaussian Mixture Model and Random Forest Classifier (2013) (10)
Microphone Array Post-filter based on Spatially-Correlated Noise Measurements for Distant Speech Recognition (2012) (10)
Handcrafted Local Features are Convolutional Neural Networks (2015) (10)
Detection and Evaluation of Human and Machine Generated Speech in Spoofing Attacks on Automatic Speaker Verification Systems (2020) (10)
When to Interrupt: A Comparative Analysis of Interruption Timings Within Collaborative Communication Tasks (2017) (10)
Human Behaviour Recognition Using Wifi Channel State Information (2019) (10)
The Markov selection model for concurrent speech recognition (2010) (10)
A joint decoding algorithm for multiple-example-based addition of words to a pronunciation lexicon (2009) (10)
Detecting Psychological Distress in Adults Through Transcriptions of Clinical Interviews (2016) (9)
Privacy Preserving Speech Processing (2013) (9)
Speaker tracking with spherical microphone arrays (2013) (9)
Discriminatively trained dependency language modeling for conversational speech recognition (2013) (9)
Maximum kurtosis beamforming with a subspace filter for distant speech recognition (2011) (9)
Mining Multimodal Repositories for Speech Affecting Diseases (2018) (9)
HEAR 2021: Holistic Evaluation of Audio Representations (2022) (9)
Unsupervised Word Discovery from Phonetic Input Using Nested Pitman-Yor Language Modeling (2013) (9)
DCASE 2017 Task 1: Acoustic Scene Classification Using Shift-Invariant Kernels and Random Features (2018) (9)
A unified approach for robust speech recognition (1995) (9)
Features and Kernels for Audio Event Recognition (2016) (9)
Discovering sound concepts and acoustic relations in text (2016) (8)
Self-Supervised 3D Face Reconstruction via Conditional Estimation (2021) (8)
Creating a linguistic plausibility dataset with non-expert annotators (2010) (8)
Detecting sound objects in audio recordings (2014) (8)
AudioSentibank: Large-scale Semantic Ontology of Acoustic Concepts for Audio Content Analysis (2016) (8)
Inferring room semantics using acoustic monitoring (2017) (8)
Deriving vocal tract shapes from electromagnetic articulograph data via geometric adaptation and matching (2009) (8)
AN ACOUSTIC DOPPLER-BASED FRONT END FOR HANDS FREE SPOKEN USER INTERFACES (2006) (8)
Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection (2020) (7)
Hide and Speak: Deep Neural Networks for Speech Steganography (2019) (7)
The REVERB Challenge: A Benchmark Task for Reverberation-Robust ASR Techniques (2017) (7)
NELS - Never-Ending Learner of Sounds (2018) (7)
Bandwidth Expansionwith a pólya URN Model (2007) (7)
Spokenquery: an alternate approach to chosing items with speech (2004) (7)
Privacy Preserving Protocols for Eigenvector Computation (2010) (7)
Logsum Using Garbled Circuits (2015) (7)
Privacy-Preserving Important Passage Retrieval (2014) (7)
Factorization With Temporal Dependencies for Speech Denoising (2008) (7)
Acoustic Scene Classification Using Discrete Random Hashing for Laplacian Kernel Machines (2018) (6)
Deriving Compact Feature Representations Via Annealed Contraction (2020) (6)
Reconstructing faces from voices (2019) (6)
Improving weakly supervised sound event detection with self-supervised auxiliary tasks (2021) (6)
Efficient Protocols for Principal Eigenvector Computation over Private Data (2011) (6)
Sherlock: A Crowd-sourced System For Automatic Tagging Of Indoor Floor Plans (2020) (6)
Querying Depression Vlogs (2018) (6)
Towards fusion of feature extraction and acoustic model training: a top down process for robust speech recognition (2009) (6)
Privacy-Preserving Multi-Document Summarization (2015) (6)
Large Margin Multiclass Gaussian Classification with Differential Privacy (2010) (6)
Domain adduced state tying for cross-domain acoustic modelling (1999) (6)
A companding front end for noise-robust automatic speech recognition (2005) (6)
Sequential Randomized Smoothing for Adversarially Robust Speech Recognition (2021) (6)
Privacy-preserving speaker verification using secure binary embeddings (2014) (6)
Speaker verification using Secure Binary Embeddings (2013) (6)
Topic and Prosodic Modeling for Interruption Management in Multi-User Multitasking Communication Interactions (2017) (5)
Learning contextual relevance of audio segments using discriminative models over AUD sequences (2011) (5)
Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session (2023) (5)
CEPSTRAL COMPENSATION USING STATISTICAL LINEARIZATION (2000) (5)
Privacy-preserving Query-by-Example Speech Search (2015) (5)
Comparison of width-wise and length-wise language model compression (2001) (5)
USB: A Unified Semi-supervised Learning Benchmark for Classification (2022) (5)
Signal and Feature Compensation Methods for Robust Speech Recognition (2018) (5)
AudioPairBank: towards a large-scale tag-pair-based audio content analysis (2016) (5)
On the pragmatism of using binary classifiers over data intensive neural network classifiers for detection of COVID-19 from voice (2022) (5)
Neural Regression Trees (2018) (5)
SEPARATING A FOREGROUND SINGER FROM (2006) (5)
A Comparative Analysis of Human-Mediated and System-Mediated Interruptions for Multi-user, Multitasking Interactions (2017) (5)
Microphone array processing for distant speech recognition: Spherical arrays (2012) (5)
The Basics of Automatic Speech Recognition (2012) (5)
Speech Separation by Humans and Machines (2004) (5)
Exploiting Temporal Sequence Structure for Semantic Analysis of Multimedia (2012) (5)
The in-the-Wild Speech Medical Corpus (2021) (5)
Probabilistic Latent Variable Model for Sparse Decompositions of Non-negative Data (2009) (4)
Online Video Instance Segmentation via Robust Context Fusion (2022) (4)
Speech-Based UI Design for the Automobile (2008) (4)
Optimizing Neural Network Embeddings Using a Pair-Wise Loss for Text-Independent Speaker Verification (2019) (4)
Privacy Preserving Spam Filtering (2011) (4)
Time Signal Classification Using Random Convolutional Features (2019) (4)
Identifying Actions for Sound Event Classification (2021) (4)
TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement (2023) (4)
Predicting Idea Co-Construction in Speech Data using Insights from Sociolinguistics (2012) (4)
Nonlinear Semi-Parametric Models for Survival Analysis (2019) (4)
A two factor transformation for speaker verification through ℓ1 comparison (2017) (4)
Audio Content Based Geotagging in Multimedia (2016) (4)
Semantic Indexing (2014) (4)
Privacy preserving Distance computation using somewhat-trusted third parties (2016) (4)
R^2VOS: Robust Referring Video Object Segmentation via Relational Multimodal Cycle Consistency (2022) (4)
A Corrective Learning Approach for Text-Independent Speaker Verification (2018) (4)
Feature compensation with secondary sensor measurements for robust speech recognition (2005) (4)
Multimedia Event Detection and Recounting (2013) (3)
Structured sparse coding for microphone array location calibration (2012) (3)
In-the-Wild End-to-End Detection of Speech Affecting Diseases (2019) (3)
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation (2022) (3)
A paired test for recognizer selection with untranscribed data (2011) (3)
Improving Speech Enhancement through Fine-Grained Speech Characteristics (2022) (3)
Detecting gender differences in perception of emotion in crowdsourced data (2019) (3)
An integrated approach to improve speech recognition rate for non-native speakers (2006) (3)
Recent improvements of ASR models in the face of adversarial attacks (2022) (3)
Analysing Speech for Clinical Applications (2018) (3)
Point3D: tracking actions as moving points with 3D CNNs (2022) (3)
Proceeding of the 1 st International Workshop on Privacy-Preserving IR : When Information Retrieval Meets Privacy and Security ( PIR 2014 ) (2014) (3)
APPROACHES TO ENVIRONMENT COMPENSATION IN AUTOMATIC SPEECH RECOGNITION (1995) (3)
Panoramic Video Salient Object Detection with Ambisonic Audio Guidance (2022) (3)
USB: A Unified Semi-supervised Learning Benchmark (2022) (3)
Optimization of the DET curve in speaker verification under noisy conditions (2013) (3)
A Comparison of Latent Variable Models For Conversation Analysis (2011) (3)
The Problem of Robustness in Automatic Speech Recognition (2012) (3)
Efficient Integration of Multi-channel Information for Speaker-independent Speech Separation (2020) (3)
Towards End-to-End Private Automatic Speaker Recognition (2022) (3)
Editorial: Special Section on Statistical and Perceptual Audio Processing (2006) (2)
Mask Proxy Loss for Text-Independent Speaker Recognition (2020) (2)
Word Particles Applied to Information Retrieval (2009) (2)
Voice driven applications in non-stationary and chaotic environment (2007) (2)
VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning (2022) (2)
Exploring Optimal DNN Architecture for End-to-End Beamformers Based on Time-frequency References (2020) (2)
Block-wise incremental adaptation algorithm for maximum kurtosis beamforming (2011) (2)
A Paradigm for Limited Vocabulary Speech Recognition Based on Redundant Spectro-Temporal Feature Sets (2011) (2)
Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction (2022) (2)
The use of sense in unsupervised training of acoustic models for ASR systems (2010) (2)
Hierarchical Routing Mixture of Experts (2019) (2)
Investigation on effectiveness of mid-level feature representation for semantic boundary detection in news video (2003) (2)
DISTANT MULTI-SPEAKER VOICE ACTIVITY DETECTION USING RELATIVE ENERGY RATIO (2011) (2)
Framework for Evaluation of Sound Event Detection in Web Videos (2017) (2)
On the combination of voice prompt suppression with maximum kurtosis beamforming (2011) (2)
W-Net BF: DNN-based Beamformer Using Joint Training Approach (2019) (2)
Controlled AutoEncoders to Generate Faces from Voices (2021) (2)
Attacking a privacy preserving music matching algorithm (2012) (2)
A novel ranking method for multiple classifier systems (2015) (2)
Non-Determinism in Neural Networks for Adversarial Robustness (2019) (2)
Spectrogram dimensionality reductionwith independence constraints (2010) (2)
Post-masking: a hybrid approach to array processing for speech recognition (2014) (2)
Ungrounded independent non-negative factor analysis (2010) (1)
Ensemble approach in speaker verification (2013) (1)
Informedia at TRECVID2014: MED and MER, Semantic Indexing, Surveillance Event Detection (2014) (1)
SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning (2023) (1)
BLOCK-SPARSE BASIS SETS FOR IMPROVED AUDIO CONTENT ESTIMATION (2013) (1)
Speech Analytics for Medical Applications (2018) (1)
Higher-order Network for Action Recognition (2018) (1)
On the implementation of a secure musical database matching (2011) (1)
APPLYING RECURRENT NEURAL NETWORK TO ARABIC NAMED ENTITY RECOGNITION (2016) (1)
An Embarrassingly Simple Baseline for Imbalanced Semi-Supervised Learning (2022) (1)
A Boosting Approach for Conﬁdence Scoring (2021) (1)
Plagiarism Detection in Polyphonic Music using Monaural Signal Separation (2015) (1)
A Comparison of Prosody Modification using Instants of Significant Excitation and Mel-Cepstral Vocoder (2011) (1)
Automatic In-the-wild Dataset Annotation with Deep Generalized Multiple Instance Learning (2020) (1)
The phonetic bases of vocal expressed emotion: natural versus acted (2019) (1)
FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding (2023) (1)
Adaptation of SVM for MIL for inferring the polarity of movies and movie reviews (2016) (1)
Compensation for speech recognition in degraded acoustical environments (1996) (1)
Informedia @ TRECVID 2014 (2014) (1)
A hybrid physical and statistical dynamic articulatory framework incorporating analysis-by-synthesis for improved phone classification (2010) (1)
Shadowing as peer experiential learning for faculty instructional development strategy: A case study on a computer science course (2021) (1)
An information filter for voice prompt suppression (2011) (1)
Exploiting Non-Linear Redundancy for Neural Model Compression (2020) (1)
Optimal Strategies for Matching and Retrieval Problems by Comparing Covariates (2018) (1)
Cross-utterance context for multimodal video transcription (2022) (1)
Masked Proxy Loss for Text-Independent Speaker Verification (2020) (1)
Language identification using spectro-temporal patch features (2012) (1)
Artificial Creative Intelligence: Breaking the Imitation Barrier (2020) (1)
Reconstructing Noise-Corrupted Spectrographic Components for Robust Speech Recognition (2011) (1)
Constant Random Perturbations Provide Adversarial Robustness with Minimal Effect on Accuracy (2021) (1)
SPEAKER VERIFICATION USING SECURE BINARY EMBEDDINGS JosPortˆ (2013) (1)
Missing-Feature Approaches in Speech Recognition [ Improving recognition accuracy in noise by using partial spectrographic information ] (2009) (1)
Ble Mixture Gaussian Distributions with the following Means and Variances: 4.2. Performance of Ratz on Speech Recogni- Tion in Noise 5. Summary Acknowledgements -0.2 0.2 0.4 0.6 0.8 1.0 1.2 (0)
Improving Perceptual Quality, Intelligibility, and Acoustics on VoIP Platforms (2023) (0)
Semi-supervised context-aware discovery of unknown audio concepts (2013) (0)
APPROACHES TO MICROPHONE INDEPENDENCE IN AUTOMATIC SPEECH RECOGNITION (2003) (0)
A Multipath Sparse Beamfroming Method (2013) (0)
Improving sound event detection with ontologies (2023) (0)
Privacy Preserving Biometric Identity Verification (2017) (0)
Interactive Evaluation of Classifiers Under Limited Resources (2018) (0)
1 . 1 Motion SIFT ( MoSIFT ) Feature (2012) (0)
Probabilistic deduction of symbol mappings for extension of lexicons (2007) (0)
Quantization-basedLanguageModel Compression (2001) (0)
Synthesizing speech from surface electromyography and acoustic Doppler sonar. (2010) (0)
Improving headphone spatialization for stereo music (2015) (0)
XNOR-FORMER: Learning Accurate Approximations in Long Speech Transformers (2022) (0)
A Boosting Appr oach for Confidence Scoring (2001) (0)
N ov 2 01 6 AUDIO CONTENT BASED GEOTAGGING IN MULTIMEDIA (2018) (0)
Technical Program for the 2005 IEEE Workshop on Automatic Speech Recognition and Understanding - ASRU2005 (2005) (0)
Privacy-preserving Automatic Speaker Diarization (2022) (0)
Speech Recognizer Based Maximum Likelihood Beamforming (2005) (0)
Surveillance Event Detection ( SED ) Discriminative Features and Interactive Feedback Utilization (2012) (0)
Speech Communication: Preface (2011) (0)
THE INCREDIBLE SHRINKING NEURAL NETWORK: NEW PERSPECTIVES (2016) (0)
Training image classifiers using Semi-Weak Label Data (2021) (0)
CLASSIFICATION USING ONTOLOGY-BASED NEURAL NETWORKS (2018) (0)
Approach to Learning Generalized Audio Representation Through Batch Embedding Covariance Regularization and Constant-Q Transforms (2023) (0)
Bear the Query in Mind: Visual Grounding with Query-conditioned Convolution (2022) (0)
Robust speech recognition using missing features (2003) (0)
MAXIMUM KURTOSIS BEAMFORMING (2011) (0)
Towards Adversarial Robustness Via Compact Feature Representations (2021) (0)
There is more than one kind of robustness: Fooling Whisper with adversarial examples (2022) (0)
Unifying the Discrete and Continuous Emotion labels for Speech Emotion Recognition (2022) (0)
Automatic assessment of student “reasoning” processes in face-to-face interactions using speech data (2010) (0)
Locality constrained transitive distance clustering on speech data (2015) (0)
Audition for multimedia computing (2017) (0)
Understanding Political Polarisation using Language Models: A dataset and method (2023) (0)
A summary of the REVERB challenge: state-of-the-art and remaining challenges in reverberant speech processing research (2016) (0)
Not all broken defenses are equal: The dead angles of adversarial accuracy (2022) (0)
Text, Speech, and Dialogue: 21st International Conference, TSD 2018, Brno, Czech Republic, September 11-14, 2018, Proceedings (2018) (0)
Inferring missing spectral data. (2008) (0)
Measuring prevalence of other-oriented transactive contributions using an automated measure of speech style accommodation (2013) (0)
An Approach to Ontological Learning from Weak Labels (2023) (0)
Watch What You Pretrain For: Targeted, Transferable Adversarial Examples on Self-Supervised Speech Recognition models (2022) (0)
Two New Techniques for Natural Spoken User Interfaces (2006) (0)
Discriminative Dictionary Learning for Autism Spectrum Disorder Identification (2021) (0)
Voice biometrics: privacy in paralinguistic and extralinguistic tasks for health applications (2021) (0)
Scalable Audio-Content Analysis (2010) (0)
Crowdsourced Video Subtitling with Adaptation Based on User-Corrected Lattices (2016) (0)
How many perturbations break this model? Evaluating robustness beyond adversarial accuracy (2022) (0)
PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement (2023) (0)
Discovery of temporal patterns in continuous nonrandom sound sequences. (2008) (0)
Properties and Applications of Ultrasonic Doppler Sensing in Human-Computer Interaction (2012) (0)
AudioPairBank: towards a large-scale tag-pair-based audio content analysis (2018) (0)
a ) Integrated I / O Environment 5 . 1 . 1 VOICE-A Voice Oriented Interactive Computing Environment (2014) (0)
EXPANSION OF NARROWBAND S TRIX FACTORIZA (0)
Describing emotions with acoustic property prompts for speech emotion recognition (2022) (0)
SPEECH RECOGNITION RATE FOR NON-NATIVE SPEAKERS (2021) (0)
Learnable Higher-order Representation for Action Recognition (2019) (0)
Optimal Strategies For Comparing Covariates To Solve Matching Problems (2021) (0)
Positional Encoding for Capturing Modality Specific Cadence for Emotion Detection (2022) (0)
Ontological Learning from Weak Labels (2022) (0)
Preface (2011) (0)

This paper list is powered by the following services:

What Schools Are Affiliated With Bhiksha Raj?

Bhiksha Raj is affiliated with the following schools:

Carnegie Mellon University

Bhiksha Raj's Academic­Influence.com Rankings

Bhiksha Raj's Degrees

Why Is Bhiksha Raj Influential?

Bhiksha Raj's Published Works

Published Works

What Schools Are Affiliated With Bhiksha Raj?

Bhiksha Raj's AcademicInfluence.com Rankings