Sridha Sridharan

Sridha Sridharan's AcademicInfluence.com Rankings

Engineering: World Rank #4489, Historical Rank #5686

Applied Physics: World Rank #1100, Historical Rank #1128

Sridha Sridharan's Degrees

PhD, Electrical and Computer Engineering, Carnegie Mellon University
Masters, Electrical and Computer Engineering, Carnegie Mellon University
Bachelors, Electronics and Communication Engineering, IIT Madras

Why Is Sridha Sridharan Influential?

Sridha Sridharan's Published Works

Number of citations in a given year to any of this author's works

Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author

Published Works

Feature warping for robust speaker verification (2001) (789)
Crowd Counting Using Multiple Local Features (2009) (293)
i-vector Based Speaker Recognition on Short Utterances (2011) (266)
Soft + Hardwired Attention: An LSTM Framework for Human Trajectory Prediction and Abnormal Event Detection (2017) (254)
Automatically Detecting Pain in Video Through Facial Action Units (2011) (246)
Iris Recognition With Off-the-Shelf CNN Features: A Deep Learning Perspective (2018) (237)
Texture for script identification (2005) (198)
Two Stream LSTM: A Deep Fusion Framework for Human Action Recognition (2017) (145)
The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms (2010) (141)
Explicit modelling of session variability for speaker verification (2008) (139)
Face authentication test on the BANCA database (2004) (134)
Person-independent facial expression detection using Constrained Local Models (2011) (131)
Gait energy volumes and frontal gait recognition using depth images (2011) (130)
An evaluation of crowd counting methods, features and regression models (2015) (127)
Large-Scale Analysis of Soccer Matches Using Spatiotemporal Tracking Data (2014) (125)
A Database for Person Re-Identification in Multi-Camera Surveillance Networks (2012) (124)
Long range iris recognition: A survey (2017) (123)
A Mask-Based Approach for the Geometric Calibration of Thermal-Infrared Cameras (2012) (121)
Real-time adaptive background segmentation (2003) (108)
Robust speaker recognition using microphone arrays (2001) (105)
Image2Mesh: A Learning Framework for Single Image 3D Reconstruction (2017) (98)
Textures of optical flow for real-time anomaly detection in crowds (2011) (97)
Automatically detecting pain using facial actions (2009) (94)
Least squares congealing for unsupervised alignment of images (2008) (94)
A phonetic search approach to the 2006 NIST spoken term detection evaluation (2007) (94)
Super-resolution for biometrics: A comprehensive survey (2018) (90)
Liveness detection based on 3D face shape analysis (2013) (89)
An adaptive optical flow technique for person tracking systems (2007) (89)
Improving short utterance i-vector speaker verification using utterance variance modelling and compensation techniques (2014) (85)
Modelling session variability in text-independent speaker verification (2005) (85)
A syntactic approach to automatic lip feature extraction for speaker identification (1998) (77)
Design and Cryptanalysis of Transform-Based Analog Speech Scramblers (1993) (76)
Correlation-aware Adversarial Domain Adaptation and Generalization (2019) (75)
Clustered Blind Beamforming From Ad-Hoc Microphone Arrays (2011) (75)
Factor analysis subspace estimation for speaker verification with short utterances (2008) (74)
Identifying Team Style in Soccer Using Formations Learned from Spatiotemporal Tracking Data (2014) (73)
An approach to statistical lip modelling for speaker identification via chromatic feature extraction (1998) (71)
Elastic LiDAR Fusion: Dense Map-Centric Continuous-Time SLAM (2017) (71)
Deep Learning for Medical Anomaly Detection – A Survey (2020) (69)
In the Pursuit of Effective Affective Computing: The Relationship Between Features and Registration (2012) (68)
Real-Time Adaptive Foreground/Background Segmentation (2005) (67)
Fast Fourier transform based speech encryption system (1991) (67)
Improved Simultaneous Computation of Motion Detection and Optical Flow for Object Tracking (2009) (67)
Fourier Lucas-Kanade Algorithm (2012) (64)
Learning Free-Form Deformations for 3D Object Reconstruction (2018) (64)
Rapid Yet Accurate Speech Indexing Using Dynamic Match Lattice Spotting (2007) (63)
Adaptive Fusion of Speech and Lip Information for Robust Speaker Identification (2001) (62)
Evaluation of image resolution and super-resolution on face recognition performance (2012) (62)
Face recognition from 3D data using Iterative Closest Point algorithm and Gaussian mixture models (2004) (60)
Predicting the Future: A Jointly Learnt Model for Action Anticipation (2019) (60)
Dynamic match phone-lattice searches for very fast and accurate unrestricted vocabulary keyword spotting (2005) (59)
Soft-Biometrics: Unconstrained Authentication in a Surveillance Environment (2009) (59)
Large-Scale Analysis of Formations in Soccer (2013) (58)
Super-Resolved Faces for Improved Face Recognition from Surveillance Video (2007) (58)
Combined 2D/3D Face Recognition Using Log-Gabor Templates (2006) (57)
PLDA based speaker recognition on short utterances (2012) (56)
Improved facial expression recognition via uni-hyperplane classification (2012) (56)
Integration strategies for audio-visual speech processing: applied to text-dependent speaker recognition (2005) (56)
Deep Spatio-Temporal Features for Multimodal Emotion Recognition (2017) (56)
Tracking by Prediction: A Deep Generative Model for Multi-person Localisation and Tracking (2018) (56)
Score-Level Multibiometric Fusion Based on Dempster–Shafer Theory Incorporating Uncertainty Factors (2014) (56)
Feature-domain super-resolution framework for Gabor-based face and iris recognition (2012) (55)
Investigation into Optical Flow Super-Resolution for Surveillance Applications (2005) (54)
Quality-Driven Super-Resolution for Less Constrained Iris Recognition at a Distance and on the Move (2011) (54)
Deep spatio-temporal feature fusion with compact bilinear pooling for multimodal emotion recognition (2018) (52)
The Delta-Phase Spectrum With Application to Voice Activity Detection and Speaker Recognition (2011) (52)
Noise robust voice activity detection using features extracted from the time-domain autocorrelation function (2010) (51)
Deep Classification of Epileptic Signals (2018) (51)
Face recognition using fractal codes (2001) (50)
Local inter-session variability modelling for object classification (2014) (49)
Least-squares congealing for large numbers of images (2009) (49)
Feature-domain super-resolution for iris recognition (2011) (49)
Scene invariant multi camera crowd counting (2014) (48)
Using Synthetic Data to Improve Facial Expression Analysis with 3D Convolutional Networks (2017) (48)
GD-GAN: Generative Adversarial Networks for Trajectory Prediction and Group Detection in Crowds (2018) (48)
Tree Memory Networks for Modelling Long-term Temporal Dependencies (2017) (48)
Methods to improve Gaussian mixture model based language identification system (2002) (48)
Vector quantization based Gaussian modeling for speaker verification (2000) (48)
Efficient constrained local model fitting for non-rigid face alignment (2009) (48)
Trainable speech synthesis with trended hidden Markov models (2001) (47)
Dynamic texture reconstruction from sparse codes for unusual event detection in crowded scenes (2011) (47)
An MRF based abnormal event detection approach using motion and appearance features (2014) (47)
Going Deeper: Autonomous Steering with Neural Memory Networks (2017) (44)
Factor analysis modelling for speaker verification with short utterances (2008) (42)
Experiments in Session Variability Modelling for Speaker Verification (2006) (41)
Adaptive Optical Flow for Person Tracking (2005) (41)
“Sweet-Spot”: Using Spatiotemporal Data to Discover and Predict Shots in Tennis (2013) (40)
Discovering Team Structures in Soccer from Spatiotemporal Data (2016) (40)
Spatiotemporal Camera-LiDAR Calibration: A Targetless and Structureless Approach (2020) (39)
Recent Advances in Camera Planning for Large Area Surveillance (2016) (39)
I-vector based speaker recognition using advanced channel compensation techniques (2014) (39)
Deep Learning for Patient-Independent Epileptic Seizure Prediction Using Scalp EEG Signals (2021) (38)
Adaptive mouth segmentation using chromatic features (2002) (38)
Real-Time Mobile 3D Temperature Mapping (2015) (38)
A Robust Interpretable Deep Learning Classifier for Heart Anomaly Detection Without Segmentation (2020) (38)
Multi-Component Image Translation for Deep Domain Generalization (2018) (38)
Heart Sound Segmentation Using Bidirectional LSTMs With Attention (2020) (37)
Experiments in SVM-based Speaker Verification Using Short Utterances (2010) (36)
Recognising Team Activities from Noisy Data (2013) (36)
Multichannel speech separation by eigendecomposition and its application to co-talker interference removal (1997) (35)
Multi-spectral stereo image matching using mutual information (2004) (34)
On Minimum Discrepancy Estimation for Deep Domain Adaptation (2019) (34)
Sparse Temporal Representations for Facial Expression Recognition (2011) (34)
Locus: LiDAR-based Place Recognition using Spatiotemporal Higher-Order Pooling (2020) (34)
Fusing shrinking and expanding active contour models for robust iris segmentation (2010) (33)
Optimal Camera Planning Under Versatile User Constraints in Multi-Camera Image Processing Systems (2014) (33)
Automated analysis of seizure semiology and brain electrical activity in presurgery evaluation of epilepsy: A focused survey (2017) (33)
Improving deep convolutional neural networks with unsupervised feature learning (2015) (33)
Crowd Counting Using Group Tracking and Local Features (2010) (33)
Discriminant NAP for SVM speaker recognition (2008) (33)
Improving out-domain PLDA speaker verification using unsupervised inter-dataset variability compensation approach (2015) (33)
Robust speaker verification via fusion of speech and lip modalities (1999) (32)
Multiscale Representation for 3-D Face Recognition (2007) (32)
Making Confident Speaker Verification Decisions With Minimal Speech (2010) (32)
Deep facial analysis: A new phase I epilepsy evaluation using computer vision (2018) (32)
MTRNet: A Generic Scene Text Eraser (2019) (31)
Predicting movie ratings from audience behaviors (2014) (31)
Predicting Shot Locations in Tennis Using Spatiotemporal Data (2013) (31)
The use of temporal speech and lip information for multi-modal speaker identification via multi-stream HMMs (2000) (31)
Spoken term detection using fast phonetic decoding (2009) (30)
Initialised eigenlip estimator for fast lip tracking using linear regression (2000) (30)
Speech enhancement using critical band spectral subtraction (1998) (30)
Predicting Serves in Tennis using Style Priors (2015) (30)
Gaussian mixture modelling of broad phonetic and syllabic events for text-independent speaker verification (2005) (29)
A unified approach to multi-pose audio-visual ASR (2007) (29)
Improving pain recognition through better utilisation of temporal information (2008) (29)
The use of phase in complex spectrum subtraction for robust speech recognition (2011) (29)
Speaker recognition in reverberant enclosures (1996) (29)
Affine Adaptation of Local Image Features Using the Hessian Matrix (2009) (29)
Attention Driven Fusion for Multi-Modal Emotion Recognition (2020) (29)
Real-time adaptive background segmentation (2003) (28)
Forecasting Future Action Sequences with Neural Memory Networks (2019) (27)
Identification of Children at Risk of Schizophrenia via Deep Learning and EEG Responses (2020) (27)
Optimising Figure of Merit for phonetic spoken term detection (2010) (27)
Improved GrabCut Segmentation via GMM Optimisation (2008) (27)
Forecasting Events Using an Augmented Hidden Conditional Random Field (2014) (26)
Dataset-invariant covariance normalization for out-domain PLDA speaker verification (2015) (26)
Multi-spectral fusion for surveillance systems (2008) (26)
Automatic UAV Forced Landing Site Detection Using Machine Learning (2014) (25)
Near-field Adaptive Beamformer for Robust Speech Recognition (2002) (25)
Automatically detecting action units from faces of pain: Comparing shape and appearance features (2009) (25)
Fine-grained Action Segmentation using the Semi-Supervised Action GAN (2019) (25)
Task Specific Visual Saliency Prediction with Memory Augmented Conditional Generative Adversarial Networks (2018) (25)
Fourier Active Appearance Models (2011) (25)
Lip detection for audio-visual speech recognition in-car environment (2010) (24)
Determining operational measures from multi-camera surveillance systems using soft biometrics (2011) (24)
The Automated Cryptanalysis of Analog Speech Scramblers (1991) (24)
Improved SVM speaker verification through data-driven background dataset collection (2009) (24)
Improving short utterance based i-vector speaker recognition using source and utterance-duration normalization techniques (2013) (24)
Hierarchical Relational Attention for Video Question Answering (2018) (24)
Memory Augmented Deep Generative Models for Forecasting the Next Shot Location in Tennis (2019) (23)
Deep Inverse Reinforcement Learning for Behavior Prediction in Autonomous Driving: Accurate Forecasts of Vehicle Motion (2021) (23)
Hessian-Based Affine Adaptation of Salient Local Image Features (2012) (23)
Improving PLDA speaker verification performance using domain mismatch compensation techniques (2018) (23)
Locating People in Video from Semantic Descriptions: A New Database and Approach (2014) (23)
Gaze tracking for region of interest coding in JPEG 2000 (2006) (22)
Compressive Sensing for Gait Recognition (2011) (22)
Predicting Ball Ownership in Basketball from a Monocular View Using Only Player Trajectories (2015) (22)
Speaker attribution of multiple telephone conversations using a complete-linkage clustering approach (2012) (22)
3D face verification using a free-parts approach (2008) (21)
Effects of speech coding on text-dependent speaker recognition (1997) (21)
Learning Temporal Strategic Relationships using Generative Adversarial Imitation Learning (2018) (21)
Gabor Filter Bank Representation for 3D Face Recognition (2005) (21)
Use of brain computer interface to drive: preliminary results (2012) (21)
A Study of x-Vector Based Speaker Recognition on Short Utterances (2019) (21)
Discovering methods of scoring in soccer using tracking data (2015) (21)
Searching for people using semantic soft biometric descriptions (2015) (20)
Data-driven clustering for blind feature mapping in speaker verification (2005) (20)
Speaker Identification Using Higher Order Spectral Phase Features and their Effectiveness vis-a-vis Mel-Cepstral Features (2004) (20)
Multiple cameras for audio-visual speech recognition in an automotive environment (2013) (20)
Unusual Event Detection in Crowded Scenes Using Bag of LBPs in Spatio-Temporal Patches (2011) (20)
Forecasting the Next Shot Location in Tennis Using Fine-Grained Spatiotemporal Tracking Data (2016) (20)
A comparison of session variability compensation techniques for SVM-based speaker recognition (2007) (20)
Patch-Based Representation of Visual Speech (2006) (20)
Detecting rare events using Kullback-Leibler divergence: A weakly supervised approach (2016) (20)
Hand-held monocular SLAM in thermal-infrared (2012) (20)
Patch-based analysis of visual speech from multiple views (2008) (20)
Data-Driven Background Dataset Selection for SVM-Based Speaker Verification (2010) (20)
Probabilistic Surfel Fusion for Dense LiDAR Mapping (2017) (19)
Continuous pose-invariant lipreading (2008) (19)
3D ellipsoid fitting for multi-view gait recognition (2011) (19)
MTRNet++: One-stage Mask-based Scene Text Eraser (2019) (19)
Improved phonetic and lexical speaker recognition through MAP adaptation (2004) (19)
Fine-Grained Retrieval of Sports Plays using Tree-Based Alignment of Trajectories (2017) (19)
Histogram of Weighted Local Directions for Gait Recognition (2013) (19)
Person Re-Identification Using Group Information (2013) (19)
Extending the Task of Diarization to Speaker Attribution (2011) (18)
Deep Motion Analysis for Epileptic Seizure Classification (2018) (18)
The Backfilled GEI - A Cross-Capture Modality Gait Feature for Frontal and Side-View Gait Recognition (2012) (18)
Cryptanalysis of frequency domain analogue speech scramblers (1993) (18)
Elasticity Meets Continuous-Time: Map-Centric Dense 3D LiDAR SLAM (2020) (18)
Interactive Sports Analytics (2018) (18)
Neural memory plasticity for medical anomaly detection (2020) (18)
Coupled Generative Adversarial Network for Continuous Fine-Grained Action Segmentation (2019) (18)
Understanding Patients’ Behavior: Vision-Based Analysis of Seizure Disorders (2019) (18)
Multi-Level Sequence GAN for Group Activity Recognition (2018) (17)
The use of speech and lip modalities for robust speaker verification under adverse conditions (1999) (17)
Gaze Based Personal Identification (2010) (17)
Telephone based speaker recognition using multiple binary classifier and Gaussian mixture models (1997) (17)
Understanding and analyzing a large collection of archived swimming videos (2014) (17)
Improving PLDA speaker verification with limited development data (2014) (17)
Efficient Articulated Trajectory Reconstruction Using Dynamic Programming and Filters (2012) (17)
The QUT-NOISE-SRE protocol for the evaluation of noisy speaker recognition (2015) (17)
Rank Minimization across Appearance and Shape for AAM Ensemble Fitting (2013) (17)
Speech encryption in the transform domain (1990) (17)
Rethinking Planar Homography Estimation Using Perspective Fields (2018) (17)
Deep Auto-Encoders With Sequential Learning for Multimodal Dimensional Emotion Recognition (2020) (17)
Visual Voice Activity Detection Using Frontal versus Profile Views (2011) (17)
Multi-view Intelligent Vehicle Surveillance System (2006) (16)
Compact Model Representation for 3D Reconstruction (2017) (16)
Dynamic visual features for audio-visual speaker verification (2010) (16)
Dependence of GMM adaptation on feature post-processing for speaker recognition (2003) (16)
Face recognition from super-resolved images (2005) (16)
Interpretability performance assessment of JPEG2000 and part 1 compliant region of interest coding (2003) (16)
An Evaluation of Different Features and Learning Models for Anomalous Event Detection (2013) (16)
Pedestrian Trajectory Prediction with Structured Memory Hierarchies (2018) (16)
Scene Invariant Crowd Counting (2011) (16)
LoGG3D-Net: Locally Guided Global Descriptor Learning for 3D Place Recognition (2021) (16)
An extended pose-invariant lipreading system (2007) (16)
An Efficient Framework for Zero-Shot Sketch-Based Image Retrieval (2021) (16)
Swimmer Localization from a Moving Camera (2013) (16)
Meta Transfer Learning for Facial Emotion Recognition (2018) (16)
An Accurate Method for Skew Determination in Document Images (2002) (16)
Speaker Attribution of Australian Broadcast News Data (2013) (15)
Robustness to expression variations in fractal-based face recognition (2001) (15)
Automatic surveillance in transportation hubs: No longer just about catching the bad guy (2015) (15)
Complete-linkage clustering for voice activity detection in audio and visual speech (2015) (15)
Abandoned object detection using multi-layer motion detection (2008) (15)
Fused HMM-adaptation of multi-stream HMMs for audio-visual speech recognition (2007) (15)
Geometric Deep Learning for Subject Independent Epileptic Seizure Prediction Using Scalp EEG Signals (2021) (15)
Clustering of ad-hoc microphone arrays for robust blind beamforming (2010) (14)
Neighbourhood Context Embeddings in Deep Inverse Reinforcement Learning for Predicting Pedestrian Motion Over Long Time Horizons (2019) (14)
Aberrant epileptic seizure identification: A computer vision perspective (2019) (14)
A Continuous Speech Recognition Evaluation Protocol for the AVICAR Database (2008) (14)
The effect of language models on phonetic decoding for spoken term detection (2009) (14)
Cross-lingual pronunciation modelling for Indonesian speech recognition (2003) (14)
Improved Facial-Feature Detection for AVSP via Unsupervised Clustering and Discriminant Analysis (2003) (14)
Constrained Design of Deep Iris Networks (2019) (14)
Chromatic colour spaces for skin detection using GMMs (2002) (14)
Combat sports analytics: Boxing punch classification using overhead depth imagery (2015) (14)
Exploiting Human Social Cognition for the Detection of Fake and Fraudulent Faces via Memory Networks (2019) (13)
A Deep Four-Stream Siamese Convolutional Neural Network with Joint Verification and Identification Loss for Person Re-Detection (2018) (13)
Spatio Temporal Feature Evaluation for Action Recognition (2012) (13)
Fine-grained action recognition of boxing punches from depth imagery (2017) (13)
Quality based frame selection for video face recognition (2012) (13)
Component-Based Attention for Large-Scale Trademark Retrieval (2018) (13)
Recognising audio-visual speech in vehicles using the AVICAR database (2010) (13)
Within-session variability modelling for factor analysis speaker verification (2009) (13)
Robust 3D Face Recognition from Expression Categorisation (2007) (12)
Robust mean super-resolution for less cooperative NIR iris recognition at a distance and on the move (2010) (12)
Unusual Scene Detection Using Distributed Behaviour Model and Sparse Representation (2012) (12)
Activity Analysis in Complicated Scenes Using DFT Coefficients of Particle Trajectories (2012) (12)
Quality Based Frame Selection for Face Clustering in News Video (2013) (12)
Microphone array sub-band speech recognition (2001) (12)
Noise robust voice activity detection using normal probability testing and time-domain histogram analysis (2010) (12)
An Exploration of Feature Detector Performance in the Thermal-Infrared Modality (2011) (12)
Domain Generalization in Biosignal Classification (2020) (12)
Evaluating Automatic Road Detection across a Large Aerial Imagery Collection (2011) (12)
Group Segmentation During Object Tracking Using Optical Flow Discontinuities (2010) (12)
Speech Enhancement Using Microphone Array with Multi-Stage Processing (1996) (12)
Multi-Channel Sub-Band Speech Recognition (2001) (12)
Semantic Consistency and Identity Mapping Multi-Component Generative Adversarial Network for Person Re-Identification (2020) (12)
A Comparison of Session Variability Compensation Approaches for Speaker Verification (2010) (12)
SVM Speaker Verification Using Session Variability Modelling and GMM Supervectors (2007) (11)
Multi-view human pose estimation using modified five-point skeleton model (2008) (11)
Deformable face ensemble alignment with robust grouped-L1 anchors (2013) (11)
Scene Invariant Crowd Counting and Crowd Occupancy Analysis (2012) (11)
A study on the effects of using short utterance length development data in the design of GPLDA speaker verification systems (2017) (11)
A cluster-voting approach for speaker diarization and linking of Australian broadcast news recordings (2015) (11)
Can You Describe Him for Me? A Technique for Semantic Person Search in Video (2012) (11)
Visual speech recognition across multiple views (2008) (11)
Context from within: Hierarchical context modeling for semantic segmentation (2020) (11)
A hierarchical multimodal system for motion analysis in patients with epilepsy (2018) (11)
Visual attention based ROI maps from gaze tracking data (2004) (11)
A cascaded long short-term memory (LSTM) driven generic visual question answering (VQA) (2017) (11)
Robust Photogeometric Localization Over Time for Map-Centric Loop Closure (2019) (11)
Joint identification-verification for person re-identification: A four stream deep learning approach with improved quartet loss function (2020) (11)
Comparison of Four Distance Measures for Long Time Text-Independent Speaker Identification (1996) (11)
An Efficient and Robust System for Multiperson Event Detection in Real-World Indoor Surveillance Scenes (2015) (11)
TMMF: Temporal Multi-Modal Fusion for Single-Stage Continuous Gesture Recognition (2020) (11)
Discriminative Optimization of the Figure of Merit for Phonetic Spoken Term Detection (2011) (11)
Chromatic lip tracking using a connectivity based fuzzy thresholding technique (1999) (10)
Multilingual phone clustering for recognition of spontaneous Indonesian speech utilising pronunciation modelling techniques (2003) (10)
A study of speaker clustering for speaker attribution in large telephone conversation datasets (2016) (10)
A robust UAV landing site detection system using mid-level discriminative patches (2016) (10)
Logarithmic quantisation of wavelet coefficients for improved texture classification performance (2004) (10)
Audio-visual speaker verification using continuous fused HMMs (2006) (10)
Dealing with uncertainty in microphone placement in a microphone array speech recognition system (2008) (10)
Improving the PLDA based speaker verification in limited microphone data conditions (2013) (10)
DNN based Speaker Recognition on Short Utterances (2016) (10)
Representing Team Behaviours from Noisy Data Using Player Role (2014) (10)
Position-Independent Enhancement of Reverberant Speech (1997) (10)
Target-Specific Siamese Attention Network for Real-Time Object Tracking (2020) (10)
Dynamic Performance Measures for Object Tracking Systems (2009) (10)
A feature clustering algorithm for scale-space analysis of image structures (2008) (10)
Cross-language acoustic model refinement for the Indonesian language (2005) (9)
Resection-Intersection Bundle Adjustment Revisited (2013) (9)
Gate connected convolutional neural network for object tracking (2017) (9)
Automatic Tracking, Super-Resolution and Recognition of Human Faces from Surveillance Video (2007) (9)
Bayes factor scoring of GMMs for speaker verification (2004) (9)
Complex Event Detection Using Joint Max Margin and Semantic Features (2016) (9)
Three approaches to multilingual phone recognition (2003) (9)
Real-time video event detection in crowded scenes using MPEG derived features: A multiple instance learning approach (2014) (9)
A syllable-scale framework for language identification (2006) (8)
Negative Determinant of Hessian Features (2011) (8)
Activity recognition using binary tree SVM (2014) (8)
Geometry-constrained Car Recognition Using a 3D Perspective Network (2019) (8)
Scatter Difference NAP for SVM Speaker Recognition (2009) (8)
Interpretable Seizure Classification Using Unprocessed EEG With Multi-Channel Attentive Feature Fusion (2021) (8)
Multiple Instance Dictionary Learning for Activity Representation (2014) (8)
Speaker linking using complete-linkage clustering (2012) (8)
InCloud: Incremental Learning for Point Cloud Place Recognition (2022) (8)
Visual front-end wars: Viola-Jones face detector vs Fourier Lucas-Kanade (2013) (8)
Cross database training of audio-visual hidden Markov models for phone recognition (2015) (8)
The State of Aerial Surveillance: A Survey (2022) (8)
Memory based fusion for multi-modal deep learning (2020) (8)
Multi-modal semantic image segmentation (2021) (8)
Speech Enhancement Using Near-Field Superdirectivity with an Adaptive Sidelobe Canceler and Post-Filter (2000) (8)
Facial analysis in the wild with LSTM networks (2017) (8)
On the Performance and Use of Speaker Recognition Systems for Surveillance (2006) (8)
Comparing audio and visual information for speech processing (2005) (8)
On the Statistical Determination of Optimal Camera Configurations in Large Scale Surveillance Networks (2012) (8)
Improved GMM-based speaker verification using SVM-driven impostor dataset selection (2009) (7)
Improving Speech Recognition Accuracy for Small Vocabulary Applications in Adverse Environments (2000) (7)
Ball on beam on roller: a new control laboratory device (2002) (7)
Closed-Form Solutions for Low-Rank Non-Rigid Reconstruction (2015) (7)
A modified LIMA framework for spectral subtraction applied to in-car speech recognition (2008) (7)
Hierarchical temporal decomposition: a novel approach to efficient compression of spectral characteristics of speech (1998) (7)
Channel selection in the short-time modulation domain for distant speech recognition (2015) (7)
End-to-End Domain Adaptive Attention Network for Cross-Domain Person Re-Identification (2020) (7)
Pitch and energy trajectory modelling in a syllable length temporal framework for language identification (2004) (7)
Gaze-J2K: Gaze-influenced image coding using eye trackers and JPEG 2000 (2006) (7)
Automatic gender identification under adverse conditions (1997) (7)
Channel Graph Regularized Correlation Filters for Visual Object Tracking (2021) (7)
Dense Correspondence Extraction in Difficult Uncalibrated Scenarios (2009) (7)
Image2Mesh: A Learning Framework for Single Image 3D Reconstruction (2018) (7)
Joint Deep Cross-Domain Transfer Learning for Emotion Recognition (2020) (7)
Can Audio-Visual Speech Recognition Outperform Acoustically Enhanced Speech Recognition in Automotive Environment? (2011) (7)
Progressive coding in JPEG2000 - improving content recognition performance using ROIs and importance maps (2002) (7)
On the Use of Factor Analysis with Restricted Target Data in Speaker Verification (2010) (7)
Understanding the Importance of Heart Sound Segmentation for Heart Anomaly Detection (2020) (7)
Enhancing automatic speaker identification using phoneme clustering and frame based parameter and frame size selection (1999) (7)
Data-Driven Impostor Selection for T-Norm Score Normalisation and the Background Dataset in SVM-Based Speaker Verification (2009) (7)
Detection of Fake and Fraudulent Faces via Neural Memory Networks (2021) (7)
Employing Phonetic Information in DNN Speaker Embeddings to Improve Speaker Recognition Performance (2018) (7)
Anchored Deformable Face Ensemble Alignment (2012) (7)
Robust Face Localisation Using Motion, Colour and Fusion (2003) (7)
Importance prioritization coding in JPEG2000 for interpretability with application to surveillance imagery (2003) (7)
A two stage fuzzy decision classifier for speaker identification (1996) (7)
Anomalous Event Detection Using a Semi-Two Dimensional Hidden Markov Model (2012) (7)
Problems associated with current area-based visual speech feature extraction techniques (2005) (6)
A link between cepstral shrinking and the weighted product rule in audio-visual speech recognition (2002) (6)
Two novel lossless algorithms to exploit index redundancy in VQ speech compression (1998) (6)
A suitability metric for mouth tracking through chromatic segmentation (2001) (6)
Deep features-based expression-invariant tied factor analysis for emotion recognition (2017) (6)
Minimising Speaker Verification Utterance Length through Confidence Based Early Verification Decisions (2009) (6)
Identifying Customer Behaviour and Dwell Time Using Soft Biometrics (2012) (6)
Improving The Effectiveness of Existing Noise Reduction Techniques Using Neural Networks (1996) (6)
An Examination of Audio-Visual Fused HMMs for Speaker Recognition (2006) (6)
The Australian English Speech Corpus for In-Car Speech processing (2009) (6)
PLDA based speaker verification with weighted LDA techniques (2012) (6)
Discovery of facial motions using deep machine perception (2016) (6)
Isolated word verification using cohort word-level verification (2003) (6)
Two-Stream Deep Feature Modelling for Automated Video Endoscopy Data Analysis (2020) (6)
An investigation of HMM classifier combination strategies for improved audio-visual speech recognition (2001) (6)
Frequency offset correction for HF radio speech reception (2000) (6)
Complex-Valued Iris Recognition Network (2020) (6)
Non-rigid Reconstruction with a Single Moving RGB-D Camera (2018) (6)
Person tracking using motion detection and optical flow (2005) (6)
Techniques for improving stereo depth maps of faces (2004) (6)
Vision-Based Mouth Motion Analysis in Epilepsy: A 3D Perspective (2019) (5)
Camera calibration in wireless multimedia sensor networks (2009) (5)
Speech compression with preservation of speaker identity (1997) (5)
LSTM guided ensemble correlation filter tracking with appearance model pool (2020) (5)
Exploiting multiple feature sets in data-driven impostor dataset selection for speaker verification (2010) (5)
2D-3D Hybrid Face Recognition Based on PCA and Feature Modelling (2006) (5)
Efficient real-time face detection for high resolution surveillance applications (2012) (5)
Comparing object alignment algorithms with appearance variation: Forward-additive vs inverse-composition (2008) (5)
CROSS LINGUAL MODELLING EXPERIMENTS FOR INDONESIAN (2002) (5)
SAIVT-QUT@TRECVid 2012: Interactive surveillance event detection (2012) (5)
Feature mapping using far-field microphones for distant speech recognition (2016) (5)
Importance coding of still imagery based on importance maps of visually interpretable regions (2001) (5)
The Role of Motion Models in Super-Resolving Surveillance Video for Face Recognition (2006) (5)
SAIVT-ADMRG @ MediaEval 2014 Social Event Detection (2014) (5)
A comparison of fusion techniques in mel-cepstral based speaker identification (1998) (5)
Temporarily-Aware Context Modeling Using Generative Adversarial Networks for Speech Activity Detection (2020) (5)
Robust and Interpretable Temporal Convolution Network for Event Detection in Lung Sound Recordings (2021) (5)
An Application of Fractal Image-Set Coding in Facial Recognition (2004) (5)
Deep domain adaptation for anti-spoofing in speaker verification systems (2019) (5)
High Quality Audio Coding: An Overview (1995) (5)
A Hierarchical Multi-modal System for Motion Analysis in Epileptic Patients (2020) (5)
2D-3D Face Recognition Based on PCA and Feature Modelling (2006) (5)
Social signal processing for pain monitoring using a hidden conditional random field (2014) (5)
Audio-visual speaker identification using the CUAVE database (2005) (5)
Accurate Silhouettes for Surveillance - Improved Motion Segmentation Using Graph Cuts (2010) (5)
Investigating Deep Neural Networks for Speaker Diarization in the DIHARD Challenge (2018) (5)
Topic dependent language modelling for spoken term detection (2014) (5)
Co-talker Separation Using the 'Cocktail Party Effect' (1996) (4)
Domain-invariant I-vector Feature Extraction for PLDA Speaker Verification (2018) (4)
Likelihood-maximising frameworks for enhanced in-car speech recognition (2009) (4)
Feature Modelling of PCA Difference Vectors for 2D and 3D Face Recognition (2006) (4)
Weighting and normalisation of synchronous HMMs for audio-visual speech recognition (2007) (4)
Learning Detectors Quickly with Stationary Statistics (2014) (4)
Voice Presentation Attack Detection Using Convolutional Neural Networks (2019) (4)
Phonetic spoken term search using topic information (2014) (4)
Detecting rare events using Kullback-Leibler divergence (2015) (4)
Voiced/Unvoiced/Silence Classification of Noisy Speech in Real Time Audio Signal Processing (1995) (4)
A Multi-Class Tracker Using a Scalable Condensation Filter (2006) (4)
Super-Resolved Face Images using Robust Optical Flow (2004) (4)
Robust Face Localisation Using Motion, Colour & Fusion (2003) (4)
Detecting anomalous events at railway level crossings (2013) (4)
Wild-Places: A Large-Scale Dataset for Lidar Place Recognition in Unstructured Natural Environments (2022) (4)
Learning Salient Features for Multimodal Emotion Recognition with Recurrent Neural Networks and Attention Based Fusion (2019) (4)
Cross Likelihood Ratio Based Speaker Clustering Using Eigenvoice Models (2011) (4)
Guidelines to Using Region of Interest Coding in JPEG 2000 (2003) (4)
Human-level face verification with intra-personal factor analysis and deep face representation (2018) (4)
Multi-Scale Representation for 3D Face Recognition (2007) (4)
Deep discovery of facial motions using a shallow embedding layer (2017) (4)
Affect recognition from scalp-EEG using channel-wise encoder networks coupled with geometric deep learning and multi-channel feature fusion (2022) (4)
Learning Temporal Alignment Uncertainty for Efficient Event Detection (2015) (4)
Patient-independent Epileptic Seizure Prediction using Deep Learning Models (2020) (4)
Domain Mismatch Modeling of Out-Domain i-Vectors for PLDA Speaker Verification (2017) (4)
Neural Memory Plasticity for Anomaly Detection (2019) (4)
Multimodal clothing recognition for semantic search in unconstrained surveillance imagery (2019) (4)
Study on pairwise LDA for x‐vector‐based speaker recognition (2019) (4)
The development of a new signal processing program at the Queensland University of Technology (1996) (4)
Normalisation and Recognition of 3D Face Data Using Robust Hausdorff Metric (2008) (4)
Bayes Factor based speaker clustering for speaker diarization (2010) (4)
Pose-driven Attention-guided Image Generation for Person Re-Identification (2021) (4)
Low-cost hardware speech enhancement for improved speech recognition in automotive environments (2010) (3)
Normalisation of 3D face data (2008) (3)
Multi-sensor tracking using a scalable condensation filter (2008) (3)
Locating People in Surveillance Video Using Soft Biometric Traits (2017) (3)
Speech-seeking microphone array with multi-stage processing (1995) (3)
Evaluation of two-view geometry methods with automatic ground-truth generation (2013) (3)
Skeleton Driven Non-Rigid Motion Tracking and 3D Reconstruction (2018) (3)
Large scale monitoring of crowds and building utilisation: A new database and distributed approach (2015) (3)
Importance prioritisation in JPEG 2000 for improved interpretability (2004) (3)
A comparison of Gaussian mixture and multiple binary classifier models for speaker verification (1996) (3)
Higher Order Spectral Phase Features for Speaker Identification (2004) (3)
Improving visual noise insensitivity in small vocabulary audio visual speech recognition applications (2001) (3)
Interpolative coding of speech parameters using hierarchical temporal decomposition (2003) (3)
Assessment of speech dialog systems using multi-modal cognitive load analysis and driving performance metrics (2009) (3)
Effects of speech coding on speaker verification (1996) (3)
JFA based speaker recognition using delta-phase and MFCC features (2012) (3)
Incorporating visual information for spoken term detection (2015) (3)
Speaker Verification using Hidden Markov Models in a Multilingual Text-constrained Framework (2006) (3)
Learning detectors quickly using structured covariance matrices (2014) (3)
Scale-space volume descriptors for automatic 3D facial feature extraction (2009) (3)
Hierarchical Attention Network for Action Segmentation (2020) (3)
Detection of unknown forms from document images (2003) (3)
Deep Context Modeling for Semantic Segmentation (2017) (3)
Cascading appearance-based features for visual speaker verification (2008) (3)
Calibrating Cameras in Poor-Conditioned Pitch-Based Sports Games (2018) (3)
Exploring visual features through Gabor representations for facial expression detection (2010) (3)
Tracking people in 3D using position, size and shape (2005) (3)
Eigengaze - covert behavioral biometric exploiting visual attention characteristics (2010) (3)
Semantic Segmentation Of Hands In Multimodal Images: A Region New-Based CNN Approach (2019) (3)
Speakers In The Wild (SITW): The QUT Speaker Recognition System (2016) (3)
Multi-Modal Object Tracking using Dynamic Performance Metrics (2010) (3)
Spoken Language Identification Utilising Both Acoustic and Phonetic Information (2003) (3)
Speech compaction using temporal decomposition (1998) (3)
The design and development of an undergraduate signal processing laboratory (1994) (3)
Learning Regional Attention Over Multi-Resolution Deep Convolutional Features For Trademark Retrieval (2021) (2)
Accurate 3D hand mesh recovery from a single RGB image (2021) (2)
A New Approach To Teaching Signal Processing At Undergraduate Level (1996) (2)
Bayes factor based speaker segmentation for speaker diarization (2010) (2)
Self-calibration of wireless cameras with restricted degrees of freedom (2012) (2)
Robust Speech Coding for the Preservation of Speaker Identity (1996) (2)
Discriminative Domain-Invariant Adversarial Network for Deep Domain Generalization (2021) (2)
What Is the Average Human Face? (2006) (2)
Wide baseline correspondence extraction beyond local features (2011) (2)
Deep Domain Generalization with Feature-norm Network (2021) (2)
Texture classification using gabor energy features and higher order spectral features: a comparative study (2005) (2)
A Real Time Audio Enhancement System (1995) (2)
Automatic Event Detection for Signal-based Surveillance (2016) (2)
Investigating in-domain data requirements for PLDA training (2015) (2)
Identifying Team Style in Soccer using Formations from Spatiotemporal Tracking Data (2014) (2)
Domain adaptation based Speaker Recognition on Short Utterances (2016) (2)
Normalisation of 3D Face Data (2007) (2)
Fast, Dense Feature SDM on an iPhone (2016) (2)
Multi-Slice Net: A Novel Light Weight Framework For COVID-19 Diagnosis (2021) (2)
Combined coding of audio and speech signals using LPC and the discrete wavelet transform (1997) (2)
Preserving Semantic Consistency in Unsupervised Domain Adaptation Using Generative Adversarial Networks (2021) (2)
Short utterance PLDA speaker verification using SN-WLDA and variance modelling techniques (2014) (2)
Textual Analysis for Script Recognition (2001) (2)
Activity Modelling in Crowded Environments: A Soft-Decision Approach (2011) (2)
Towards Improved Assessment of Phonotactic Information for Automatic Language Identification (2006) (2)
Speech separation by simulating the cocktail party effect with a neural network controlled Wiener filter (1997) (2)
Human Face Reconstruction Using Bayesian Deformable Models (2006) (2)
Practical Improvements to Simultaneous Computation of Multi-view Geometry and Radial Lens Distortion (2011) (2)
Speech Enhancement by Simulation Of Cocktail Party Effect With Neural Network Controlled Iterative Filter (1996) (2)
Split 'n' merge net: A dynamic masking network for multi-task attention (2022) (2)
Deep Decision Trees for Discriminative Dictionary Learning with Adversarial Multi-agent Trajectories (2018) (2)
An iterative speaker re-diarization scheme for improving speaker-based entity extraction in multimedia archives (2014) (2)
ROBUST LIP TRACKING USING ACTIVE SHAPE MODELS AND GRADIENT VECTOR FLOW (2000) (2)
Short Utterance Variance Modelling and Utterance Partitioning for PLDA Speaker Verification (2016) (2)
From Affine Rank Minimization Solution to Sparse Modeling (2017) (2)
Searching for semantic person queries using channel representations (2015) (2)
Importance coding of surveillance imagery for interpretability using quadtree dynamic importance maps (2001) (2)
Digital coding of covert audio for monitoring and storage (1999) (2)
Scene Invariant Virtual Gates Using DNNs (2019) (2)
Importance Coding in JPEG2000 for Improved Interpretability (2001) (2)
Fast Exact Nearest Neighbour Matching in High Dimensions Using d-D Sort (2013) (2)
Application of the trended hidden Markov model to speech synthesis (2001) (2)
Semantic Correspondence: A Hierarchical Approach (2018) (2)
Adaptive Vector Quantization for Speech Spectrum Coding (1999) (2)
Unified 2D and 3D Hand Pose Estimation from a Single Visible or X-ray Image (2019) (1)
Modeling of output probability distribution to improve small vocabulary speech recognition in adverse environments (1998) (1)
Robust Automatic Face Clustering in News Video (2015) (1)
A speaker rediarization scheme for improving diarization in large two-speaker telephone datasets (2014) (1)
QUT Speaker Identity Verification system for EVALITA 2009 (2010) (1)
Learning object dynamics for smooth tracking of moving lip contours (2000) (1)
IGSSTRCF: Importance Guided Sparse Spatio-Temporal Regularized Correlation Filters For Tracking (2021) (1)
Semantic Correspondence in the Wild (2019) (1)
Improved subject identification in surveillance video using super resolution (2012) (1)
A Comparison of Three Discriminant Models for Automatic Speaker Verification (1996) (1)
Accurate 3D hand mesh recovery from a single RGB image (2022) (1)
Robust Real Time Multi-Layer Foreground Segmentation (2007) (1)
Enhancement Methods for Reverberant Speech (1996) (1)
Robust Enhancement of Reverberant Speech (1995) (1)
Semi-Binary Based Video Features for Activity Representation (2013) (1)
Audio visual automatic speech recognition in vehicles (2010) (1)
Deep Match Tracker: Classifying when Dissimilar, Similarity Matching when Not (2018) (1)
SESS: Saliency Enhancing with Scaling and Sliding (2022) (1)
Supervised Latent Dirichlet Allocation Models for Efficient Activity Representation (2014) (1)
Application Specific Bounds on Detection Cost using Game Theory (2006) (1)
Cascading appearance-based features for visual voice activity detection (2010) (1)
Spectral Geometric Verification: Re-Ranking Point Cloud Retrieval for Metric Localization (2022) (1)
Joint Max Margin and Semantic Features for Continuous Event Detection in Complex Scenes (2017) (1)
Enhancing Feature Invariance with Learned Image Transformations for Image Retrieval (2020) (1)
Single image depth prediction using super-column super-pixel features (2017) (1)
Learning test-time augmentation for content-based image retrieval (2020) (1)
Robust facial feature extraction and matching (2012) (1)
Calculating the similarity of textures using wavelet scale relationships (2003) (1)
Multi-lingual character recognition using artificial neural networks (1996) (1)
Rescaling clustering trees using impact ratios for robust hierarchical speaker clustering (2014) (1)
Analyzing and predicting events in soccer and tennis using spatiotemporal data (2014) (1)
Deeper and wider fully convolutional network coupled with conditional random fields for scene labeling (2016) (1)
Enhancing The Multiple Binary Classifier Model (1996) (1)
PhD forum: Multiple camera management using wide base-line matching (2009) (1)
Improving Short Utterance PLDA Speaker Verification using SUV Modelling and Utterance Partitioning Approach (2016) (1)
Investigating Domain Sensitivity of DNN Embeddings for Speaker Recognition Systems (2019) (1)
An Intelligent Microphone Array for Speech Enhancement (1996) (1)
Detecting Heart Failure Through Voice Analysis using Self-Supervised Mode-Based Memory Fusion (2022) (1)
A distributed protocol for object tracking in wireless multimedia sensor networks (2010) (1)
Improving the performance of a small microphone array at low frequencies using critical band and LPC codebooks (2000) (1)
A Secure Analog Speech Scrambler Using the Discrete Cosine Transform (1991) (1)
Fused HMM-Adaptation of Synchronous HMMs for Audio-Visual Speech Recognition (2007) (1)
Meta-transfer learning for emotion recognition (2020) (1)
An analysis of the KEEP CLEAR pavement markings effects on queuing vehicles dynamic performance at urban signalised intersections (2013) (1)
Cross-Lingual Pronunciation Modelling for Indonesian Speech (2003) (1)
Visual Question Answering Through Adversarial Learning of Multi-modal Representation (2020) (0)
3D Face Acquisition, Modelling and Recognition (2004) (0)
A likelihood-maximizing framework for enhanced in-car speech recognition based on speech dialog system interaction (2012) (0)
Aberrant Epileptic Seizure Identification: A Computer Vision Perspective (2021) (0)
The application of phonetic distribution normalisation to likelihood-maximising speech enhancement for robust ASR (2010) (0)
Generalized Generative Deep Learning Models for Biosignal Synthesis and Modality Transfer (2022) (0)
Aerial-Ground Person Re-ID (2023) (0)
Towards On-Board Panoptic Segmentation of Multispectral Satellite Images (2022) (0)
Progressive image transmission (1992) (0)
Robust enhancement of reverberant speech using iterative noise removal (1997) (0)
Multi-stage stacked temporal convolution neural networks (MS-S-TCNs) for biosignal segmentation and anomaly localization (2023) (0)
Table of Contents (2011) (0)
A study on the effects of using short utterance length development data in the design of GPLDA speaker verification systems (2017) (0)
QUT System Description to the NIST SRE 2018 Campaign (2018) (0)
Towards Self-Explainability of Deep Neural Networks with Heatmap Captioning and Large-Language Models (2023) (0)
Memory Based Attentive Fusion (2020) (0)
Rapid Channel Compensation for Speaker Verification in the NIST 2000 Speaker Recognition Evaluation (2001) (0)
Fast Search Methods for Spectral Quantization (1999) (0)
Erratum: Design of a discrete cosine transform based speech scrambler (1991) (0)
Overleaf Example (2022) (0)
Using Auxiliary Information for Person Re-Identification - A Tutorial Overview (2022) (0)
Toward On-Board Panoptic Segmentation of Multispectral Satellite Images (2023) (0)
Unsupervised Temporal Ensemble Alignment for Rapid Annotation (2014) (0)
On the convergence of Gaussian mixture models: improvements through vector quantization (1998) (0)
Modelling output probability distributions for enhancing speaker recognition (1999) (0)
Video Question Answering for Surveillance (2020) (0)
3DCarRecog: Car Recognition Using 3D Bounding Box. (2019) (0)
Simulation of Cocktail Party Effect with Neural Network Controlled Iterative Wiener Filter (1996) (0)
Physical Adversarial Attacks for Surveillance: A Survey (2023) (0)
Hessian-Based Affine Adaptation of Salient Local Image Features (2011) (0)
Application Specific Bounds on Detection Cost Using Game Theory (2006) (0)
Ground-plane based projective reconstruction for surveillance camera networks (2008) (0)
An Investigation of HMM Classifier Combination Strategies for Improved Audio-Visual Speech Recognition (2021) (0)
Coding Speech at Very Low Rates Using Temporal Decomposition-Based Spectral Interpolation and Mixed Excitation in the LPC Model (1999) (0)
Detection of Forms from Unknown Document Images (2003) (0)
Audio-Visual Speaker Verification using Continuous Fused HMMs (2006) (0)
Incorporating visual information for spoken term detection Audio, Image, and Video (2015) (0)
Fusion of Cohort-Word and Speech Background Model Based Confidence Scores for Improved Keyword Confidence Scoring and Verification (2005) (0)
Graph Rigidity for Near-Coplanar Structure from Motion (2011) (0)
Improving speaker identification performance in reverberant conditions using lip information (1998) (0)
Infra-red pupil detection for use in a face recognition system (2004) (0)
2013 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2013, Hobart, Australia, November 26-28, 2013 (2013) (0)
Class-specific sparse codes for representing activities (2015) (0)
Jointly Trained Conversion Model With LPCNet for Any-to-One Voice Conversion Using Speaker-Independent Linguistic Features (2022) (0)
Voice Recognition Research - Final Report (2009) (0)
Supplementary SESS: Saliency Enhancing with Scaling and Sliding (2022) (0)
2016 IEEE Winter Conference on Applications of Computer Vision, WACV 2016, Lake Placid, NY, USA, March 7-10, 2016 (2016) (0)
Object Recognition Using Stereo Vision and Higher Order Spectra (2005) (0)
Sparse Over-complete Patch Matching (2018) (0)
Fast & Slow Learning: Incorporating Synthetic Gradients in Neural Memory Controllers (2020) (0)
Intelligibility Measurement of Processed Reverberant Speech (1996) (0)
Frequency decomposition techniques for increased discriminative 3D facial information capture (2010) (0)
DEPENDENT LANGUAGE MODELLING FOR SPOKEN TERM DETECTION (2014) (0)
Speech compaction using vector quantisation and hidden Markov models (1999) (0)
Deep Inverse Reinforcement Learning for Behaviour Prediction in Autonomous Driving (2021) (0)
Vertical Axis Detection for Sport Video Analytics (2016) (0)
Point Cloud Segmentation Using Sparse Temporal Local Attention (2021) (0)
Comparing the Multiple Binary Classifier Model to Other Automatic Speaker Verification Models (1999) (0)
Cross database audio visual speech adaptation for phonetic spoken term detection (2017) (0)
Reduction of Feature Contamination for Hyper Spectral Image Classification (2021) (0)
Improving PLDA speaker verification using WMFD and linear-weighted approaches in limited microphone data conditions (2015) (0)
A Hybrid Method for Face Recognition using LLS CLAHE Method (2017) (0)
Hybrid coding of mixed signals for digital covert audio surveillance (2000) (0)
The effect of dialect mismatch on likelihood-maximising speech enhancement for noise-robust speech recognition (2010) (0)
ROI Detection & Tracking Visual Feature Extraction Visual Modelling (2006) (0)
Using a Free-Parts Representation for Visual Speech Recognition (2005) (0)
Multilingual Speech and Language Processing (2001) (0)
Eigenvoice modelling for cross likelihood ratio based speaker clustering: A Bayesian approach (2013) (0)
Academic Strategy Planning For A University Research Centre (1996) (0)
Investigation and comparison of robust stereo image matching using mutual information and hierarchical prior probabilities (2008) (0)
Speech enhancement by eigen decomposition with two-channel observations (1995) (0)
Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28 - July 1, 2010 (2010) (0)
Acoustic Adaptation in Cross Database Audio Visual SHMM Training for Phonetic Spoken Term Detection (2015) (0)
Airports of the future : improving operation, security and experience (2014) (0)
Design of a High Speed Stream Cipher (1992) (0)
Fused HMM adaptation of synchronous HMMs for audio-visual speaker verification (2008) (0)
An Auto-Tracking Auto-Beamforming Microphone Array for Sound Recording (1995) (0)
Task Specific Visual Saliency Prediction with Memory Augmented Conditional Generative Adversarial Networks (2020) (0)

What Schools Are Affiliated With Sridha Sridharan?

Sridha Sridharan is affiliated with the following schools:

Queensland University of Technology