Sridha Sridharan
#130,759
Most Influential Person Now
Sridha Sridharan's AcademicInfluence.com Rankings
Sridha Sridharanengineering Degrees
Engineering
#4489
World Rank
#5686
Historical Rank
Applied Physics
#1100
World Rank
#1128
Historical Rank

Download Badge
Engineering
Sridha Sridharan's Degrees
- PhD Electrical and Computer Engineering Carnegie Mellon University
- Masters Electrical and Computer Engineering Carnegie Mellon University
- Bachelors Electronics and Communication Engineering IIT Madras
Why Is Sridha Sridharan Influential?
(Suggest an Edit or Addition)Sridha Sridharan's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- Feature warping for robust speaker verification (2001) (789)
- Crowd Counting Using Multiple Local Features (2009) (293)
- i-vector Based Speaker Recognition on Short Utterances (2011) (266)
- Soft + Hardwired Attention: An LSTM Framework for Human Trajectory Prediction and Abnormal Event Detection (2017) (254)
- Automatically Detecting Pain in Video Through Facial Action Units (2011) (246)
- Iris Recognition With Off-the-Shelf CNN Features: A Deep Learning Perspective (2018) (237)
- Texture for script identification (2005) (198)
- Two Stream LSTM: A Deep Fusion Framework for Human Action Recognition (2017) (145)
- The QUT-NOISE-TIMIT corpus for the evaluation of voice activity detection algorithms (2010) (141)
- Explicit modelling of session variability for speaker verification (2008) (139)
- Face authentication test on the BANCA database (2004) (134)
- Person-independent facial expression detection using Constrained Local Models (2011) (131)
- Gait energy volumes and frontal gait recognition using depth images (2011) (130)
- An evaluation of crowd counting methods, features and regression models (2015) (127)
- Large-Scale Analysis of Soccer Matches Using Spatiotemporal Tracking Data (2014) (125)
- A Database for Person Re-Identification in Multi-Camera Surveillance Networks (2012) (124)
- Long range iris recognition: A survey (2017) (123)
- A Mask-Based Approach for the Geometric Calibration of Thermal-Infrared Cameras (2012) (121)
- Real-time adaptive background segmentation (2003) (108)
- Robust speaker recognition using microphone arrays (2001) (105)
- Image2Mesh: A Learning Framework for Single Image 3D Reconstruction (2017) (98)
- Textures of optical flow for real-time anomaly detection in crowds (2011) (97)
- Automatically detecting pain using facial actions (2009) (94)
- Least squares congealing for unsupervised alignment of images (2008) (94)
- A phonetic search approach to the 2006 NIST spoken term detection evaluation (2007) (94)
- Super-resolution for biometrics: A comprehensive survey (2018) (90)
- Liveness detection based on 3D face shape analysis (2013) (89)
- An adaptive optical flow technique for person tracking systems (2007) (89)
- Improving short utterance i-vector speaker verification using utterance variance modelling and compensation techniques (2014) (85)
- Modelling session variability in text-independent speaker verification (2005) (85)
- A syntactic approach to automatic lip feature extraction for speaker identification (1998) (77)
- Design and Cryptanalysis of Transform-Based Analog Speech Scamblers (1993) (76)
- Correlation-aware Adversarial Domain Adaptation and Generalization (2019) (75)
- Clustered Blind Beamforming From Ad-Hoc Microphone Arrays (2011) (75)
- Factor analysis subspace estimation for speaker verification with short utterances (2008) (74)
- Identifying Team Style in Soccer Using Formations Learned from Spatiotemporal Tracking Data (2014) (73)
- An approach to statistical lip modelling for speaker identification via chromatic feature extraction (1998) (71)
- Elastic LiDAR Fusion: Dense Map-Centric Continuous-Time SLAM (2017) (71)
- Deep Learning for Medical Anomaly Detection – A Survey (2020) (69)
- In the Pursuit of Effective Affective Computing: The Relationship Between Features and Registration (2012) (68)
- Real-Time Adaptive Foreground/Background Segmentation (2005) (67)
- Fast Fourier transform based speech encryption system (1991) (67)
- Improved Simultaneous Computation of Motion Detection and Optical Flow for Object Tracking (2009) (67)
- Fourier Lucas-Kanade Algorithm (2012) (64)
- Learning Free-Form Deformations for 3D Object Reconstruction (2018) (64)
- Rapid Yet Accurate Speech Indexing Using Dynamic Match Lattice Spotting (2007) (63)
- Adaptive Fusion of Speech and Lip Information for Robust Speaker Identification (2001) (62)
- Evaluation of image resolution and super-resolution on face recognition performance (2012) (62)
- Face recognition from 3D data using Iterative Closest Point algorithm and Gaussian mixture models (2004) (60)
- Predicting the Future: A Jointly Learnt Model for Action Anticipation (2019) (60)
- Dynamic match phone-lattice searches for very fast and accurate unrestricted vocabulary keyword spotting (2005) (59)
- Soft-Biometrics: Unconstrained Authentication in a Surveillance Environment (2009) (59)
- Large-Scale Analysis of Formations in Soccer (2013) (58)
- Super-Resolved Faces for Improved Face Recognition from Surveillance Video (2007) (58)
- Combined 2D/3D Face Recognition Using Log-Gabor Templates (2006) (57)
- PLDA based speaker recognition on short utterances (2012) (56)
- Improved facial expression recognition via uni-hyperplane classification (2012) (56)
- Integration strategies for audio-visual speech processing: applied to text-dependent speaker recognition (2005) (56)
- Deep Spatio-Temporal Features for Multimodal Emotion Recognition (2017) (56)
- Tracking by Prediction: A Deep Generative Model for Mutli-person Localisation and Tracking (2018) (56)
- Score-Level Multibiometric Fusion Based on Dempster–Shafer Theory Incorporating Uncertainty Factors (2014) (56)
- Feature-domain super-resolution framework for Gabor-based face and iris recognition (2012) (55)
- Investigation into Optical Flow Super-Resolution for Surveillance Applications (2005) (54)
- Quality-Driven Super-Resolution for Less Constrained Iris Recognition at a Distance and on the Move (2011) (54)
- Deep spatio-temporal feature fusion with compact bilinear pooling for multimodal emotion recognition (2018) (52)
- The Delta-Phase Spectrum With Application to Voice Activity Detection and Speaker Recognition (2011) (52)
- Noise robust voice activity detection using features extracted from the time-domain autocorrelation function (2010) (51)
- Deep Classification of Epileptic Signals (2018) (51)
- Face recognition using fractal codes (2001) (50)
- Local inter-session variability modelling for object classification (2014) (49)
- Least-squares congealing for large numbers of images (2009) (49)
- Feature-domain super-resolution for iris recognition (2011) (49)
- Scene invariant multi camera crowd counting (2014) (48)
- Using Synthetic Data to Improve Facial Expression Analysis with 3D Convolutional Networks (2017) (48)
- GD-GAN: Generative Adversarial Networks for Trajectory Prediction and Group Detection in Crowds (2018) (48)
- Tree Memory Networks for Modelling Long-term Temporal Dependencies (2017) (48)
- Methods to improve Gaussian mixture model based language identification system (2002) (48)
- Vector quantization based Gaussian modeling for speaker verification (2000) (48)
- Efficient constrained local model fitting for non-rigid face alignment (2009) (48)
- Trainable speech synthesis with trended hidden Markov models (2001) (47)
- Dynamic texture reconstruction from sparse codes for unusual event detection in crowded scenes (2011) (47)
- An MRF based abnormal event detection approach using motion and appearance features (2014) (47)
- Going Deeper: Autonomous Steering with Neural Memory Networks (2017) (44)
- Factor analysis modelling for speaker verification with short utterances (2008) (42)
- Experiments in Session Variability Modelling for Speaker Verification (2006) (41)
- Adaptive Optical Flow for Person Tracking (2005) (41)
- “ Sweet-Spot ” : Using Spatiotemporal Data to Discover and Predict Shots in Tennis (2013) (40)
- Discovering Team Structures in Soccer from Spatiotemporal Data (2016) (40)
- Spatiotemporal Camera-LiDAR Calibration: A Targetless and Structureless Approach (2020) (39)
- Recent Advances in Camera Planning for Large Area Surveillance (2016) (39)
- I-vector based speaker recognition using advanced channel compensation techniques (2014) (39)
- Deep Learning for Patient-Independent Epileptic Seizure Prediction Using Scalp EEG Signals (2021) (38)
- Adaptive mouth segmentation using chromatic features (2002) (38)
- Real-Time Mobile 3D Temperature Mapping (2015) (38)
- A Robust Interpretable Deep Learning Classifier for Heart Anomaly Detection Without Segmentation (2020) (38)
- Multi-Component Image Translation for Deep Domain Generalization (2018) (38)
- Heart Sound Segmentation Using Bidirectional LSTMs With Attention (2020) (37)
- Experiments in SVM-based Speaker Verification Using Short Utterances (2010) (36)
- Recognising Team Activities from Noisy Data (2013) (36)
- Multichannel speech separation by eigendecomposition and its application to co-talker interference removal (1997) (35)
- Multi-spectral stereo image matching using mutual information (2004) (34)
- On Minimum Discrepancy Estimation for Deep Domain Adaptation (2019) (34)
- Sparse Temporal Representations for Facial Expression Recognition (2011) (34)
- Locus: LiDAR-based Place Recognition using Spatiotemporal Higher-Order Pooling (2020) (34)
- Fusing shrinking and expanding active contour models for robust iris segementation (2010) (33)
- Optimal Camera Planning Under Versatile User Constraints in Multi-Camera Image Processing Systems (2014) (33)
- Automated analysis of seizure semiology and brain electrical activity in presurgery evaluation of epilepsy: A focused survey (2017) (33)
- Improving deep convolutional neural networks with unsupervised feature learning (2015) (33)
- Crowd Counting Using Group Tracking and Local Features (2010) (33)
- Discriminant NAP for SVM speaker recognition (2008) (33)
- Improving out-domain PLDA speaker verification using unsupervised inter-dataset variability compensation approach (2015) (33)
- Robust speaker verification via fusion of speech and lip modalities (1999) (32)
- Multiscale Representation for 3-D Face Recognition (2007) (32)
- Making Confident Speaker Verification Decisions With Minimal Speech (2010) (32)
- Deep facial analysis: A new phase I epilepsy evaluation using computer vision (2018) (32)
- MTRNet: A Generic Scene Text Eraser (2019) (31)
- Predicting movie ratings from audience behaviors (2014) (31)
- Predicting Shot Locations in Tennis Using Spatiotemporal Data (2013) (31)
- The use of temporal speech and lip information for multi-modal speaker identification via multi-stream HMMs (2000) (31)
- Spoken term detection using fast phonetic decoding (2009) (30)
- Initialised eigenlip estimator for fast lip tracking using linear regression (2000) (30)
- Speech enhancement using critical band spectral subtraction (1998) (30)
- Predicting Serves in Tennis using Style Priors (2015) (30)
- Gaussian mixture modelling of broad phonetic and syllabic events for text-independent speaker verification (2005) (29)
- A unified approach to multi-pose audio-visual ASR (2007) (29)
- Improving pain recognition through better utilisation of temporal information (2008) (29)
- The use of phase in complex spectrum subtraction for robust speech recognition (2011) (29)
- Speaker recognition in reverberant enclosures (1996) (29)
- Affine Adaptation of Local Image Features Using the Hessian Matrix (2009) (29)
- Attention Driven Fusion for Multi-Modal Emotion Recognition (2020) (29)
- Real-time adaptive background segmentation (2003) (28)
- Forecasting Future Action Sequences with Neural Memory Networks (2019) (27)
- Identification of Children at Risk of Schizophrenia via Deep Learning and EEG Responses (2020) (27)
- Optimising Figure of Merit for phonetic spoken term detection (2010) (27)
- Improved GrabCut Segmentation via GMM Optimisation (2008) (27)
- Forecasting Events Using an Augmented Hidden Conditional Random Field (2014) (26)
- Dataset-invariant covariance normalization for out-domain PLDA speaker verification (2015) (26)
- Multi-spectral fusion for surveillance systems (2008) (26)
- Automatic UAV Forced Landing Site Detection Using Machine Learning (2014) (25)
- Near-field Adaptive Beamformer for Robust Speech Recognition (2002) (25)
- Automatically detecting action units from faces of pain: Comparing shape and appearance features (2009) (25)
- Fine-grained Action Segmentation using the Semi-Supervised Action GAN (2019) (25)
- Task Specific Visual Saliency Prediction with Memory Augmented Conditional Generative Adversarial Networks (2018) (25)
- Fourier Active Appearance Models (2011) (25)
- Lip detection for audio-visual speech recognition in-car environment (2010) (24)
- Determining operational measures from multi-camera surveillance systems using soft biometrics (2011) (24)
- The Automated Cryptanalysis of Analog Speech Scramblers (1991) (24)
- Improved SVM speaker verification through data-driven background dataset collection (2009) (24)
- Improving short utterance based i-vector speaker recognition using source and utterance-duration normalization techniques (2013) (24)
- Hierarchical Relational Attention for Video Question Answering (2018) (24)
- Memory Augmented Deep Generative Models for Forecasting the Next Shot Location in Tennis (2019) (23)
- Deep Inverse Reinforcement Learning for Behavior Prediction in Autonomous Driving: Accurate Forecasts of Vehicle Motion (2021) (23)
- Hessian-Based Affine Adaptation of Salient Local Image Features (2012) (23)
- Improving PLDA speaker verification performance using domain mismatch compensation techniques (2018) (23)
- Locating People in Video from Semantic Descriptions: A New Database and Approach (2014) (23)
- Gaze tracking for region of interest coding in JPEG 2000 (2006) (22)
- Compressive Sensing for Gait Recognition (2011) (22)
- Predicting Ball Ownership in Basketball from a Monocular View Using Only Player Trajectories (2015) (22)
- Speaker attribution of multiple telephone conversations using a complete-linkage clustering approach (2012) (22)
- 3D face verification using a free-parts approach (2008) (21)
- Effects of speech coding on text-dependent speaker recognition (1997) (21)
- Learning Temporal Strategic Relationships using Generative Adversarial Imitation Learning (2018) (21)
- Gabor Filter Bank Representation for 3D Face Recognition (2005) (21)
- Use of brain computer interface to drive: preliminary results (2012) (21)
- A Study of x-Vector Based Speaker Recognition on Short Utterances (2019) (21)
- Discovering methods of scoring in soccer using tracking data (2015) (21)
- Searching for people using semantic soft biometric descriptions (2015) (20)
- Data-driven clustering for blind feature mapping in speaker verification (2005) (20)
- Speaker Identification Using Higher Order Spectral Phase Features and their Effectiveness vis-a-vis Mel-Cepstral Features (2004) (20)
- Multiple cameras for audio-visual speech recognition in an automotive environment (2013) (20)
- Unusual Event Detection in Crowded Scenes Using Bag of LBPs in Spatio-Temporal Patches (2011) (20)
- Forecasting the Next Shot Location in Tennis Using Fine-Grained Spatiotemporal Tracking Data (2016) (20)
- A comparison of session variability compensation techniques for SVM-based speaker recognition (2007) (20)
- Patch-Based Representation of Visual Speech (2006) (20)
- Detecting rare events using Kullback-Leibler divergence: A weakly supervised approach (2016) (20)
- Hand-held monocular SLAM in thermal-infrared (2012) (20)
- Patch-based analysis of visual speech from multiple views (2008) (20)
- Data-Driven Background Dataset Selection for SVM-Based Speaker Verification (2010) (20)
- Probabilistic Surfel Fusion for Dense LiDAR Mapping (2017) (19)
- Continuous pose-invariant lipreading (2008) (19)
- 3D ellipsoid fitting for multi-view gait recognition (2011) (19)
- MTRNet++: One-stage Mask-based Scene Text Eraser (2019) (19)
- Improved phonetic and lexical speaker recognition through MAP adaptation (2004) (19)
- Fine-Grained Retrieval of Sports Plays using Tree-Based Alignment of Trajectories (2017) (19)
- Histogram of Weighted Local Directions for Gait Recognition (2013) (19)
- Person Re-Identification Using Group Information (2013) (19)
- Extending the Task of Diarization to Speaker Attribution (2011) (18)
- Deep Motion Analysis for Epileptic Seizure Classification (2018) (18)
- The Backfilled GEI - A Cross-Capture Modality Gait Feature for Frontal and Side-View Gait Recognition (2012) (18)
- Cryptanalysis of frequency domain analogue speech scramblers (1993) (18)
- Elasticity Meets Continuous-Time: Map-Centric Dense 3D LiDAR SLAM (2020) (18)
- Interactive Sports Analytics (2018) (18)
- Neural memory plasticity for medical anomaly detection (2020) (18)
- Coupled Generative Adversarial Network for Continuous Fine-Grained Action Segmentation (2019) (18)
- Understanding Patients’ Behavior: Vision-Based Analysis of Seizure Disorders (2019) (18)
- Multi-Level Sequence GAN for Group Activity Recognition (2018) (17)
- The use of speech and lip modalities for robust speaker verification under adverse conditions (1999) (17)
- Gaze Based Personal Identification (2010) (17)
- Telephone based speaker recognition using multiple binary classifier and Gaussian mixture models (1997) (17)
- Understanding and analyzing a large collection of archived swimming videos (2014) (17)
- Improving PLDA speaker verification with limited development data (2014) (17)
- Efficient Articulated Trajectory Reconstruction Using Dynamic Programming and Filters (2012) (17)
- The QUT-NOISE-SRE protocol for the evaluation of noisy speaker recognition (2015) (17)
- Rank Minimization across Appearance and Shape for AAM Ensemble Fitting (2013) (17)
- Speech encryption in the transform domain (1990) (17)
- Rethinking Planar Homography Estimation Using Perspective Fields (2018) (17)
- Deep Auto-Encoders With Sequential Learning for Multimodal Dimensional Emotion Recognition (2020) (17)
- Visual Voice Activity Detection Using Frontal versus Profile Views (2011) (17)
- Multi-view Intelligent Vehicle Surveillance System (2006) (16)
- Compact Model Representation for 3D Reconstruction (2017) (16)
- Dynamic visual features for audio-visual speaker verification (2010) (16)
- Dependence of GMM adaptation on feature post-processing for speaker recognition (2003) (16)
- Face recognition from super-resolved images (2005) (16)
- Interpretability performance assessment of JPEG2000 and part 1 compliant region of interest coding (2003) (16)
- An Evaluation of Different Features and Learning Models for Anomalous Event Detection (2013) (16)
- Pedestrian Trajectory Prediction with Structured Memory Hierarchies (2018) (16)
- Scene Invariant Crowd Counting (2011) (16)
- LoGG3D-Net: Locally Guided Global Descriptor Learning for 3D Place Recognition (2021) (16)
- An extended pose-invariant lipreading system (2007) (16)
- An Efficient Framework for Zero-Shot Sketch-Based Image Retrieval (2021) (16)
- Swimmer Localization from a Moving Camera (2013) (16)
- Meta Transfer Learning for Facial Emotion Recognition (2018) (16)
- An Accurate Method for Skew Determination in Document Images (2002) (16)
- Speaker Attribution of Australian Broadcast News Data (2013) (15)
- Robustness to expression variations in fractal-based face recognition (2001) (15)
- Automatic surveillance in transportation hubs: No longer just about catching the bad guy (2015) (15)
- Complete-linkage clustering for voice activity detection in audio and visual speech (2015) (15)
- Abandoned object detection using multi-layer motion detection (2008) (15)
- Fused HMM-adaptation of multi-stream HMMs for audio-visual speech recognition (2007) (15)
- Geometric Deep Learning for Subject Independent Epileptic Seizure Prediction Using Scalp EEG Signals (2021) (15)
- Clustering of ad-hoc microphone arrays for robust blind beamforming (2010) (14)
- Neighbourhood Context Embeddings in Deep Inverse Reinforcement Learning for Predicting Pedestrian Motion Over Long Time Horizons (2019) (14)
- Aberrant epileptic seizure identification: A computer vision perspective (2019) (14)
- A Continuous Speech Recognition Evaluation Protocol for the AVICAR Database (2008) (14)
- The effect of language models on phonetic decoding for spoken term detection (2009) (14)
- Cross-lingual pronunciation modelling for indonesian speech recognition (2003) (14)
- Improved Facial-Feature Detection for AVSP via Unsupervised Clustering and Discriminant Analysis (2003) (14)
- Constrained Design of Deep Iris Networks (2019) (14)
- Chromatic colour spaces for skin detection using GMMS (2002) (14)
- Combat sports analytics: Boxing punch classification using overhead depthimagery (2015) (14)
- Exploiting Human Social Cognition for the Detection of Fake and Fraudulent Faces via Memory Networks (2019) (13)
- A Deep Four-Stream Siamese Convolutional Neural Network with Joint Verification and Identification Loss for Person Re-Detection (2018) (13)
- Spatio Temporal Feature Evaluation for Action Recognition (2012) (13)
- Fine-grained action recognition of boxing punches from depth imagery (2017) (13)
- Quality based frame selection for video face recognition (2012) (13)
- Component-Based Attention for Large-Scale Trademark Retrieval (2018) (13)
- Recognising audio-visual speech in vehicles using the AVICAR database (2010) (13)
- Within-session variability modelling for factor analysis speaker verification (2009) (13)
- Robust 3D Face Recognition from Expression Categorisation (2007) (12)
- Robust mean super-resolution for less cooperative NIR iris recognition at a distance and on the move (2010) (12)
- Unusual Scene Detection Using Distributed Behaviour Model and Sparse Representation (2012) (12)
- Activity Analysis in Complicated Scenes Using DFT Coefficients of Particle Trajectories (2012) (12)
- Quality Based Frame Selection for Face Clustering in News Video (2013) (12)
- Microphone array sub-band speech recognition (2001) (12)
- Noise robust voice activity detection using normal probability testing and time-domain histogram analysis (2010) (12)
- An Exploration of Feature Detector Performance in the Thermal-Infrared Modality (2011) (12)
- Domain Generalization in Biosignal Classification (2020) (12)
- Evaluating Automatic Road Detection across a Large Aerial Imagery Collection (2011) (12)
- Group Segmentation During Object Tracking Using Optical Flow Discontinuities (2010) (12)
- Speech Enhancement Using Microphone Array with Multi-Stage Processing (1996) (12)
- Multi-Channel Sub-Band Speech Recognition (2001) (12)
- Semantic Consistency and Identity Mapping Multi-Component Generative Adversarial Network for Person Re-Identification (2020) (12)
- A Comparison of Session Variability Compensation Approaches for Speaker Verification (2010) (12)
- SVM Speaker Verification Using Session Variability Modelling and GMM Supervectors (2007) (11)
- Multi-view human pose estimation using modified five-point skeleton model (2008) (11)
- Deformable face ensemble alignment with robust grouped-L1 anchors (2013) (11)
- Scene Invariant Crowd Counting and Crowd Occupancy Analysis (2012) (11)
- A study on the effects of using short utterance length development data in the design of GPLDA speaker verification systems (2017) (11)
- A cluster-voting approach for speaker diarization and linking of Australian broadcast news recordings (2015) (11)
- Can You Describe Him for Me? A Technique for Semantic Person Search in Video (2012) (11)
- Visual speech recognition across multiple views (2008) (11)
- Context from within: Hierarchical context modeling for semantic segmentation (2020) (11)
- A hierarchical multimodal system for motion analysis in patients with epilepsy (2018) (11)
- Visual attention based ROI maps from gaze tracking data (2004) (11)
- A cascaded long short-term memory (LSTM) driven generic visual question answering (VQA) (2017) (11)
- Robust Photogeometric Localization Over Time for Map-Centric Loop Closure (2019) (11)
- Joint identification-verification for person re-identification: A four stream deep learning approach with improved quartet loss function (2020) (11)
- Comparison of Four Distance Measures for Long Time Text-Independent Speaker Identification (1996) (11)
- An Efficient and Robust System for Multiperson Event Detection in Real-World Indoor Surveillance Scenes (2015) (11)
- TMMF: Temporal Multi-Modal Fusion for Single-Stage Continuous Gesture Recognition (2020) (11)
- Discriminative Optimization of the Figure of Merit for Phonetic Spoken Term Detection (2011) (11)
- Chromatic lip tracking using a connectivity based fuzzy thresholding technique (1999) (10)
- Multilingual phone clustering for recognition of spontaneous indonesian speech utilising pronunciation modelling techniques (2003) (10)
- A study of speaker clustering for speaker attribution in large telephone conversation datasets (2016) (10)
- A robust UAV landing site detection system using mid-level discriminative patches (2016) (10)
- Logarithmic quantisation of wavelet coefficients for improved texture classification performance (2004) (10)
- Audio-visual speaker verification using continuous fused HMMs (2006) (10)
- Dealing with uncertainty in microphone placement in a microphone array speech recognition system (2008) (10)
- Improving the PLDA based speaker verification in limited microphone data conditions (2013) (10)
- DNN based Speaker Recognition on Short Utterances (2016) (10)
- Representing Team Behaviours from Noisy Data Using Player Role (2014) (10)
- Position-Independent Enhancement of Reverberant Speech (1997) (10)
- Target-Specific Siamese Attention Network for Real-Time Object Tracking (2020) (10)
- Dynamic Performance Measures for Object Tracking Systems (2009) (10)
- A feature clustering algorithm for scale-space analysis of image structures (2008) (10)
- Cross-language acoustic model refinement for the Indonesian language (2005) (9)
- Resection-Intersection Bundle Adjustment Revisited (2013) (9)
- Gate connected convolutional neural network for object tracking (2017) (9)
- Automatic Tracking, Super-Resolution and Recognition of Human Faces from Surveillance Video (2007) (9)
- Bayes factor scoring of GMMs for speaker verification (2004) (9)
- Complex Event Detection Using Joint Max Margin and Semantic Features (2016) (9)
- Three approaches to multilingual phone recognition (2003) (9)
- Real-time video event detection in crowded scenes using MPEG derived features: A multiple instance learning approach (2014) (9)
- A syllable-scale framework for language identification (2006) (8)
- Negative Determinant of Hessian Features (2011) (8)
- Activity recognition using binary tree SVM (2014) (8)
- Geometry-constrained Car Recognition Using a 3D Perspective Network (2019) (8)
- Scatter Difference NAP for SVM Speaker Recognition (2009) (8)
- Interpretable Seizure Classification Using Unprocessed EEG With Multi-Channel Attentive Feature Fusion (2021) (8)
- Multiple Instance Dictionary Learning for Activity Representation (2014) (8)
- Speaker linking using complete-linkage clustering (2012) (8)
- InCloud: Incremental Learning for Point Cloud Place Recognition (2022) (8)
- Visual front-endwars: Viola-Jones face detector vs Fourier Lucas-Kanade (2013) (8)
- Cross database training of audio-visual hidden Markov models for phone recognition (2015) (8)
- The State of Aerial Surveillance: A Survey (2022) (8)
- Memory based fusion for multi-modal deep learning (2020) (8)
- Multi-modal semantic image segmentation (2021) (8)
- SPEECH ENHANCEMENT USING NEAR-FIELD SUPERDIRECTIVITY WITH AN ADAPTIVE SIDELOBE CANCELER AND POST-FILTER (2000) (8)
- Facial analysis in the wild with LSTM networks (2017) (8)
- On the Performance and Use of Speaker Recognition Systems for Surveillance (2006) (8)
- Comparing audio and visual information for speech processing (2005) (8)
- On the Statistical Determination of Optimal Camera Configurations in Large Scale Surveillance Networks (2012) (8)
- Improved GMM-based speaker verification using SVM-driven impostor dataset selection (2009) (7)
- Improving Speech Recognition Accuracy for Small Vocabulary Applications in Adverse Environments (2000) (7)
- Ball on beam on roller: a new control laboratory device (2002) (7)
- Closed-Form Solutions for Low-Rank Non-Rigid Reconstruction (2015) (7)
- A modified LIMA framework for spectral subtraction applied to in-car speech recognition (2008) (7)
- Hierarchical temporal decomposition: a novel approach to efficient compression of spectral characteristics of speech (1998) (7)
- Channel selection in the short-time modulation domain for distant speech recognition (2015) (7)
- End-to-End Domain Adaptive Attention Network for Cross-Domain Person Re-Identification (2020) (7)
- Pitch and energy trajectory modelling in a syllable length temporal framework for language identification (2004) (7)
- Gaze-J2K: Gaze-influenced image voding using eye trackers and JPEG 2000 (2006) (7)
- Automatic gender identification under adverse conditions (1997) (7)
- Channel Graph Regularized Correlation Filters for Visual Object Tracking (2021) (7)
- Dense Correspondence Extraction in Difficult Uncalibrated Scenarios (2009) (7)
- Image 2 Mesh : A Learning Framework for Single Image 3 D Reconstruction (2018) (7)
- Joint Deep Cross-Domain Transfer Learning for Emotion Recognition (2020) (7)
- Can Audio-Visual Speech Recognition Outperform Acoustically Enhanced Speech Recognition in Automotive Environment? (2011) (7)
- Progressive coding in JPEG2000 - improving content recognition performance using ROIs and importance maps (2002) (7)
- On the Use of Factor Analysis with Restricted Target Data in Speaker Verification (2010) (7)
- Understanding the Importance of Heart Sound Segmentation for Heart Anomaly Detection (2020) (7)
- Enhancing automatic speaker identification using phoneme clustering and frame based parameter and frame size selection (1999) (7)
- Data-Driven Impostor Selection for T-Norm Score Normalisation and the Background Dataset in SVM-Based Speaker Verification (2009) (7)
- Detection of Fake and Fraudulent Faces via Neural Memory Networks (2021) (7)
- Employing Phonetic Information in DNN Speaker Embeddings to Improve Speaker Recognition Performance (2018) (7)
- Anchored Deformable Face Ensemble Alignment (2012) (7)
- Robust Face Localisation Using Motion, Colour and Fusion (2003) (7)
- Importance prioritization coding in JPEG2000 for interpretability with application to surveillance imagery (2003) (7)
- A two stage fuzzy decision classifier for speaker identification (1996) (7)
- Anomalous Event Detection Using a Semi-Two Dimensional Hidden Markov Model (2012) (7)
- Problems associated with current area-based visual speech feature extraction techniques (2005) (6)
- A link between cepstral shrinking and the weighted product rule in audio-visual speech recognition (2002) (6)
- Two novel lossless algorithms to exploit index redundancy in VQ speech compression (1998) (6)
- A suitability metric for mouth tracking through chromatic segmentation (2001) (6)
- Deep features-based expression-invariant tied factor analysis for emotion recognition (2017) (6)
- Minimising Speaker Verification Utterance Length through Confidence Based Early Verification Decisions (2009) (6)
- Identifying Customer Behaviour and Dwell Time Using Soft Biometrics (2012) (6)
- Improving The Effectiveness of Existing Noise Reduction Techniques Using Neural Networks (1996) (6)
- An Examination of Audio-Visual Fused HMMs for Speaker Recognition (2006) (6)
- The Australian English Speech Corpus for In-Car Speech processing (2009) (6)
- PLDA based speaker verification with weighted LDA techniques (2012) (6)
- Discovery of facial motions using deep machine perception (2016) (6)
- Isolated word verification using cohort word-level verification (2003) (6)
- Two-Stream Deep Feature Modelling for Automated Video Endoscopy Data Analysis (2020) (6)
- An investigation of HMM classifier combination strategies for improved audio-visual speech recognition (2001) (6)
- Frequency offset correction for HF radio speech reception (2000) (6)
- Complex-Valued Iris Recognition Network (2020) (6)
- Non-rigid Reconstruction with a Single Moving RGB-D Camera (2018) (6)
- Person tracking using motion detection and optical flow (2005) (6)
- Techniques for improving stereo depth maps of faces (2004) (6)
- Vision-Based Mouth Motion Analysis in Epilepsy: A 3D Perspective (2019) (5)
- Camera calibration in wireless multimedia sensor networks (2009) (5)
- Speech compression with preservation of speaker identity (1997) (5)
- LSTM guided ensemble correlation filter tracking with appearance model pool (2020) (5)
- Exploiting multiple feature sets in data-driven impostor dataset selection for speaker verification (2010) (5)
- 2 D-3 D Hybrid Face Recognition Based on PCA and Feature Modelling (2006) (5)
- Efficient real-time face detection for high resolution surveillance applications (2012) (5)
- Comparing object alignment algorithms with appearance variation: Forward-additive vs inverse-composition (2008) (5)
- CROSS LINGUAL MODELLING EXPERIMENTS FOR INDONESIAN (2002) (5)
- SAIVT-QUT@TRECVid 2012: Interactive surveillance event detection (2012) (5)
- Feature mapping using far-field microphones for distant speech recognition (2016) (5)
- Importance coding of still imagery based on importance maps of visually interpretable regions (2001) (5)
- The Role of Motion Models in Super-Resolving Surveillance Video for Face Recognition (2006) (5)
- SAIVT-ADMRG @ MediaEval 2014 Social Event Detection (2014) (5)
- A comparison of fusion techniques in mel-cepstral based speaker identification (1998) (5)
- Temporarily-Aware Context Modeling Using Generative Adversarial Networks for Speech Activity Detection (2020) (5)
- Robust and Interpretable Temporal Convolution Network for Event Detection in Lung Sound Recordings (2021) (5)
- An Application of Fractal Image-Set Coding in Facial Recognition (2004) (5)
- Deep domain adaptation for anti-spoofing in speaker verification systems (2019) (5)
- High Quality Audio Coding: An Overview (1995) (5)
- A Hierarchical Multi-modal System for Motion Analysis in Epileptic Patients (2020) (5)
- 2D-3D Face Recognition Based on PCA and Feature Modelling (2006) (5)
- Social signal processing for pain monitoring using a hidden conditional random field (2014) (5)
- Audio-visual speaker identification using the CUAVE database (2005) (5)
- Accurate Silhouettes for Surveillance - Improved Motion Segmentation Using Graph Cuts (2010) (5)
- Investigating Deep Neural Networks for Speaker Diarization in the DIHARD Challenge (2018) (5)
- Topic dependent language modelling for spoken term detection (2014) (5)
- Co-talker Separation Using the 'Cocktail Party Effect' (1996) (4)
- Domain-invariant I-vector Feature Extraction for PLDA Speaker Verification (2018) (4)
- Likelihood-maximising frameworks for enhanced in-car speech recognition (2009) (4)
- Feature Modelling of PCA Difference Vectors for 2D and 3D Face Recognition (2006) (4)
- Weighting and normalisation of synchronous HMMs for audio-visual speech recognition (2007) (4)
- Learning Detectors Quickly with Stationary Statistics (2014) (4)
- Voice Presentation Attack Detection Using Convolutional Neural Networks (2019) (4)
- Phonetic spoken term search using topic information (2014) (4)
- Detecting rare events using Kullback-Leibler divergence (2015) (4)
- Voiced/Unvoiced/Silence Classification of Noisy Speech in Real Time Audio Signal Processing (1995) (4)
- A Multi-Class Tracker Using a Scalable Condensation Filter (2006) (4)
- Super-Resolved Face Images using Robust Optical Flow (2004) (4)
- Robust Face Localisation Using Motion, Colour & Fusion (2003) (4)
- Detecting anomalous events at railway level crossings (2013) (4)
- Wild-Places: A Large-Scale Dataset for Lidar Place Recognition in Unstructured Natural Environments (2022) (4)
- Learning Salient Features for Multimodal Emotion Recognition with Recurrent Neural Networks and Attention Based Fusion (2019) (4)
- Cross Likelihood Ratio Based Speaker Clustering Using Eigenvoice Models (2011) (4)
- Guidelines to Using Region of Interest Coding in JPEG 2000 (2003) (4)
- Human-level face verification with intra-personal factor analysis and deep face representation (2018) (4)
- Multi-Scale Representation for 3D Face Recognition (2007) (4)
- Deep discovery of facial motions using a shallow embedding layer (2017) (4)
- Affect recognition from scalp-EEG using channel-wise encoder networks coupled with geometric deep learning and multi-channel feature fusion (2022) (4)
- Learning Temporal Alignment Uncertainty for Efficient Event Detection (2015) (4)
- Patient-independent Epileptic Seizure Prediction using Deep Learning Models (2020) (4)
- Domain Mismatch Modeling of Out-Domain i-Vectors for PLDA Speaker Verification (2017) (4)
- Neural Memory Plasticity for Anomaly Detection (2019) (4)
- Multimodal clothing recognition for semantic search in unconstrained surveillance imagery (2019) (4)
- Study on pairwise LDA for x‐vector‐based speaker recognition (2019) (4)
- The development of a new signal processing program at the Queensland University of Technology (1996) (4)
- Normalisation and Recognition of 3D Face Data Using Robust Hausdorff Metric (2008) (4)
- Bayes Factor based speaker clustering for speaker diarization (2010) (4)
- Pose-driven Attention-guided Image Generation for Person Re-Identification (2021) (4)
- Low-cost hardware speech enhancement for improved speech recognition in automotive environments (2010) (3)
- Normalisation of 3D face data (2008) (3)
- Multi-sensor tracking using a scalable condensation filter (2008) (3)
- Locating People in Surveillance Video Using Soft Biometric Traits (2017) (3)
- Speech-seeking microphone array with multi-stage processing (1995) (3)
- Evaluation of two-view geometry methods with automatic ground-truth generation (2013) (3)
- Skeleton Driven Non-Rigid Motion Tracking and 3D Reconstruction (2018) (3)
- Large scale monitoring of crowds and building utilisation: A new database and distributed approach (2015) (3)
- Importance prioritisation in JPEG 2000 for improved interpretability (2004) (3)
- A comparison of Gaussian mixture and multiple binary classifier models for speaker verification (1996) (3)
- Higher Order Spectral Phase Features for Speaker Identification (2004) (3)
- Improving visual noise insensitivity in small vocabulary audio visual speech recognition applications (2001) (3)
- Interpolative coding of speech parameters using hierarchical temporal decomposition (2003) (3)
- Assessment of speech dialog systems using multi-modal cognitive load analysis and driving performance metrics (2009) (3)
- Effects of speech coding on speaker verification (1996) (3)
- JFA based speaker recognition using delta-phase and MFCC features (2012) (3)
- Incorporating visual information for spoken term detection (2015) (3)
- Speaker Verification using Hidden Markov Models in a Multilingual Text-constrained Framework (2006) (3)
- Learning detectors quickly using structured covariance matrices (2014) (3)
- Scale-space volume descriptors for automatic 3D facial feature extraction (2009) (3)
- Hierarchical Attention Network for Action Segmentation (2020) (3)
- Detection of unknown forms from document images (2003) (3)
- Deep Context Modeling for Semantic Segmentation (2017) (3)
- Cascading appearance-based features for visual speaker verification (2008) (3)
- Calibrating Cameras in Poor-Conditioned Pitch-Based Sports Games (2018) (3)
- Exploring visual features through Gabor representations for facial expression detection (2010) (3)
- Tracking people in 3D using position, size and shape (2005) (3)
- Eigengaze - covert behavioral biometric exploiting visual attention characteristics (2010) (3)
- Semantic Segmentation Of Hands In Multimodal Images: A Region New-Based CNN Approach (2019) (3)
- Speakers In The Wild (SITW): The QUT Speaker Recognition System (2016) (3)
- Multi-Modal Object Tracking using Dynamic Performance Metrics (2010) (3)
- Spoken Language Identification Utilising Both Acoustic and Phonetic Information (2003) (3)
- Speech compaction using temporal decomposition (1998) (3)
- The design and development of an undergraduate signal processing laboratory (1994) (3)
- Learning Regional Attention Over Multi-Resolution Deep Convolutional Features For Trademark Retrieval (2021) (2)
- Accurate 3D hand mesh recovery from a single RGB image (2021) (2)
- A New Approach To Teaching Signal Processing At Undergraduate Level (1996) (2)
- Bayes factor based speaker segmentation for speaker diarization (2010) (2)
- Self-calibration of wireless cameras with restricted degrees of freedom (2012) (2)
- Robust Speech Coding for the Preservation of Speaker Identity (1996) (2)
- Discriminative Domain-Invariant Adversarial Network for Deep Domain Generalization (2021) (2)
- What Is the Average Human Face? (2006) (2)
- Wide baseline correspondence extraction beyond local features (2011) (2)
- Deep Domain Generalization with Feature-norm Network (2021) (2)
- Texture classification using gabor energy features and higher order spectral features: a comparative study (2005) (2)
- A Real Time Audio Enhancement System (1995) (2)
- Automatic Event Detection for Signal-based Surveillance (2016) (2)
- Investigating in-domain data requirements for PLDA training (2015) (2)
- Identifying Team Style in Soccer using Formations from Spatiotemporal Tracking Data (2014) (2)
- Domain adaptation based Speaker Recognition on Short Utterances (2016) (2)
- Normalisation of 3 D Face Data (2007) (2)
- Fast, Dense Feature SDM on an iPhone (2016) (2)
- Multi-Slice Net: A Novel Light Weight Framework For COVID-19 Diagnosis (2021) (2)
- Combined coding of audio and speech signals using LPC and the discrete wavelet transform (1997) (2)
- Preserving Semantic Consistency in Unsupervised Domain Adaptation Using Generative Adversarial Networks (2021) (2)
- Short utterance PLDA speaker verification using SN-WLDA and variance modelling techniques (2014) (2)
- Textual Analysis for Script Recognition (2001) (2)
- Activity Modelling in Crowded Environments: A Soft-Decision Approach (2011) (2)
- Towards Improved Assessment of Phonotactic Information for Automatic Language Identification (2006) (2)
- Speech separation by simulating the cocktail party effect with a neural network controlled Wiener filter (1997) (2)
- Human Face Reconstruction Using Bayesian Deformable Models (2006) (2)
- Practical Improvements to Simultaneous Computation of Multi-view Geometry and Radial Lens Distortion (2011) (2)
- Speech Enhancement Iby Simulation Of Cocktail Party Effect With Neural Network Controlled Iterative Filter (1996) (2)
- Split 'n' merge net: A dynamic masking network for multi-task attention (2022) (2)
- Deep Decision Trees for Discriminative Dictionary Learning with Adversarial Multi-agent Trajectories (2018) (2)
- An iterative speaker re-diarization scheme for improving speaker-based entity extraction in multimedia archives (2014) (2)
- ROBUST LIP TRACKING USING ACTIVE SHAPE MODELS AND GRADIENT VECTOR FLOW (2000) (2)
- Short Utterance Variance Modelling and Utterance Partitioning for PLDA Speaker Verification (2016) (2)
- From Affine Rank Minimization Solution to Sparse Modeling (2017) (2)
- Searching for semantic person queries using channel representations (2015) (2)
- Importance coding of surveillance imagery for interpretability using quadtree dynamic importance maps (2001) (2)
- Digital coding of covert audio for monitoring and storage (1999) (2)
- Scene Invariant Virtual Gates Using DNNs (2019) (2)
- Importance Coding in JPEG2000 for Improved Interpretability (2001) (2)
- Fast Exact Nearest Neighbour Matching in High Dimensions Using d-D Sort (2013) (2)
- Application of the trended hidden Markov model to speech synthesis (2001) (2)
- Semantic Correspondence: A Hierarchical Approach (2018) (2)
- Adaptive Vector Quantization for Speech Spectrum Coding (1999) (2)
- Unified 2D and 3D Hand Pose Estimation from a Single Visible or X-ray Image (2019) (1)
- Modeling of output probability distribution to improve small vocabulary speech recognition in adverse environments (1998) (1)
- Robust Automatic Face Clustering in News Video (2015) (1)
- A speaker rediarization scheme for improving diarization in large two-speaker telephone datasets (2014) (1)
- QUT Speaker Identity Verification system for EVALITA 2009 (2010) (1)
- Learning object dynamics for smooth tracking of moving lip contours (2000) (1)
- IGSSTRCF: Importance Guided Sparse Spatio-Temporal Regularized Correlation Filters For Tracking (2021) (1)
- Semantic Correspondence in the Wild (2019) (1)
- Improved subject identification in surveillance video using super resolution (2012) (1)
- A Comparison of Three Discriminant Models for Automatic Speaker Verification (1996) (1)
- Accurate 3D hand mesh recovery from a single RGB image (2022) (1)
- Robust Real Time Multi-Layer Foreground Segmentation (2007) (1)
- Enhancement Methods for Reverberant Speech (1996) (1)
- Robust Enhancement of Reverberant Speech (1995) (1)
- Semi-Binary Based Video Features for Activity Representation (2013) (1)
- Audio visual automatic speech recognition in vehicles (2010) (1)
- Deep Match Tracker: Classifying when Dissimilar, Similarity Matching when Not (2018) (1)
- SESS: Saliency Enhancing with Scaling and Sliding (2022) (1)
- Supervised Latent Dirichlet Allocation Models for Efficient Activity Representation (2014) (1)
- Application Specific Bounds on Detection Cost using Game Theory (2006) (1)
- Cascading appearance-based features for visual voice activity detection (2010) (1)
- Spectral Geometric Verification: Re-Ranking Point Cloud Retrieval for Metric Localization (2022) (1)
- Joint Max Margin and Semantic Features for Continuous Event Detection in Complex Scenes (2017) (1)
- Enhancing Feature Invariance with Learned Image Transformations for Image Retrieval (2020) (1)
- Single image depth prediction using super-column super-pixel features (2017) (1)
- Learning test-time augmentation for content-based image retrieval (2020) (1)
- Robust facial feature extraction and matching (2012) (1)
- Calculating the similarity of textures using wavelet scale relationships (2003) (1)
- Multi-lingual character recognition using artificial neural networks (1996) (1)
- Rescaling clustering trees using impact ratios for robust hierarchical speaker clustering (2014) (1)
- Analyzing and predicting events in soccer and tennis using spatiotemporal data (2014) (1)
- Deeper and wider fully convolutional network coupled with conditional random fields for scene labeling (2016) (1)
- Enhancing The Multiple Binary Classifier Model (1996) (1)
- PhD forum: Multiple camera management using wide base-line matching (2009) (1)
- Improving Short Utterance PLDA Speaker Verification using SUV Modelling and Utterance Partitioning Approach (2016) (1)
- Investigating Domain Sensitivity of DNN Embeddings for Speaker Recognition Systems (2019) (1)
- An Intelligent Microphone Array for Speech Enhancement (1996) (1)
- Detecting Heart Failure Through Voice Analysis using Self-Supervised Mode-Based Memory Fusion (2022) (1)
- A distributed protocol for object tracking in wireless multimedia sensor networks (2010) (1)
- Improving the performance of a small microphone array at low frequencies using critical band and LPC codebooks (2000) (1)
- A Secure Analog Speech Scrambler Using the Discrete Cosine Transform (1991) (1)
- Fused HMM-Adaptation of Synchronous HMMs for Audio-Visual Speech Recognition (2007) (1)
- Meta-transfer learning for emotion recognition (2020) (1)
- An analysis of the KEEP CLEAR pavement markings effects on queuing vehicles dynamic performance at urban signalised intersections (2013) (1)
- Cross-Lingual Pronunciation Modelling for Indonesian Speech (2003) (1)
- Visual Question Answering Through Adversarial Learning of Multi-modal Representation (2020) (0)
- 3D Face Acquisition, Modelling and Recognition (2004) (0)
- A likelihood-maximizing framework for enhanced in-car speech recognition based on speech dialog system interaction (2012) (0)
- anu Aberrant Epileptic Seizure Identification: A Computer Vision Perspective (2021) (0)
- The application of phonetic distribution normalisation to likelihood-maximising speech enhancement for robust ASR (2010) (0)
- Generalized Generative Deep Learning Models for Biosignal Synthesis and Modality Transfer (2022) (0)
- Aerial-Ground Person Re-ID (2023) (0)
- Towards On-Board Panoptic Segmentation of Multispectral Satellite Images (2022) (0)
- Progressive image transmission (1992) (0)
- Robust enhancement of reverberant speech using iterative noise removal (1997) (0)
- Multi-stage stacked temporal convolution neural networks (MS-S-TCNs) for biosignal segmentation and anomaly localization (2023) (0)
- Table of Contents (2011) (0)
- A study on the effects of using short utterance length development data in the design of GPLDA speaker verification systems (2017) (0)
- QUT System Description to the NIST SRE 2018 Campaign (2018) (0)
- Towards Self-Explainability of Deep Neural Networks with Heatmap Captioning and Large-Language Models (2023) (0)
- Memory Based Attentive Fusion (2020) (0)
- Rapid Channel Compensation for Speaker Verification in the NIST 2000 Speaker Recognition Evaluation (2001) (0)
- Fast Search Methods for Spectral Quantization (1999) (0)
- Erratum: Design of a discrete cosine transform based speech scrambler (1991) (0)
- Overleaf Example (2022) (0)
- Using Auxiliary Information for Person Re-Identification - A Tutorial Overview (2022) (0)
- Toward On-Board Panoptic Segmentation of Multispectral Satellite Images (2023) (0)
- Unsupervised Temporal Ensemble Alignment for Rapid Annotation (2014) (0)
- On the convergence of Gaussian mixture models: improvements through vector quantization (1998) (0)
- Modelling output probability distributions for enhancing speaker recognition (1999) (0)
- Video Question Answering for Surveillance (2020) (0)
- 3DCarRecog: Car Recognition Using 3D Bounding Box. (2019) (0)
- Simulation of Cocktail Party Effect with Neural Network Controlled Iterative Wiener Filter (1996) (0)
- Physical Adversarial Attacks for Surveillance: A Survey (2023) (0)
- Hessian-Based Affine Adaptation of Salient Local Image Features (2011) (0)
- Application Specific Boundson Detection CostUsingGameTheory (2006) (0)
- Ground-plane based projective reconstruction for surveillance camera networks (2008) (0)
- An Investigation of HMM Classifier Combination Strategies for Improved Audio-Visual Speech Recognition (2021) (0)
- Coding Speech at Very Low Rates Using Temporal Decomposition-Based Spectral Interpolation and Mixed Excitation in the LPC Model (1999) (0)
- Detection of Forms from Unknown Document Images (2003) (0)
- Audio-Visual Speaker Veri(cid:28)cation using Continuous Fused HMMs (2006) (0)
- Incorporating visual information for spoken term detection Audio, Image, and Video (2015) (0)
- Fusion of Cohort-Word and Speech Background Model Based Confidence Scores for Improved Keyword Confidence Scoring and Verification (2005) (0)
- Graph Rigidity for Near-Coplanar Structure from Motion (2011) (0)
- Improving speaker identification performance in reverberant conditions using lip information (1998) (0)
- Infra-red pupil detection for use in a face recognition system (2004) (0)
- 2013 International Conference on Digital Image Computing: Techniques and Applications, DICTA 2013, Hobart, Australia, November 26-28, 2013 (2013) (0)
- Class-specific sparse codes for representing activities (2015) (0)
- Jointly Trained Conversion Model With LPCNet for Any-to-One Voice Conversion Using Speaker-Independent Linguistic Features (2022) (0)
- Voice Recognition Research - Final Report (2009) (0)
- Supplementary SESS: Saliency Enhancing with Scaling and Sliding (2022) (0)
- 2016 IEEE Winter Conference on Applications of Computer Vision, WACV 2016, Lake Placid, NY, USA, March 7-10, 2016 (2016) (0)
- Object Recognition Using Stereo Vision and Higher Order Spectra (2005) (0)
- Sparse Over-complete Patch Matching (2018) (0)
- Fast & Slow Learning: Incorporating Synthetic Gradients in Neural Memory Controllers (2020) (0)
- Intelligibility Measurement of Processed Reverberant Speech (1996) (0)
- Frequency decomposition techniques for increased discriminative 3D facial information capture (2010) (0)
- DEPENDENT LANGUAGEMODELLING FOR SPOKEN TERM DETECTION (2014) (0)
- Speech compaction using vector quantisation and hidden Markov models (1999) (0)
- Deep Inverse Reinforcement Learning for Behaviour Prediction in Autonomous Driving (2021) (0)
- Vertical Axis Detection for Sport Video Analytics (2016) (0)
- Point Cloud Segmentation Using Sparse Temporal Local Attention (2021) (0)
- Comparing the Multiple Binary Classifier Model to Other Automatic Speaker Verification Models (1999) (0)
- Cross database audio visual speech adaptation for phonetic spoken term detection (2017) (0)
- Reduction of Feature Contamination for Hyper Spectral Image Classification (2021) (0)
- Improving PLDA speaker verification using WMFD and linear-weighted approaches in limited microphone data conditions (2015) (0)
- A Hybrid Method for Face Recognition using LLS CLAHE Method (2017) (0)
- Hybrid coding of mixed signals for digital covert audio surveillance (2000) (0)
- The effect of dialect mismatch on likelihood-maximising speech enhancement for noise-robust speech recognition (2010) (0)
- ROI Detection & Tracking Visual Feature Extraction Visual Modelling ROI Detection & Tracking Visual Feature Extraction Visual Modelling ROI Detection & Tracking Visual Feature Extraction Visual Modelling (2006) (0)
- Using a Free-Parts Representation for Visual Speech Recognition (2005) (0)
- Multilingual Speech and Language Processing (2001) (0)
- Eigenvoice modelling for cross likelihood ratio based speaker clustering: A Bayesian approach (2013) (0)
- Academic Strategy Planning For A University Research Centre (1996) (0)
- Investigation and comparison of robust stereo image matching using mutual information and hierarchical prior probabilities (2008) (0)
- Speech enhancement by eigen decomposition with two-channel observations (1995) (0)
- Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 28 - July 1, 2010 (2010) (0)
- Acoustic Adaptation in Cross Database Audio Visual SHMM Training for Phonetic Spoken Term Detection (2015) (0)
- Airports of the future : improving operation, security and experience (2014) (0)
- Design of a High Speed Stream Cipher (1992) (0)
- Fused HMM adaptation of synchronous HMMs for audio-visual speaker verification (2008) (0)
- An Auto-Tracking Auto-Beamforming Microphone Array for Sound Recording (1995) (0)
- Task Specific Visual Saliency Prediction with Memory Augmented Conditional Generative Adversarial Networks (2020) (0)
This paper list is powered by the following services:
What Schools Are Affiliated With Sridha Sridharan?
Sridha Sridharan is affiliated with the following schools: