Shrikanth Narayanan

Shrikanth Narayanan's AcademicInfluence.com Rankings

Engineering: World Rank #1070, Historical Rank #1645

Applied Physics: World Rank #217, Historical Rank #226

Electrical Engineering: World Rank #228, Historical Rank #261

Why Is Shrikanth Narayanan Influential?

According to Wikipedia, Shrikanth Narayanan is an Indian-American professor at the University of Southern California. He is an interdisciplinary engineer-scientist whose work focuses on human-centered signal processing and machine intelligence, with speech and spoken language processing at its core. A prolific, award-winning researcher, educator, and inventor with hundreds of publications and a number of acclaimed patents to his credit, he has pioneered several research areas, including computational speech science, speech and human language technologies, audio, music, and multimedia engineering, human sensing and imaging technologies, emotions research and affective computing, behavioral signal processing, and computational media intelligence. His technical contributions span applications in defense, security, health, education, media, and the arts. They continue to affect numerous domains, including human health, national defense and intelligence, and the media arts, including through technologies that facilitate awareness and support of diversity and inclusion. His award-winning patents have contributed to the proliferation of speech technologies on the cloud and on mobile devices and have enabled novel emotion-aware artificial intelligence technologies.

Shrikanth Narayanan's Published Works

[Chart: number of citations in a given year to any of this author's works]

[Chart: total number of citations to the author for the works published in a given year, which highlights the publication of the author's most important works]

Published Works (title, publication year, citation count)

IEMOCAP: interactive emotional dyadic motion capture database (2008) (2025)
Toward detecting emotions in spoken dialogs (2005) (1015)
The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing (2016) (984)
Analysis of emotion recognition using facial expressions, speech and multimodal information (2004) (870)
Acoustics of children's speech: developmental changes of temporal and spectral parameters. (1999) (790)
Environmental Sound Recognition With Time–Frequency Audio Features (2009) (634)
A System for Real-time Twitter Sentiment Analysis of 2012 U.S. Presidential Election Cycle (2012) (608)
The INTERSPEECH 2010 paralinguistic challenge (2010) (529)
The Vera am Mittag German audio-visual emotional speech database (2008) (407)
Emotion recognition using a hierarchical binary decision tree approach (2011) (392)
An approach to real-time magnetic resonance imaging for speech production. (2003) (336)
Primitives-based evaluation and estimation of emotions in speech (2007) (315)
Paralinguistics in speech and language - State-of-the-art and the challenge (2013) (295)
Analysis of Emotionally Salient Aspects of Fundamental Frequency for Emotion Detection (2009) (269)
Behavioral Signal Processing: Deriving Human Behavioral Informatics From Speech and Language (2013) (240)
Emotion recognition based on phoneme classes (2004) (236)
An articulatory study of fricative consonants using magnetic resonance imaging (1995) (213)
A Framework for Automatic Human Emotion Classification Using Emotion Profiles (2011) (213)
Robust Voice Activity Detection Using Long-Term Signal Variability (2011) (209)
Combining acoustic and language information for emotion recognition (2002) (206)
Robust recognition of children's speech (2003) (205)
Toward articulatory-acoustic models for liquid approximants based on MRI and EPG data. Part I. The laterals (1997) (204)
Automatic speaker age and gender recognition using acoustic and prosodic level information fusion (2013) (182)
Rigid Head Motion in Expressive Speech Animation: Analysis and Synthesis (2007) (176)
Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional LSTM modeling (2010) (176)
On Energy-Based Acoustic Source Localization for Sensor Networks (2008) (176)
Where am I? Scene Recognition for Mobile Robots using Audio Features (2006) (176)
Ada and Grace: Toward Realistic and Engaging Virtual Museum Guides (2010) (173)
Recognition of negative emotions from the speech signal (2001) (172)
Creating conversational interfaces for children (2002) (166)
Applying Machine Learning to Facilitate Autism Diagnostics: Pitfalls and Promises (2015) (164)
Learning Optimal Dialogue Strategies: A Case Study of a Spoken Dialogue Agent for Email (1998) (163)
Context-Sensitive Learning for Enhanced Audiovisual Emotion Classification (2012) (162)
An acoustic study of emotions expressed in speech (2004) (161)
Real-time magnetic resonance imaging and electromagnetic articulography database for speech production research (TC). (2014) (155)
Automatic Prosodic Event Detection Using Acoustic, Lexical, and Syntactic Evidence (2008) (154)
"Yeah Right": Sarcasm Recognition for Spoken Dialogue Systems (2006) (149)
Optimal Arousal Identification and Classification for Affective Computing Using Physiological Signals: Virtual Reality Stroop Task (2010) (143)
Text-Independent Voice Conversion Based on Unit Selection (2006) (141)
Interrelation Between Speech and Facial Gestures in Emotional Utterances: A Single Subject Study (2007) (138)
Feature analysis for automatic detection of pathological speech (2002) (137)
Synchronized and noise-robust audio recordings during realtime magnetic resonance imaging scans. (2006) (132)
Annotation and processing of continuous emotional attributes: Challenges and opportunities (2013) (131)
Interpreting ambiguous emotional expressions (2009) (130)
DARPA communicator dialog travel planning systems: the June 2000 data collection (2001) (129)
Expressive speech synthesis using a concatenative synthesizer (2002) (128)
Multimodal Physical Activity Recognition by Fusing Temporal and Cepstral Information (2010) (124)
Support Vector Regression for Automatic Recognition of Spontaneous Emotions in Speech (2007) (122)
A review of ASR technologies for children's speech (2009) (122)
Use of machine learning to improve autism screening and diagnostic instruments: effectiveness, efficiency, and multi-instrument fusion. (2016) (121)
Phrasal signatures in articulation (2000) (120)
Acoustic modelling of American English /r/ (1997) (119)
Automatic speech recognition for children (1997) (117)
A Review of Speaker Diarization: Recent Advances with Deep Learning (2021) (112)
Region Segmentation in the Frequency Domain Applied to Upper Airway Real-Time Magnetic Resonance Images (2009) (111)
Collaborative classification applications in sensor networks (2002) (110)
Tracking continuous emotional trends of participants during affective dyadic interactions using body language and speech information (2013) (109)
Robust ECG Biometrics by Fusing Temporal and Cepstral Information (2010) (108)
Quantification of prosodic entrainment in affective spontaneous spoken interactions of married couples (2010) (107)
An articulatory study of emotional speech production (2005) (107)
Toward automating a human behavioral coding system for married couples' interactions using speech acoustic features (2013) (105)
Real-time Emotion Detection System using Speech: Multi-modal Fusion of Different Timescale Features (2007) (101)
Toward articulatory-acoustic models for liquid approximants based on MRI and EPG data. Part II. The rhotics. (1997) (100)
Detecting emotional state of a child in a conversational computer game (2011) (100)
Content-based movie analysis and indexing based on audiovisual cues (2004) (100)
Environmental sound recognition using MP-based features (2008) (99)
The AT&T-DARPA communicator mixed-initiative spoken dialog system (2000) (99)
Exploiting Acoustic and Syntactic Features for Automatic Prosody Labeling in a Maximum Entropy Framework (2008) (98)
The psychologist as an interlocutor in autism spectrum disorder assessment: insights from a study of spontaneous prosody. (2014) (97)
Robust Speech Rate Estimation for Spontaneous Speech (2007) (96)
NTUA-SLP at SemEval-2018 Task 1: Predicting Affective Content in Tweets with Deep Attentive RNNs and Transfer Learning (2018) (95)
Deep convolutional recurrent neural network with attention mechanism for robust speech emotion recognition (2017) (93)
Using neutral speech models for emotional speech analysis (2007) (92)
Text to Speech Synthesis: New Paradigms and Advances (2004) (92)
Toward Effective Automatic Recognition Systems of Emotion in Speech (2014) (91)
Audio-Visual Emotion Recognition Using Gaussian Mixture Models for Face and Voice (2008) (90)
Timing effects of syllable structure and stress on nasals: A real-time MRI examination (2009) (90)
Expressive Facial Animation Synthesis by Learning Speech Coarticulation and Expression Spaces (2006) (89)
MUPET—Mouse Ultrasonic Profile ExTraction: A Signal Processing Tool for Rapid and Unsupervised Analysis of Ultrasonic Vocalizations (2017) (89)
Natural head motion synthesis driven by acoustic prosodic features (2005) (89)
Multimodal Prediction of Affective Dimensions and Depression in Human-Computer Interactions (2014) (89)
Tactical Language Training System: An Interim Report (2004) (87)
Decision level combination of multiple modalities for recognition and analysis of emotional expression (2010) (86)
Accelerated three‐dimensional upper airway MRI using compressed sensing (2009) (86)
A saliency-based auditory attention model with applications to unsupervised prominent syllable detection in speech (2007) (86)
Geometry, kinematics, and acoustics of Tamil liquid consonants. (1999) (84)
An Acoustic Measure for Word Prominence in Spontaneous Speech (2007) (82)
KNOWME: a case study in wireless body area sensor network design (2012) (81)
Smart room: participant and speaker localization and identification (2005) (80)
Computing vocal entrainment: A signal-derived PCA-based quantification scheme with application to affect analysis in married couple interactions (2014) (80)
Attention Assisted Discovery of Sub-Utterance Structure in Speech Emotion Recognition (2016) (78)
A Multimodal Real-Time MRI Articulatory Corpus for Speech Research (2011) (78)
Modeling mutual influence of interlocutor emotion states in dyadic spoken interactions (2009) (76)
"Rate My Therapist": Automated Detection of Empathy in Drug and Alcohol Counseling via Speech and Language Processing (2015) (76)
The USC Creative IT Database: A Multimodal Database of Theatrical Improvisation (2010) (76)
A generalized smoothness criterion for acoustic-to-articulatory inversion. (2010) (75)
A fast and flexible MRI system for the study of dynamic vocal tract shaping (2017) (73)
Classifying emotions in human-machine spoken dialogs (2002) (73)
Automatic syllable stress detection using prosodic features for pronunciation evaluation of language learners (2005) (71)
Automatic intelligibility classification of sentence-level pathological speech (2015) (71)
Emotion recognition using a data-driven fuzzy inference system (2003) (70)
Designing Contestability: Interaction Design, Machine Learning, and Mental Health (2017) (70)
Building topic specific language models from webdata using competitive models (2005) (69)
Noise source models for fricative consonants (2000) (68)
Data Augmentation Using GANs for Speech Emotion Recognition (2019) (68)
Distributional Semantic Models for Affective Text Analysis (2013) (68)
An investigation of articulatory setting using real-time magnetic resonance imaging. (2013) (68)
Acoustic topic model for audio information retrieval (2009) (67)
Analysis of children's speech: duration, pitch and formants (1997) (66)
Combining lexical, syntactic and prosodic cues for improved online dialog act tagging (2009) (65)
Analysis of user behavior under error conditions in spoken dialogs (2002) (65)
Robust Unsupervised Arousal Rating: A Rule-Based Framework with Knowledge-Inspired Vocal Features (2014) (65)
Tactical Language Training System: Supporting the Rapid Acquisition of Foreign Language and Cultural Skills (2004) (65)
Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap (2020) (65)
Rapid semi-automatic segmentation of real-time magnetic resonance images for parametric vocal tract analysis (2010) (63)
Improving speech recognition for children using acoustic adaptation and pronunciation modeling (2014) (63)
Prominence Detection Using Auditory Attention Cues and Task-Dependent High Level Information (2009) (63)
Automatic classification of married couples' behavior using audio features (2010) (63)
Refined speech segmentation for concatenative speech synthesis (2002) (62)
Acoustic feature analysis in speech emotion primitives estimation (2010) (61)
Human Perception of Audio-Visual Synthetic Character Emotion Expression in the Presence of Ambiguous and Conflicting Information (2009) (61)
Iterative Feature Normalization Scheme for Automatic Emotion Detection from Speech (2013) (60)
Rachel: Design of an emotionally targeted interactive agent for children with autism (2011) (60)
Strategies to Improve the Robustness of Agglomerative Hierarchical Clustering Under Data Source Variation for Speaker Diarization (2008) (60)
An automatic prosody recognizer using a coupled multi-stream acoustic model and a syntactic-prosodic language model (2005) (60)
Which ASR should I choose for my dialogue system? (2013) (59)
Speech emotion estimation in 3D space (2010) (57)
A robust frontend for VAD: exploiting contextual, discriminative and spectral cues of human voice (2013) (57)
2008 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2008) (57)
Data-driven analysis of realtime vocal tract MRI using correlated image regions (2010) (57)
Average divergence distance as a statistical discrimination measure for hidden Markov models (2006) (57)
On the robustness of overall F0-only modifications to the perception of emotions in speech. (2008) (56)
"That's Aggravating, Very Aggravating": Is It Possible to Classify Behaviors in Couple Interactions Using Automatically Derived Lexical Features? (2011) (56)
Irregularity-Aware Graph Fourier Transforms (2018) (55)
Interspeaker variability in hard palate morphology and vowel production. (2013) (54)
Signal Processing and Machine Learning for Mental Health Research and Clinical Applications [Perspectives] (2017) (54)
Spontaneous-Speech Acoustic-Prosodic Features of Children with Autism and the Interacting Psychologist (2012) (54)
Challenging Uncertainty in Query by Humming Systems: A Fingerprinting Approach (2008) (53)
Flexible retrospective selection of temporal resolution in real‐time speech MRI using a golden‐ratio spiral view order (2011) (53)
Speaker Verification Using Sparse Representations on Total Variability i-vectors (2011) (53)
Limited domain synthesis of expressive military speech for animated characters (2002) (53)
The USC CreativeIT database of multimodal dyadic interactions: from speech and full body motion capture to continuous emotional annotations (2016) (52)
A Computational Study of Expressive Facial Dynamics in Children with Autism (2018) (51)
Simplified supervised i-vector modeling with application to robust and efficient language identification and speaker verification (2014) (51)
Politeness and frustration language in child-machine interactions (2001) (51)
A System for Technology Based Assessment of Language and Literacy in Young Children: the Role of Multiple Information Sources (2007) (50)
Visual emotion recognition using compact facial representations and viseme information (2010) (50)
Morphological variation in the adult hard palate and posterior pharyngeal wall. (2013) (50)
Analyzing Children's Speech: An Acoustic Study of Consonants and Consonant-Vowel Transition (2006) (50)
Automatic acoustic synthesis of human-like laughter. (2007) (50)
Automatic speech recognition using articulatory features from subject-independent acoustic-to-articulatory inversion. (2011) (50)
VPQ: a spoken language interface to large scale directory information (1998) (49)
Iterative feature normalization for emotional speech detection (2011) (49)
Acoustic modeling of American English (2000) (49)
Paralinguistic mechanisms of production in human "beatboxing": a real-time magnetic resonance imaging study. (2013) (49)
Speaker change detection using a new weighted distance measure (2002) (49)
Analyzing the language of therapist empathy in Motivational Interview based psychotherapy (2012) (48)
Detecting prominence in conversational speech: pitch accent, givenness and focus (2008) (48)
Classification of sound clips by two schemes: Using onomatopoeia and semantic labels (2008) (47)
Investigating Implicit Cues for User State Estimation in Human-Robot Interaction Using Physiological Measurements (2007) (46)
Sparse Representation of Electrodermal Activity With Knowledge-Driven Dictionaries (2015) (45)
Unsupervised speaker indexing using generic models (2005) (45)
Robust language identification using convolutional neural network features (2014) (45)
Spoken dialog systems for children (1998) (45)
Kernel Models for Affective Lexicon Creation (2011) (45)
Recording audio-visual emotional databases from actors: a closer look (2008) (45)
TBALL data collection: the making of a young children's speech corpus (2005) (45)
Combining acoustic, lexical, and syntactic evidence for automatic unsupervised prosody labeling (2006) (44)
Constructing emotional speech synthesizers with limited speech database (2004) (44)
Robust speaker identification based on selective use of feature vectors (2007) (44)
Tracking changes in continuous emotion states using body language and prosodic cues (2011) (43)
Hassan: A Virtual Human for Tactical Questioning (2007) (43)
The INTERSPEECH 2020 Far-Field Speaker Verification Challenge (2020) (42)
Modeling therapist empathy through prosody in drug addiction counseling (2014) (42)
An Overview on Perceptually Motivated Audio Indexing and Classification (2013) (42)
Emotions in “Black and White” or Shades of Gray? How We Think About Emotion Shapes Our Perception and Neural Representation of Emotion (2016) (42)
Predicting therapist empathy in motivational interviews using language features inspired by psycholinguistic norms (2015) (41)
Combining categorical and primitives-based emotion recognition (2006) (41)
Automatic recognition of emotion evoked by general sound events (2012) (41)
Classification of cognitive load from speech using an i-vector framework (2014) (41)
Multimodal Sensing for Pediatric Obesity Applications (2008) (41)
Enhanced airway-tissue boundary segmentation for real-time magnetic resonance imaging data (2014) (41)
A robust stopping criterion for agglomerative hierarchical clustering in a speaker diarization system (2007) (40)
Head Motion Modeling for Human Behavior Analysis in Dyadic Interaction (2015) (40)
Audio retrieval by latent perceptual indexing (2008) (40)
A reranking approach for recognition and classification of speech input in conversational dialogue systems (2012) (40)
Evaluating spoken dialog systems for telecommunication services (1997) (39)
Adversarial Attack and Defense Strategies for Deep Speaker Recognition Systems (2020) (39)
A Globally-Variant Locally-Constant Model for Fusion of Labels from Multiple Diverse Experts without Using Reference Labels (2013) (39)
Direct Estimation of Articulatory Kinematics from Real-Time Magnetic Resonance Image Sequences (2011) (39)
A subject-independent acoustic-to-articulatory inversion (2011) (39)
Linguistic analysis of differences in portrayal of movie characters (2017) (39)
TILES-2018, a longitudinal physiologic and behavioral data set of hospital workers (2020) (39)
Improved Speech Recognition using Acoustic and Lexical Correlates of Pitch Accent in a N-Best Rescoring Framework (2007) (39)
Machine learning and natural language processing in psychotherapy research: Alliance as example use case. (2020) (39)
An Analysis of PCA-Based Vocal Entrainment Measures in Married Couples' Affective Spoken Interactions (2011) (38)
Audio-based head motion synthesis for Avatar-based telepresence systems (2004) (38)
A technology prototype system for rating therapist empathy from audio recordings in addiction counseling (2016) (37)
A Deep Learning Approach to Modeling Empathy in Addiction Counseling (2016) (37)
Analysis of speech production real-time MRI (2018) (37)
Modeling therapist empathy and vocal entrainment in drug addiction counseling (2013) (37)
Automatic detection and classification of disfluent reading miscues in young children's speech for the purpose of assessment (2007) (36)
Investigating the role of phoneme-level modifications in emotional speech resynthesis (2005) (36)
Fuzzy Logic Models for the Meaning of Emotion Words (2013) (36)
Effect of bandwidth extension to telephone speech recognition in cochlear implant users. (2009) (36)
Using cognitive task analysis to facilitate collaboration in development of simulator to accelerate surgical training. (2004) (36)
An individually tailored family-centered intervention for pediatric obesity in primary care: study protocol of a randomized type II hybrid effectiveness–implementation trial (Raising Healthy Children study) (2018) (36)
An empirical text transformation method for spontaneous speech synthesizers (2003) (36)
Behavioral signal processing for understanding (distressed) dyadic interactions: some recent developments (2011) (35)
Using Articulatory Representations to Detect Segmental Errors in Nonnative Pronunciation (2008) (35)
A multi-pass linear fold algorithm for sentence boundary detection using prosodic cues (2004) (35)
Real-Time Monitoring of Participants' Interaction in a Meeting using Audio-Visual Sensors (2007) (35)
On Short-Time Estimation of Vocal Tract Length from Formant Frequencies (2015) (35)
Vocal tract cross-distance estimation from real-time MRI using region-of-interest analysis (2013) (35)
Pykaldi: A Python Wrapper for Kaldi (2018) (34)
Analysis of pausing behavior in spontaneous speech using real-time magnetic resonance imaging of articulation. (2009) (34)
Design feasibility of an automated, machine-learning based feedback system for motivational interviewing. (2019) (34)
On quantifying facial expression-related atypicality of children with Autism Spectrum Disorder (2015) (34)
An HMM-based approach to humming transcription (2002) (34)
A hierarchical framework for modeling multimodality and emotional evolution in affective dialogs (2012) (34)
Speaker verification based on the fusion of speech acoustics and inverted articulatory signals (2016) (33)
Interplay between linguistic and affective goals in facial expression during emotional utterances (2006) (33)
The expression and perception of emotions: comparing assessments of self versus others (2008) (33)
Analyzing the memory of BLSTM Neural Networks for enhanced emotion classification in dyadic spoken interactions (2012) (33)
"It sounds like...": A natural language processing approach to detecting counselor reflections in motivational interviewing. (2016) (33)
A review of the acoustic and linguistic properties of children's speech (2007) (33)
Information divergence estimation based on data-dependent partitions (2010) (33)
Intoxicated Speech Detection by Fusion of Speaker Normalized Hierarchical Features and GMM Supervectors (2011) (32)
Detecting Politeness and frustration state of a child in a conversational computer game (2005) (32)
Paralinguistic event detection from speech using probabilistic time-series smoothing and masking (2013) (32)
Behavioral Coding of Therapist Language in Addiction Counseling Using Recurrent Neural Networks (2016) (32)
Pronunciation verification of children's speech for automatic literacy assessment (2006) (32)
Modified-prior i-vector estimation for language identification of short duration utterances (2014) (32)
Transonics: a speech to speech system for English-Persian interactions (2003) (32)
Split-lexicon based hierarchical recognition of speech using syllable and word level acoustic units (2003) (32)
Clinical state tracking in serious mental illness through computational analysis of speech (2020) (32)
Emotion Twenty Questions: Toward a Crowd-Sourced Theory of Emotions (2011) (32)
Automatic diacritization of Arabic transcripts for automatic speech recognition (2005) (32)
A hierarchical static-dynamic framework for emotion classification (2011) (32)
Pathological speech processing: State-of-the-art, current challenges, and future directions (2016) (32)
Proceedings of the 8th Annual Conference on the Science of Dissemination and Implementation (2016) (32)
Improved imaging of lingual articulation using real‐time multislice MRI (2012) (32)
An analysis of multimodal cues of interruption in dyadic spoken interactions (2008) (31)
Dynamic chroma feature vectors with applications to cover song identification (2008) (31)
Pronunciation variations of Spanish-accented English spoken by young children (2005) (31)
Coregulation of therapist and client emotion during psychotherapy (2020) (31)
Enhanced Sparse Imputation Techniques for a Robust Speech Recognition Front-End (2011) (31)
Speaker-Invariant Affective Representation Learning via Adversarial Training (2019) (31)
Saliency-driven unstructured acoustic scene classification using latent perceptual indexing (2009) (30)
Using Multimodal Wearable Technology to Detect Conflict among Couples (2017) (30)
A Case Study: Detecting Counselor Reflections in Psychotherapy for Addictions using Linguistic Features (2012) (30)
3D dynamic MRI of the vocal tract during natural speech (2018) (30)
Exploiting Acoustic and Syntactic Features for Prosody Labeling in a Maximum Entropy Framework (2007) (30)
Analyzing speech rate entrainment and its relation to therapist empathy in drug addiction counseling (2015) (30)
TILES audio recorder: an unobtrusive wearable solution to track audio activity (2018) (30)
Acoustic-prosodic, turn-taking, and language cues in child-psychologist interactions for varying social demand (2013) (30)
An Iterative Relative Entropy Minimization-Based Data Selection Approach for n-Gram Model Adaptation (2009) (29)
Text data acquisition for domain-specific language models (2006) (29)
Analysis and Predictive Modeling of Body Language Behavior in Dyadic Interactions From Multimodal Interlocutor Cues (2014) (29)
Computational Analysis and Simulation of Empathic Behaviors: a Survey of Empathy Modeling with Behavioral Signal Processing Framework (2016) (29)
Efficient scalable encoding for distributed speech recognition (2006) (29)
Statistical methods for estimation of direct and differential kinematics of the vocal tract (2013) (28)
Imaging axonal damage in multiple sclerosis by means of MR spectroscopy (2000) (28)
Advances in real-time magnetic resonance imaging of the vocal tract for speech science and technology research (2016) (28)
Quantifying atypicality in affective facial expressions of children with autism spectrum disorders (2013) (28)
Rapid Language Identification (2015) (28)
Object classification in sidescan sonar images with sparse representation techniques (2012) (27)
An exploratory study of emotional speech production using functional data analysis techniques (2006) (27)
Robust Object Classification in Underwater Sidescan Sonar Images by Using Reliability-Aware Fusion of Shadow Features (2015) (27)
A Bayesian network classifier for word-level reading assessment (2007) (27)
Modeling Dynamics of Expressive Body Gestures In Dyadic Interactions (2017) (27)
Automatic estimation of Parkinson's disease severity from diverse speech tasks (2015) (27)
Spoken Language Dialogue: From Theory to Practice (1999) (27)
Spatio-temporal articulatory movement primitives during speech production: extraction, interpretation, and validation. (2013) (27)
Multi-band long-term signal variability features for robust voice activity detection (2013) (27)
Accurate transcription of broadcast news speech using multiple noisy transcribers and unsupervised reliability metrics (2011) (27)
Violence Rating Prediction from Movie Scripts (2019) (27)
Behaviorally-based couple therapies reduce emotional arousal during couple conflict. (2015) (27)
Music fingerprint extraction for classical music cover song identification (2008) (27)
A multimodal mixture-of-experts model for dynamic emotion prediction in movies (2016) (27)
Novel Variations of Group Sparse Regularization Techniques With Applications to Noise Robust Automatic Speech Recognition (2012) (26)
Semantic Edge Detection for Tracking Vocal Tract Air-Tissue Boundaries in Real-Time Magnetic Resonance Images (2017) (26)
Influence of modelling strategies on uncertainty propagation in the alternate path mechanism of reinforced concrete framed structures (2016) (26)
Adaptive categorical understanding for spoken dialogue systems (2005) (26)
Movie Content Analysis, Indexing and Skimming Via Multimodal Information (2003) (26)
Discriminative Wavelet Packet Filter Bank Selection for Pattern Recognition (2009) (26)
Speech rate estimation via temporal correlation and selected sub-band correlation (2005) (26)
Tactical Language Detection and Modeling of Learner Speech Errors: The case of Arabic tactical language training for American English speakers (2004) (26)
Transonics: A Practical Speech-to-Speech Translator for English-Farsi Medical Dialogs (2005) (25)
Factor analysis of vocal-tract outlines derived from real-time magnetic resonance imaging data (2015) (25)
Content Analysis for Acoustic Environment Classification in Mobile Robots (2006) (25)
Acoustic-prosodic correlates of 'awkward' prosody in story retellings from adolescents with autism (2015) (25)
Recognition of physical activities in overweight Hispanic youth using KNOWME Networks. (2012) (25)
Dynamic 3-D Visualization of Vocal Tract Shaping During Speech (2013) (25)
A study of emotional speech articulation using a fast magnetic resonance imaging technique (2006) (24)
Nonproduct Data-Dependent Partitions for Mutual Information Estimation: Strong Consistency and Applications (2010) (24)
Intoxicated speech detection: A fusion framework with speaker-normalized hierarchical functionals and GMM supervectors (2014) (24)
Speaker identification using supra-segmental pitch pattern dynamics (2004) (24)
Scripted dialogs versus improvisation: lessons learned about emotional elicitation techniques from the IEMOCAP database (2008) (24)
On the implementation of ASR algorithms for hand-held wireless mobile devices (2001) (24)
The USC CARE Corpus: Child-Psychologist Interactions of Children with Autism Spectrum Disorders (2011) (24)
Speaker verification using simplified and supervised i-vector modeling (2013) (24)
"You made me do it": Classification of Blame in Married Couples' Interactions by Fusing Automatically Derived Speech and Language Information (2011) (24)
Advancing methods for reliably assessing motivational interviewing fidelity using the motivational interviewing skills code. (2015) (24)
Predicting interruptions in dyadic spoken interactions (2010) (24)
Speaker states recognition using latent factor analysis based Eigenchannel factor vector modeling (2012) (24)
Stochastic Networked Computation (2010) (24)
New Frontiers in Ambulatory Assessment (2017) (24)
Closure duration analysis of incomplete stop consonants due to stop-stop interaction. (2009) (24)
Real-time magnetic resonance imaging investigation of resonance tuning in soprano singing. (2010) (23)
Towards modeling user behavior in human-machine interactions: Effect of Errors and Emotions (2002) (23)
Optimal Time-Resource Allocation for Energy-Efficient Physical Activity Detection (2011) (23)
The Transonics Spoken Dialogue Translator: An Aid for English-Persian Doctor-Patient Interviews (2004) (23)
Investigating articulatory setting - pauses, ready position, and rest - using real-time MRI (2010) (23)
A Statistical Approach for Modeling Prosody Features using POS Tags for Emotional Speech Synthesis (2007) (23)
Evaluation of swallow function after tongue cancer treatment using real-time magnetic resonance imaging: a pilot study. (2013) (23)
Dynamic off‐resonance correction for spiral real‐time MRI of speech (2018) (23)
Unsupervised Adaptation of Categorical Prosody Models for Prosody Labeling and Speech Recognition (2009) (23)
Toward active and unobtrusive engagement assessment of distance learners (2017) (23)
Upper Bound Kullback–Leibler Divergence for Transient Hidden Markov Models (2008) (22)
A quantitative analysis of gender differences in movies using psycholinguistic normatives (2015) (22)
Universal Consistency of Data-Driven Partitions for Divergence Estimation (2007) (22)
Automatic detection of voice onset time contrasts for use in pronunciation assessment (2006) (22)
Gender Representation in Cinematic Content: A Multimodal Approach (2015) (22)
Predicting couple therapy outcomes based on speech acoustic features (2017) (22)
A dialog act tagging approach to behavioral coding: a case study of addiction counseling conversations (2015) (22)
KNOWME: An Energy-Efficient Multimodal Body Area Network for Physical Activity Monitoring (2012) (22)
State-of-the-Art MRI Protocol for Comprehensive Assessment of Vocal Tract Structure and Function (2016) (22)
Joint Analysis of the Emotional Fingerprint in the Face and Speech: A single subject study (2007) (22)
Fine-grained pitch accent and boundary tone labeling with parametric F0 features (2008) (22)
Development of Socially Assistive Robots For Children With Autism Spectrum Disorders (2009) (22)
Automated evaluation of non-native English pronunciation quality: combining knowledge- and data-driven features at multiple time scales (2015) (22)
Semi-Supervised and Transfer Learning Approaches for Low Resource Sentiment Classification (2018) (22)
Automated evaluation of psychotherapy skills using speech and language technologies (2021) (22)
Characterizing Types of Convolution in Deep Convolutional Recurrent Neural Networks for Robust Speech Emotion Recognition (2017) (21)
Theoretical Analysis of Diversity in an Ensemble of Automatic Speech Recognition Systems (2014) (21)
A two-step technique for MRI audio enhancement using dictionary learning and wavelet packet analysis (2013) (21)
Velic coordination in French nasals: a real-time magnetic resonance imaging study (2013) (21)
Classifying language-related developmental disorders from speech cues: the promise and the potential confounds (2013) (21)
Online Affect Tracking with Multimodal Kalman Filters (2016) (21)
Articulatory Synthesis Based on Real-Time Magnetic Resonance Imaging Data (2016) (21)
Structured sparse methods for active ocean observation systems with communication constraints (2015) (21)
Multi-Label Multi-Task Deep Learning for Behavioral Coding (2018) (21)
Agglomerative hierarchical speaker clustering using incremental Gaussian mixture cluster modeling (2008) (21)
Adaptive speaker identification with audiovisual cues for movie content analysis (2004) (20)
Speech in Affective Computing (2015) (20)
An exploratory study of manifolds of emotional speech (2010) (20)
Automatic Detection of Disfluency Boundaries in Spontaneous Speech of Children Using Audio–Visual Information (2009) (20)
Characterizing Vocal Tract Dynamics Across Speakers Using Real-Time MRI (2016) (20)
Study The Effect Of Cryogenic Cooling On Machinability Characteristics During Turning Duplex Stainless Steel 2205 (2018) (20)
Multimodal Human and Environmental Sensing for Longitudinal Behavioral Studies in Naturalistic Settings: Framework for Sensor Selection, Deployment, and Management (2019) (20)
Assessment of emerging reading skills in young native speakers and language learners (2009) (20)
Novel 16‐channel receive coil array for accelerated upper airway MRI at 3 Tesla (2011) (20)
Multiple Instance Learning for Classification of Human Behavior Observations (2011) (19)
A statistical approach to retrieval under user-dependent uncertainty in query-by-humming systems (2004) (19)
SAIL: A hybrid approach to sentiment analysis (2013) (19)
Long-Term SNR Estimation of Speech Signals in Known and Unknown Channel Conditions (2016) (19)
A Robust Unsupervised Arousal Rating Framework using Prosody with Cross-Corpora Evaluation (2012) (19)
Affective State Recognition in Married Couples' Interactions Using PCA-Based Vocal Entrainment Measures with Multiple Instance Learning (2011) (19)
Better nonnative intonation scores through prosodic theory (2008) (19)
Processing speech signal using auditory-like filterbank provides least uncertainty about articulatory gestures. (2011) (19)
Analysis of interaction attitudes using data-driven hand gesture phrases (2014) (19)
Radiobot-CFF: a spoken dialogue system for military training (2006) (19)
Articulatory synthesis of French connected speech from EMA data (2013) (19)
Vector-based Representation and Clustering of Audio Using Onomatopoeia Words (2006) (19)
Automatic speech recognition system channel modeling (2010) (19)
Language-adaptive Persian speech recognition (2003) (19)
Attention Networks for Modeling Behaviors in Addiction Counseling (2017) (19)
Lessons Learned: Recommendations For Implementing a Longitudinal Study Using Wearable and Environmental Sensors in a Health Care Organization (2019) (19)
Automatic main melody extraction from MIDI files with a modified Lempel-Ziv algorithm (2001) (19)
Robust talking face video verification using joint factor analysis and sparse representation on GMM mean shifted supervectors (2011) (19)
Morphological Variation in the Adult Vocal Tract: A Modeling Study of its Potential Acoustic Impact (2011) (18)
A statistical multidimensional humming transcription using phone level hidden Markov models for query by humming systems (2003) (18)
Piecewise linear stylization of pitch via wavelet analysis (2005) (18)
Characterizing Articulation in Apraxic Speech Using Real-Time Magnetic Resonance Imaging. (2017) (18)
Multimodal detection of fake social media use through a fusion of classification and pairwise ranking systems (2017) (18)
Grasp: A MATLAB toolbox for graph signal processing (2017) (18)
The FFSVC 2020 Evaluation Plan (2020) (18)
Analysis of engagement behavior in children during dyadic interactions using prosodic cues (2016) (18)
Database of Volumetric and Real-Time Vocal Tract MRI for Speech Science (2017) (18)
A Generative Student Model for Scoring Word Reading Skills (2011) (18)
Computationally deconstructing movie narratives: An informatics approach (2015) (17)
Estimation of ordinal approach-avoidance labels in dyadic interactions: Ordinal logistic regression approach (2011) (17)
Computational Media Intelligence: Human-Centered Machine Analysis of Media (2021) (17)
An investigation of vocal arousal dynamics in child-psychologist interactions using synchrony measures and a conversation-based model (2014) (17)
A Multi-task Approach to Learning Multilingual Representations (2018) (17)
Tweester at SemEval-2016 Task 4: Sentiment Analysis in Twitter Using Semantic-Affective Model Adaptation (2016) (17)
Automatic Dynamic Expression Synthesis For Speech Animation (2004) (17)
Speaker Diarization With Lexical Information (2018) (17)
Learning Expressive Human-Like Head Motion Sequences from Speech (2008) (17)
Multimodal Representation Learning using Deep Multiset Canonical Correlation (2019) (17)
Improvements in English ASR for the MALACH project using syllable-centric models (2003) (17)
Virtual Microphones for Multichannel Audio Resynthesis (2003) (17)
An interval type-2 fuzzy logic system to translate between emotion-related vocabularies (2008) (17)
Deblurring for spiral real‐time MRI using convolutional neural networks (2020) (17)
Developmental acoustic study of American English diphthongs. (2014) (16)
Using physiology and language cues for modeling verbal response latencies of children with ASD (2013) (16)
Analyzing the Nature of ECA Interactions in Children with Autism (2011) (16)
Spoken language synthesis: experiments in synthesis of spontaneous monologues (2002) (16)
Exploiting prosodic features for dialog act tagging in a discriminative modeling framework (2007) (16)
SAIL: Sentiment Analysis using Semantic Similarity and Contrast Features (2014) (16)
Explaining Coronal Reduction: Prosodic Structure and Articulatory Posture (2018) (16)
TRAP language identification system for RATS phase II evaluation (2013) (16)
Audio Scene Understanding using Topic Models (2009) (16)
Latent acoustic topic models for unstructured audio classification (2012) (16)
Real-Time Software Implementation of H.264 Baseline Profile Video Encoder for Mobile and Handheld Devices (2006) (16)
A kinematic study of critical and non-critical articulators in emotional speech production. (2015) (16)
Acoustic analysis and automatic recognition of spontaneous children's speech (2006) (16)
Feasibility of through‐time spiral generalized autocalibrating partial parallel acquisition for low latency accelerated real‐time MRI of speech (2017) (16)
Analysis of disfluent repetitions in spontaneous speech recognition (2006) (16)
Acoustical analysis of engagement behavior in children (2012) (16)
Pitch Contour Stylization Using an Optimal Piecewise Polynomial Approximation (2009) (16)
Human perception of synthetic character emotions in the presence of conflicting and congruent vocal and facial expressions (2008) (16)
Joint source-filter optimization for robust glottal source estimation in the presence of shimmer and jitter (2011) (16)
Interaction between general prosodic factors and language-specific articulatory patterns underlies divergent outcomes of coronal stop reduction (2014) (16)
Robust Speaker Recognition Using Unsupervised Adversarial Invariance (2019) (16)
ASCII based transcription systems for languages with the Arabic script: the case of Persian (2003) (15)
Speech Recognition Engineering Issues in Speech to Speech Translation System Design for Low Resource Languages and Domains (2006) (15)
Test-retest repeatability of human speech biomarkers from static and real-time dynamic magnetic resonance imaging. (2017) (15)
Comparison of child-human and child-computer interactions based on manual annotations (2009) (15)
Modeling Multiple Time Series Annotations as Noisy Distortions of the Ground Truth: An Expectation-Maximization Approach (2018) (15)
A Comparative Study of Stress and Anxiety Estimation in Ecological Settings Using a Smart-shirt and a Smart-bracelet (2019) (15)
Creation of a Doctor-Patient Dialogue Corpus Using Standardized Patients (2004) (15)
Objective Language Feature Analysis in Children with Neurodevelopmental Disorders During Autism Assessment (2016) (15)
Robust Speech Activity Detection in Movie Audio: Data Resources and Experimental Evaluation (2019) (15)
Unsupervised Discovery of Character Dictionaries in Animation Movies (2018) (15)
Differential expression of carotenoid biosynthetic pathway genes in two contrasting tomato genotypes for lycopene content (2016) (15)
Sounds of the Human Vocal Tract (2017) (15)
Neural Speech Decoding During Audition, Imagination and Production (2020) (15)
Semi-Automatic Processing of Real-time MR Image Sequences for Speech Production Studies (2006) (15)
Articulatory characterization of English liquid-final rimes (2019) (15)
Novel inter-cluster distance measure combining GLR and ICR for improved agglomerative hierarchical speaker clustering (2008) (15)
Speaker Diarization Using Latent Space Clustering in Generative Adversarial Network (2019) (15)
Imputing Missing Data In Large-Scale Multivariate Biomedical Wearable Recordings Using Bidirectional Recurrent Neural Networks With Temporal Activation Regularization (2019) (15)
Language Features for Automated Evaluation of Cognitive Behavior Psychotherapy Sessions (2018) (15)
Effects of dialog initiative and multi-modal presentation strategies on large directory information access (2000) (15)
Robust speaker clustering strategies to data source variation for improved speaker diarization (2007) (15)
Therapy language analysis using automatically generated psycholinguistic norms (2015) (15)
Improving Gender Identification in Movie Audio Using Cross-Domain Data (2018) (15)
A Person-Organisation Fit Study of College Work Culture and Its Impact on Behavioural Intentions of Teachers (2009) (14)
A non-homogeneous Poisson process model of Skin Conductance Responses integrated with observed regulatory behaviors for Autism intervention (2014) (14)
On Evaluating CNN Representations for Low Resource Medical Image Classification (2019) (14)
Reliability-Weighted Acoustic Model Adaptation Using Crowd Sourced Transcriptions (2011) (14)
Toward automatic vocal tract area function estimation from accelerated three-dimensional magnetic resonance imaging (2013) (14)
Evaluating evaluators: a case study in understanding the benefits and pitfalls of multi-evaluator modeling (2009) (14)
Statistical Modeling and Retrieval of Polyphonic Music (2007) (14)
Characterizing Post-Glossectomy Speech Using Real-time MRI (2013) (14)
The Ambiguous World of Emotion Representation (2019) (14)
Automatic Identification of Salient Acoustic Instances in Couples' Behavioral Interactions Using Diverse Density Support Vector Machines (2011) (14)
Affective Feature Design and Predicting Continuous Affective Dimensions from Music (2014) (14)
Toward Visual Voice Activity Detection for Unconstrained Videos (2019) (14)
Factored translation models for enriching spoken language translation with prosody (2008) (14)
Estimation of children's reading ability by fusion of automatic pronunciation verification and fluency detection (2008) (14)
Selecting relevant text subsets from web-data for building topic specific language models (2006) (14)
Automatic Data-Driven Learning of Articulatory Primitives from Real-Time MRI Data Using Convolutive NMF with Sparseness Constraints (2011) (14)
Quantifying EDA synchrony through joint sparse representation: A case-study of couples' interactions (2015) (14)
Statistical multi-stream modeling of real-time MRI articulatory speech data (2010) (14)
Analysis of emotional effect on speech-body gesture interplay (2014) (14)
Acoustic-Prosodic and Turn-Taking Features in Interactions with Children with Neurodevelopmental Disorders (2016) (14)
Speed-accuracy tradeoffs in human speech production (2018) (14)
Motion-Capture Patterns of Voluntarily Mimicked Dynamic Facial Expressions in Children and Adolescents With and Without ASD (2018) (14)
Interplay between verbal response latency and physiology of children with autism during ECA interactions (2012) (13)
A semi-supervised learning approach to online audio background detection (2009) (13)
Analysis of Audio Clustering using Word Descriptions (2007) (13)
A Virtual Human for Tactical Questioning (2007) (13)
Are Articulatory Settings Mechanically Advantageous for Speech Motor Control? (2014) (13)
Simplifying emotion classification through emotion distillation (2012) (13)
Emotion classification from speech using evaluator reliability-weighted combination of ranked lists (2011) (13)
Recognition for synthesis: Automatic parameter selection for resynthesis of emotional speech from neutral speech (2008) (13)
Acoustic Analysis of Preschool Children's Speech (2003) (13)
Articulation of Mandarin Sibilants: a multi-plane realtime MRI study (2012) (13)
End-to-End Neural Systems for Automatic Children Speech Recognition: An Empirical Study (2021) (13)
An acoustic analysis of shared enjoyment in ECA interactions of children with autism (2012) (13)
A supervised signal-to-noise ratio estimation of speech signals (2014) (13)
A new multichannel multi modal dyadic interaction database (2010) (13)
Emphatic segments and emphasis spread in Lebanese Arabic: a Real-time Magnetic Resonance Imaging Study (2012) (13)
Speaker Personality Classification Using Systems Based on Acoustic-Lexical Cues and an Optimal Tree-Structured Bayesian Network (2012) (13)
Directly data-derived articulatory gesture-like representations retain discriminatory information about phone categories (2016) (13)
Using interval type-2 fuzzy logic to analyze Turkish emotion words (2012) (13)
A multimodal analysis of physical activity, sleep, and work shift in nurses with wearable sensor data (2021) (13)
Co-registration of speech production datasets from electromagnetic articulography and real-time magnetic resonance imaging. (2014) (13)
Accelerated 3D MRI of vocal tract shaping using compressed sensing and parallel imaging (2009) (13)
Meta-Learning With Latent Space Clustering in Generative Adversarial Network for Speaker Diarization (2020) (13)
Combined Speaker Clustering and Role Recognition in Conversational Speech (2018) (13)
A method for on-line speaker indexing using generic reference models (2003) (13)
Predicting client's inclination towards target behavior change in motivational interviewing and investigating the role of laughter (2014) (13)
Based on Isolated Saliency or Causal Integration? Toward a Better Understanding of Human Annotation Process using Multiple Instance Learning and Sequential Probability Ratio Test (2012) (13)
A top-down auditory attention model for learning task dependent influences on prominence detection in speech (2008) (12)
Head motion synchrony and its correlation to affectivity in dyadic interactions (2013) (12)
Convex Hull Convolutive Non-Negative Matrix Factorization for Uncovering Temporal Patterns in Multivariate Time-Series Data (2016) (12)
Underwater communication implementation with OFDM (2015) (12)
A study on the effect of prosodic emphasis transfer on overall speech translation quality (2013) (12)
Intelligibility classification of pathological speech using fusion of multiple high level descriptors (2012) (12)
A Novel Method for Human Bias Correction of Continuous-Time Annotations (2018) (12)
Data-dependent evaluator modeling and its application to emotional valence classification from speech (2010) (12)
An Exploratory Study of the Relations Between Perceived Emotion Strength and Articulatory Kinematics (2011) (12)
Modeling the intonation of discourse segments for improved online dialog act tagging (2008) (12)
Detecting paralinguistic events in audio stream using context in features and probabilistic decisions (2016) (12)
Simplified and supervised i-vector modeling for speaker age regression (2014) (12)
Speaker Agnostic Foreground Speech Detection from Audio Recordings in Workplace Settings from Wearable Recorders (2019) (12)
An analysis of vocal tract shaping in English sibilant fricatives using real-time magnetic resonance imaging (2008) (12)
A study of generic models for unsupervised on-line speaker indexing (2003) (12)
The language of interpersonal interaction: An interdisciplinary approach to assessing and processing vocal and speech data (2018) (12)
Language model adaptation for spoken language systems (1998) (12)
Efficient scalable speech compression for scalable speech recognition (2001) (12)
Gestural Control in the English Past-Tense Suffix: An Articulatory Study Using Real-Time MRI (2015) (12)
Complexity in Speech and its Relation to Emotional Bond in Therapist-Patient Interactions During Suicide Risk Assessment Interviews (2017) (12)
Upper Bound Kullback-Leibler Divergence for Hidden Markov Models with Application as Discrimination Measure for Speech Recognition (2006) (12)
Tweester at SemEval-2017 Task 4: Fusion of Semantic-Affective and pairwise classification models for sentiment analysis in Twitter (2017) (11)
Automatic Prediction of Suicidal Risk in Military Couples Using Multimodal Interaction Cues from Couples Conversations (2019) (11)
Family-of-origin aggression, dating aggression, and physiological stress reactivity in daily life (2019) (11)
Toward Robust Interpretable Human Movement Pattern Analysis in a Workplace Setting (2019) (11)
Strategies for Disseminating Information on Biomedical Research on Autism to Hispanic Parents (2016) (11)
Meta-Learning for Robust Child-Adult Classification from Speech (2019) (11)
An Expectation Maximization Approach to Joint Modeling of Multidimensional Ratings Derived from Multiple Annotators (2016) (11)
USC-EMO-MRI corpus: An emotional speech production database recorded by real-time magnetic resonance imaging (2014) (11)
Toward the Automatic Extraction of Policy Networks Using Web Links and Documents (2013) (11)
On-line genre classification of TV programs using audio content (2013) (11)
Speaker verification based on fusion of acoustic and articulatory information (2013) (11)
Morphological Variation in the Adult Vocal Tract: A Study Using rtMRI (2010) (11)
Adversarial Defense for Deep Speaker Recognition Using Hybrid Adversarial Training (2020) (11)
Supervised acoustic topic model for unstructured audio information retrieval (2010) (11)
Intrapersonal and interpersonal vocal affect dynamics during psychotherapy. (2021) (11)
Multidimensional humming transcription using a statistical approach for query by humming systems (2003) (11)
A text-free approach to assessing nonnative intonation (2007) (11)
Using Prosodic and Lexical Information for Learning Utterance-level Behaviors in Psychotherapy (2018) (11)
Data driven modeling of head motion towards analysis of behaviors in couple interactions (2013) (11)
ACOUSTIC-SYNTACTIC MAXIMUM ENTROPY MODEL FOR AUTOMATIC PROSODY LABELING (2006) (11)
Language model adaptation using WWW documents obtained by utterance-based queries (2010) (10)
Power-spectral analysis of head motion signal for behavioral modeling in human interaction (2014) (10)
Exploring sparse representation measures of physiological synchrony for romantic couples (2017) (10)
Gesture dynamics modeling for attitude analysis using graph based transform (2014) (10)
Joint-processing of audio-visual signals in human perception of conflicting synthetic character emotions (2008) (10)
Spatial and temporal alignment of multimodal human speech production data: Real time imaging, flesh point tracking and audio (2013) (10)
Toward Designing Interactive Technologies for Supporting Research in Autism Spectrum Disorders (2009) (10)
A mixture of experts approach towards intelligibility classification of pathological speech (2015) (10)
Simulation - The Design Tool for the Future (1986) (10)
Parametric hybrid source models for voiced and voiceless fricative consonants (1996) (10)
A Low-Complexity Dynamic Face-Voice Feature Fusion Approach to Multimodal Person Recognition (2009) (10)
Validating rt-MRI Based Articulatory Representations via Articulatory Recognition (2011) (10)
Towards an Unsupervised Entrainment Distance in Conversational Speech using Deep Neural Networks (2018) (10)
Robust word boundary detection in spontaneous speech using acoustic and lexical cues (2009) (10)
A Multimodal View into Music's Effect on Human Neural, Physiological, and Emotional Experience (2019) (10)
Illustrating the Production of the International Phonetic Alphabet Sounds Using Fast Real-Time Magnetic Resonance Imaging (2016) (10)
The Twins Corpus of Museum Visitor Questions (2012) (10)
Multi-Task Discriminative Training of Hybrid DNN-TVM Model for Speaker Verification with Noisy and Far-Field Speech (2019) (10)
Robust Multichannel Gender Classification from Speech in Movie Audio (2016) (10)
Estimation of vocal tract area function from volumetric Magnetic Resonance Imaging (2017) (10)
Stable articulatory tasks and their variable formation: Tamil retroflex consonants (2013) (10)
A variable frame length and rate algorithm based on the spectral kurtosis measure for speaker verification (2010) (10)
Vocal tract shaping of emotional speech (2020) (10)
Intelligibility Classification of Pathological Speech Using Fusion of Multiple Subsystems (2012) (10)
Green public spaces in the cities of South and Southeast Asia. Protecting needs towards sustainable well-being (2020) (10)
Multimodal Meeting Monitoring: Improvements on Speaker Tracking and Segmentation through a Modified Mixture Particle Filter (2007) (10)
A cluster-profile representation of emotion using agglomerative hierarchical clustering (2010) (10)
Pitch period estimation using multipulse model and wavelet transform (2007) (10)
Automated quality assessment of cognitive behavioral therapy sessions through highly contextualized language representations (2021) (10)
Improved 3D real‐time MRI of speech production (2021) (9)
EDA-gram: Designing electrodermal activity fingerprints for visualization and feature extraction (2016) (9)
Reinforcing Self-expressive Representation with Constraint Propagation for Face Clustering in Movies (2019) (9)
Affective Conditioning on Hierarchical Attention Networks Applied to Depression Detection from Transcribed Clinical Interviews (2020) (9)
Flow of Renyi information in deep neural networks (2016) (9)
Effect of spectral normalization on different talker speech recognition by cochlear implant users. (2008) (9)
Markov Chain Monte Carlo Inference of Parametric Dictionaries for Sparse Bayesian Approximations (2016) (9)
Modeling Interpersonal Linguistic Coordination in Conversations using Word Mover's Distance (2019) (9)
Comparison of Basic Beatboxing Articulations Between Expert and Novice Artists Using Real-Time Magnetic Resonance Imaging (2017) (9)
Automatic Classification of Palatal and Pharyngeal Wall Shape Categories from Speech Acoustics and Inverted Articulatory Signals (2013) (9)
Redundancy analysis of behavioral coding for couples therapy and improved estimation of behavior from noisy annotations (2015) (9)
An empirical analysis of user uncertainty in problem-solving child-machine interactions (2008) (9)
Pharyngeal constriction in English diphthong production (2013) (9)
Crossmodal learning for audio-visual speech event localization (2020) (9)
Identification of speakers in movie dialogs using audiovisual cues (2002) (9)
Opening big in box office? Trailer content can help (2016) (9)
Enriching machine-mediated speech-to-speech translation using contextual information (2013) (9)
An Affect Prediction Approach Through Depression Severity Parameter Incorporation in Neural Networks (2017) (9)
Experiments in Automatic Genre Classification of Full-length Music Tracks using Audio Activity Rate (2007) (9)
A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images (2021) (9)
Enriching Spoken Language Translation with Dialog Acts (2008) (9)
An unsupervised quantitative measure for word prominence in spontaneous speech (2005) (9)
New results in vowel production: MRI, EPG, and acoustic data (1997) (9)
Directional descriptors using Zernike moment phases for object orientation estimation in underwater sonar images (2011) (9)
Increasing coordination and responsivity of emotion-related brain regions with a heart rate variability biofeedback randomized trial (2022) (9)
Pronunciation verification of English letter-sounds in preliterate children (2008) (9)
Linguistically Aided Speaker Diarization Using Speaker Role Information (2019) (9)
ASSESSMENT OF A CHILD’S ENGAGEMENT USING SEQUENCE MODEL BASED FEATURES (2013) (9)
EmotiWord: Affective Lexicon Creation with Application to Interaction and Multimedia Data (2011) (9)
Graph-based approach for motion capture data representation and analysis (2014) (9)
Team ELISA System for DARPA LORELEI Speech Evaluation 2016 (2017) (9)
Multidimensional humming transcription using a statistical approach for query by humming systems (2003) (9)
Towards end-2-end learning for predicting behavior codes from spoken utterances in psychotherapy conversations (2020) (9)
ELISA System Description for LoReHLT 2017 (2017) (9)
An empirical analysis of information encoded in disentangled neural speaker representations (2020) (9)
Acoustic correlates of user response to error in human-computer dialogues (2003) (9)
An Empirical Study of Speech Processing in the Brain by Analyzing the Temporal Syllable Structure in Speech-input Induced EEG (2019) (9)
Creating ensemble of diverse maximum entropy models (2012) (9)
Data Driven Approach for Language Model Adaptation using Stepwise Relative Entropy Minimization (2007) (9)
Speech production and perception models and their applications to synthesis, recognition, and coding (1995) (9)
An N-gram model for unstructured audio signals toward information retrieval (2010) (9)
Attribute Inference Attack of Speech Emotion Recognition in Federated Learning Settings (2021) (9)
An analysis of articulatory-acoustic data based on articulatory strokes (2009) (9)
Analysis of emotional speech prosody in terms of part of speech tags (2007) (8)
Acoustic and Visual Cues of Turn-Taking Dynamics in Dyadic Interactions (2011) (8)
Global SNR Estimation of Speech Signals for Unknown Noise Conditions Using Noise Adapted Non-Linear Regression (2017) (8)
Unstructured Environmental Audio: Representation, Classification and Modeling (2011) (8)
Context dependent statistical augmentation of Persian transcripts (2004) (8)
Breathing Rate Complexity Features for “In-the-Wild” Stress and Anxiety Measurement (2019) (8)
A study of invariant properties and variation patterns in the converter/distributor model for emotional speech (2014) (8)
UBM fused total variability modeling for language identification (2014) (8)
Systematic variation in the articulation of the Korean liquid across prosodic positions (2015) (8)
Intermittently tagged real‐time MRI reveals internal tongue motion during speech production (2019) (8)
Prosody-enriched lattices for improved syllable recognition (2007) (8)
Multiresolution spectral conversion for multichannel audio resynthesis (2002) (8)
A combined numerical and experimental investigation of minimum quantity lubrication applied to end milling of Ti6Al4V alloy (2020) (8)
Classification of emotional content of sighs in dyadic human interactions (2012) (8)
A statistical discrimination measure for hidden Markov models based on divergence (2004) (8)
On smoothing articulatory trajectories obtained from Gaussian mixture model based acoustic-to-articulatory inversion. (2013) (8)
Multimodal systems for children: building a prototype (1999) (8)
Creating data resources for designing user-centric frontends for query by humming systems (2003) (8)
Analyzing quality of crowd-sourced speech transcriptions of noisy audio for acoustic model adaptation (2012) (8)
Enabling effective design of multimodal interfaces for speech-to-speech translation system: An empirical study of longitudinal user behaviors over time and user strategies for coping with errors (2013) (8)
Extracting Situation Frames from Non-English Speech: Evaluation Framework and Pilot Results (2017) (8)
A modular architecture for articulatory synthesis from gestural specification. (2019) (8)
Multi-Scale Speaker Diarization with Neural Affinity Score Fusion (2020) (8)
Acoustic Denoising Using Dictionary Learning With Spectral and Temporal Regularization (2018) (8)
Modeling high-level descriptions of real-life physical activities using latent topic modeling of multimodal sensor signals (2011) (8)
Training ensemble of diverse classifiers on feature subsets (2014) (8)
Improved speaker diarization of meeting speech with recurrent selection of representative speech segments and participant interaction pattern modeling (2009) (8)
Cross-lingual dialog model for speech to speech translation (2006) (8)
Multimodal Speaker Segmentation and Identification in Presence of Overlapped Speech Segments (2010) (8)
A computational lens into how music characterizes genre in film (2021) (8)
Linguistic analysis of spontaneous children speech (2008) (8)
Comparing time-frequency representations for directional derivative features (2014) (8)
Romantic partner presence and physiological responses in daily life: Attachment style as a moderator (2021) (8)
Task-dependence of articulator synergies. (2019) (8)
Localization bounds for the graph translation (2016) (8)
Multimodal Interaction Modeling of Child Forensic Interviewing (2018) (8)
Estimating Individualized Daily Self-Reported Affect with Wearable Sensors (2019) (8)
Advances in vocal tract imaging and analysis (2019) (8)
Stress and Anxiety Measurement "In-the-Wild" Using Quality-aware Multi-scale HRV Features (2019) (8)
Energy-efficient multihypothesis activity-detection for health-monitoring applications (2009) (8)
Control of response of a quarter-car vehicle model with optimal skyhook damper (2008) (8)
Investigation of Speed-Accuracy Tradeoffs in Speech Production Using Real-Time Magnetic Resonance Imaging (2016) (8)
Multi-Scale Context Adaptation for Improving Child Automatic Speech Recognition in Child-Adult Spoken Interactions (2017) (8)
Modeling mutual influence of multimodal behavior in affective dyadic interactions (2015) (8)
Bilingual audio-subtitle extraction using automatic segmentation of movie audio (2011) (8)
Using emotional noise to uncloud audio-visual emotion perceptual evaluation (2013) (8)
Improvements in predicting children's overall reading ability by modeling variability in evaluators' subjective judgments (2012) (8)
Barista: A framework for concurrent speech processing by USC-SAIL (2014) (8)
Multimodal Embeddings From Language Models for Emotion Recognition in the Wild (2021) (8)
Analyzing Temporal Dynamics of Dyadic Synchrony in Affective Interactions (2016) (8)
Computation as estimation: Estimation-theoretic IC design improves robustness and reduces power consumption (2008) (8)
The Second DIHARD Challenge: System Description for USC-SAIL Team (2019) (7)
Complexity in Prosody: A Nonlinear Dynamical Systems Approach for Dyadic Conversations; Behavior and Outcomes in Couples Therapy (2016) (7)
Local stationarity of graph signals: insights and experiments (2017) (7)
Characterizing Covert Articulation in Apraxic Speech Using real-time MRI (2012) (7)
Identifying Therapist and Client Personae for Therapeutic Alliance Estimation (2019) (7)
An Evaluation of EEG-based Metrics for Engagement Assessment of Distance Learners (2018) (7)
Multichannel audio synthesis by subband-based spectral conversion and parameter adaptation (2005) (7)
Reference marking in children's computer-directed speech: an integrated analysis of discourse and gestures (2004) (7)
Improving Security of Parallel Algorithm Using Key Encryption Technique (2013) (7)
Improving the Prediction of Therapist Behaviors in Addiction Counseling by Exploiting Class Confusions (2019) (7)
Multiple Instance Learning for Behavioral Coding (2017) (7)
Exploiting speech production information for automatic speech and speaker modeling and recognition - possibilities and new opportunities (2012) (7)
Speaker model quantization for unsupervised speaker indexing (2004) (7)
Continuous speech recognition using attention shift decoding with soft decision (2009) (7)
Modeling and automating detection of errors in Arabic language learner speech (2005) (7)
Sensing for Obesity: KNOWME Implementation and Lessons for an Architect (2009) (7)
Still together?: the role of acoustic features in predicting marital outcome (2015) (7)
Categorical understanding using statistical ngram models (1999) (7)
Towards a definition of local stationarity for graph signals (2017) (7)
Dealing with Doctors: A Virtual Human for Non-team Interaction (2005) (7)
Robust diagnostic classification via Q-learning (2021) (7)
Articulation of English vowels in running speech: A real-time MRI study (2015) (7)
Respiration Rate Estimation From Noisy Electrocardiograms Based on Modulation Spectral Analysis (2018) (7)
Enhanced standard compliant distributed speech recognition (Aurora encoder) using rate allocation (2004) (7)
From MRI and acoustic data to articulatory synthesis: a case study of the lateral approximants in American English (1996) (7)
A study of interplay between articulatory movement and prosodic characteristics in emotional speech production (2010) (7)
Optimal Allocation of Time-Resources for Multihypothesis Activity-Level Detection (2009) (7)
Engineering Innovation in Speech Science: Data and Technologies (2019) (7)
Developing an Automated Report Card for Addiction Counseling: The Counselor Observer Ratings Expert for MI (CORE-MI) (2016) (7)
Semi-supervised term-weighted value rescoring for keyword search (2014) (7)
The Promise and the Challenge of Technology-Facilitated Methods for Assessing Behavioral and Cognitive Markers of Risk for Suicide among U.S. Army National Guard Personnel (2017) (7)
Acoustic-Prosodic and Physiological Response to Stressful Interactions in Children with Autism Spectrum Disorder (2017) (7)
Toward transfer of acoustic cues of emphasis across languages (2013) (7)
Empirical link between hypothesis diversity and fusion performance in an ensemble of automatic speech recognition systems (2013) (7)
The effect of word frequency and lexical class on articulatory-acoustic coupling (2013) (7)
Use of Model Transformations for Distributed Speech Recognition (2001) (7)
Virtual Museum Guides demonstration (2010) (7)
Bluetooth Based Indoor Localization Using Triplet Embeddings (2019) (7)
Measuring convergence in language model estimation using relative entropy (2004) (7)
Bilabial Substitution Patterns During Consonant Production in a Case of Congenital Aglossia (2017) (7)
Improved HMM phone and triphone models for real-time ASR telephony applications (1996) (7)
Towards modeling user behavior in interactions mediated through an automated bidirectional speech translation system (2010) (7)
Effects of emotion on different phoneme classes (2004) (7)
Faster 3D vocal tract real-time MRI using constrained reconstruction (2013) (6)
Multimodal analysis of expressive human communication: speech and gesture interplay (2008) (6)
Subspace techniques for task-independent EEG person identification (2019) (6)
Acoustic stopwords for unstructured audio information retrieval (2010) (6)
Leveraging Linguistic Context in Dyadic Interactions to Improve Automatic Speech Recognition for Children (2020) (6)
Learning Shared Vector Representations of Lyrics and Chords in Music (2019) (6)
Efficient Rotation Invariant Retrieval of Shapes with Applications in Medical Databases (2006) (6)
Gestural coordination of Brazilian Portuguese nasal vowels in CV syllables: A real-time MRI study (2015) (6)
Feature Fusion Strategies for End-to-End Evaluation of Cognitive Behavior Therapy Sessions (2020) (6)
Designing Neural Speaker Embeddings with Meta Learning (2020) (6)
Quantifying regulation mechanisms in dating couples through a dynamical systems model of acoustic and physiological arousal (2017) (6)
Improving speaker diarization for naturalistic child-adult conversational interactions using contextual information. (2020) (6)
Automatic classification of question turns in spontaneous speech using lexical and prosodic evidence (2008) (6)
A nuclear strategy for India: national security lecture. Privatisation of logistics support facilities: national security seminar. Terrorism: the challenge to India's security: national security paper (2000) (6)
A study of emotional information present in articulatory movements estimated using acoustic-to-articulatory inversion (2012) (6)
SPEAKER VERIFICATION USING LASSO BASED SPARSE TOTAL VARIABILITY SUPERVECTOR AND PROBABILISTIC LINEAR DISCRIMINANT ANALYSIS (2011) (6)
Using naïve text queries for robust audio information retrieval (2010) (6)
Stochastic Shake-Shake Regularization for Affective Learning from Speech (2018) (6)
A real-time MRI study of articulatory setting in second language speech (2014) (6)
Robust Multimodal Person Recognition Using Low-Complexity Audio-Visual Feature Fusion Approaches (2010) (6)
Health behaviour outcomes of a family based intervention for paediatric obesity in primary care: A randomized type II hybrid effectiveness‐implementation trial (2021) (6)
Characterizing vocal tract dynamics with real-time MRI (2015) (6)
Connecting rhythm and prominence in automatic ESL pronunciation scoring (2009) (6)
User-Based Collaborative Filtering Mobile Health System (2020) (6)
Context-Aware Speech Stress Detection in Hospital Workers Using Bi-LSTM Classifiers (2021) (6)
Information Theoretic Analysis of Direct Articulatory Measurements for Phonetic Discrimination (2007) (6)
Impact of Levodopa in Lung Functions in Patients with Parkinson Disease (2020) (6)
Total Variability Layer in Deep Neural Network Embeddings for Speaker Verification (2019) (6)
Creating data resources for designing user-centric frontends for query-by-humming systems (2005) (6)
Analysis and modeling of the role of laughter in motivational interviewing based psychotherapy conversations (2015) (6)
Complexity-Regularized Tree-Structured Partition for Mutual Information Estimation (2012) (6)
Comparison of feature-level and kernel-level data fusion methods in multi-sensory fall detection (2016) (6)
Affective language model adaptation via corpus selection (2014) (6)
A Sequential Bayesian Dialog Agent for Computational Ethnography (2012) (6)
A detailed study of word-position effects on emotion expression in speech (2009) (6)
High spatio-temporal resolution multi-slice real time MRI of speech using golden angle spiral imaging with constrained reconstruction, parallel imaging, and a novel upper airway coil (2014) (6)
Natural head motion synthesis driven by acoustic prosodic features: Virtual Humans and Social Agents (2005) (6)
Investigating automatic assessment of reading comprehension in young children (2008) (6)
Automatic pronunciation verification of English letter-names for early literacy assessment of preliterate children (2009) (6)
Truncation of pharyngeal gesture in English diphthong [aɪ] (2013) (6)
Predicting Affective Dimensions Based on Self Assessed Depression Severity (2016) (6)
Report of 2017 NSF Workshop on Multimedia Challenges, Opportunities and Research Roadmaps (2019) (5)
Tensor Embedding: A Supervised Framework for Human Behavioral Data Mining and Prediction (2018) (5)
Tree grammars as models of prosodic structure (2008) (5)
Antecedents and outcomes of the knowledge management process (KMP) in Malaysian SMEs (2020) (5)
Learning Domain Invariant Representations for Child-Adult Classification from Speech (2019) (5)
On optimal signal representation for statistical learning and pattern recognition (2008) (5)
Vocal Tract Articulatory Contour Detection in Real-Time Magnetic Resonance Images Using Spatio-Temporal Context (2020) (5)
Towards optimal encoding for classification with applications to distributed speech recognition (2003) (5)
Discovering Optimal Variable-length Time Series Motifs in Large-scale Wearable Recordings of Human Bio-behavioral Signals (2019) (5)
Introduction to the Special Issue on Spontaneous Speech Processing (2004) (5)
Unifying conversational multimedia interfaces for accessing network services across communication devices (2000) (5)
Determining what Questions to Ask, with the Help of Spectral Graph Theory (2011) (5)
The SAIL speaker diarization system for analysis of spontaneous meetings (2008) (5)
Articulatory comparison of Tamil liquids and stops using real‐time magnetic resonance imaging. (2009) (5)
Combining task-dependent information with auditory attention cues for prominence detection in speech (2008) (5)
Laughter Valence Prediction in Motivational Interviewing Based on Lexical and Acoustic Cues (2016) (5)
An audio-visual approach to learning salient behaviors in couples' problem solving discussions (2013) (5)
Multi-Face: Self-supervised Multiview Adaptation for Robust Face Clustering in Videos (2020) (5)
Relation between geometry and kinematics of articulatory trajectory associated with emotional speech production (2008) (5)
Discriminating Two Types of Noise Sources using Cortical Representation and Dimension Reduction Technique (2007) (5)
Multimodal Representation of Advertisements Using Segment-level Autoencoders (2018) (5)
Deep multiple instance learning for foreground speech localization in ambient audio from wearable devices (2021) (5)
Aliasing artifact reduction in spiral real‐time MRI (2021) (5)
Efficient estimation and model generalization for the total variability model (2019) (5)
Having a Bad Day? Detecting the Impact of Atypical Life Events Using Wearable Sensors (2020) (5)
Integration and Automation of Data Preparation and Data Mining (2014) (5)
State-of-the-art MRI Protocol for Comprehensive Assessment of Vocal Tract Structure and Function (2016) (5)
Toward body language generation in dyadic interaction settings from interlocutor multimodal cues (2013) (5)
Modeling head motion entrainment for prediction of couples' behavioral characteristics (2015) (5)
A split lexicon approach for improved recognition of spoken names (2006) (5)
Overlapped speech detection using long-term spectro-temporal similarity in stereo recording (2011) (5)
A dictionary approach to repetitive pattern finding in music (2001) (5)
Multiview Shared Subspace Learning Across Speakers and Speech Commands (2019) (5)
Liquids in Tamil (1996) (5)
Spectro-temporal directional derivative features for automatic speech recognition (2013) (5)
Context-driven automatic bilingual movie subtitle alignment (2009) (5)
Fusing Annotations with Majority Vote Triplet Embeddings (2018) (5)
High-quality bilingual subtitle document alignments with application to spontaneous speech translation (2013) (5)
Affect prediction in music using boosted ensemble of filters (2015) (5)
Loss Function Approaches for Multi-label Music Tagging (2021) (5)
Emotion and mental state recognition from speech (2012) (5)
Bathymetric Influences on Antarctic Ice‐Shelf Melt Rates (2020) (5)
Unsupervised speaker diarization using Riemannian manifold clustering (2014) (4)
Online rate adjustment for adaptive random access compressed sensing of time-varying fields (2016) (4)
Discovering Latent Psychological Structures from Self-Report Assessments of Hospital Workers (2018) (4)
On signal representations within the Bayes decision framework (2012) (4)
How beatboxers produce percussion sounds: A real-time magnetic resonance imaging investigation (2018) (4)
AN ENGLISH-PERSIAN AUTOMATIC SPEECH TRANSLATOR: RECENT DEVELOPMENTS IN DOMAIN PORTABILITY AND USER MODELING (2006) (4)
A novel algorithm for unsupervised prosodic language model adaptation (2008) (4)
Shaking Acoustic Spectral Sub-Bands can Better Regularize Learning in Affective Computing (2018) (4)
Syllable structure effects on velum‐oral coordination evaluated with real‐time MRI (2006) (4)
Multimodal Embeddings from Language Models (2019) (4)
Music indexing with extracted main melody by using modified Lempel-Ziv algorithm (2001) (4)
Using model trees for evaluating dialog error conditions based on acoustic information (2006) (4)
Evidence of Task-Independent Person-Specific Signatures in EEG Using Subspace Techniques (2020) (4)
Selection of Emotionally Salient Audio-Visual Features for Modeling Human Evaluations of Synthetic Character Emotion Displays (2008) (4)
A dictionary based approach for robust and syllable-independent audio input transcription for query by humming systems (2006) (4)
Recognizing child's emotional state in problem-solving child-machine interactions (2009) (4)
Enhancing the brain's emotion regulation capacity with a randomised trial of a 5-week heart rate variability biofeedback intervention (2021) (4)
Robust speech recognition over packet networks: an overview (2004) (4)
CNMF-based acoustic features for noise-robust ASR (2016) (4)
A JOINT ACOUSTIC-ARTICULATORY STUDY OF NASAL SPECTRAL REDUCTION IN READ VERSUS SPONTANEOUS SPEAKING STYLES (2010) (4)
Text To Speech Synthesis (2006) (4)
Robust representations for out-of-domain emotions using Emotion Profiles (2010) (4)
An information-theoretic analysis of developmental changes in speech (2003) (4)
Identifying Truthful Language in Child Interviews (2020) (4)
Joint training of interpolated exponential n-gram models (2013) (4)
Early auditory processing inspired features for robust automatic speech recognition (2007) (4)
Overview of some theoretical and experimental results on modeling and control of shear flows (2000) (4)
Emotional speech resynthesis (2008) (4)
Supervised acoustic topic model with a consequent classifier for unstructured audio classification (2012) (4)
A Resilient Self-Organizing Offshore Communication Network for Fishermen (2017) (4)
Role Specific Lattice Rescoring for Speaker Role Recognition from Speech Recognition Outputs (2019) (4)
Fusion of diverse denoising systems for robust automatic speech recognition (2014) (4)
An analysis of observation length requirements for machine understanding of human behaviors from spoken language (2019) (4)
ROBUST RECOGNITION AND ASSESSMENT OF NON-NATIVE SPEECH VARIABILITY (2006) (4)
Robust voice activity detection in stereo recording with crosstalk (2010) (4)
Workspace Analysis and Optimization of 3-RRR Planar Parallel Manipulator (2015) (4)
Estimation of the movement trajectories of non-crucial articulators based on the detection of crucial moments and physiological constraints (2014) (4)
A robust frontend for ASR: Combining denoising, noise masking and feature normalization (2013) (4)
A study of semi-supervised speaker diarization system using GAN mixture model (2019) (4)
Composite-DBN for recognition of environmental contexts (2012) (4)
Classification of Pathological Speech Using Fusion of Multiple Subsystems (2012) (4)
Improved Depiction of Tissue Boundaries in Vocal Tract Real-Time MRI Using Automatic Off-Resonance Correction (2016) (4)
Lightly-supervised utterance-level emotion identification using latent topic modeling of multimodal words (2016) (4)
Modeling Behavior as Mutual Dependency between Physiological Signals and Indoor Location in Large-Scale Wearable Sensor Study (2020) (4)
Predicting Human-Reported Enjoyment Responses in Happy and Sad Music (2019) (4)
Novel affective features for multiscale prediction of emotion in music (2016) (4)
Multimodal Speaker Segmentation in Presence of Overlapped Speech Segments (2008) (4)
An approach to real‐time magnetic resonance imaging for speech production (2003) (4)
Unsupervised data processing for classifier-based speech translator (2013) (4)
Automatically rating pronunciation through articulatory phonology (2009) (4)
Affect Estimation with Wearable Sensors (2020) (4)
Enhancing Privacy Through Domain Adaptive Noise Injection For Speech Emotion Recognition (2022) (4)
Vertical larynx actions and larynx-oral timing in ejectives and implosives (2019) (4)
Leveraging Label Correlations in a Multi-label Setting: A Case Study in Emotion (2022) (4)
EMO20Q Questioner Agent (2011) (4)
Computational Modeling of Conversational Humor in Psychotherapy (2018) (4)
Modeling Emotion Expression and Perception Behavior in Auditive Emotion Evaluation (2006) (4)
Design of Vortex Generators for Light Transport Vehicles (LTVs) Using CFD (2013) (4)
Learning Behavioral Representations from Wearable Sensors (2019) (4)
Using measures of vocal entrainment to inform outcome-related behaviors in marital conflicts (2012) (4)
A Socratic epistemology for verbal emotional intelligence (2016) (4)
Automatic analysis of constriction location in singleton and geminate consonant articulation using real-time magnetic resonance imaging (2011) (4)
An MRI study of fricative consonants (1994) (4)
Estimation of articulatory gesture patterns from speech acoustics (2009) (4)
Bark Frequency Transform Using an Arbitrary Order Allpass Filter (2010) (4)
Cross modal video representations for weakly supervised active speaker localization (2020) (4)
Chapter 15 Behavioral signal processing and autism: Learning from multimodal behavioral signals (2016) (3)
Histogram-based estimation for the divergence revisited (2009) (3)
A Transcription Scheme for Languages Employing the Arabic Script Motivated by Speech Processing Applications (2004) (3)
Analysis of children’s speech. Pitch and formant frequency (1997) (3)
VCV Synthesis Using Task Dynamics to Animate a Factor-Based Articulatory Model (2017) (3)
Measuring Conversational Productivity in Child Forensic Interviews (2018) (3)
A knowledge transfer and boosting approach to the prediction of affect in movies (2017) (3)
Analysis of acoustic correlates in emotional speech (2004) (3)
VAuLT: Augmenting the Vision-and-Language Transformer with the Propagation of Deep Language Representations (2022) (3)
Fifty Shades of Green: Towards a Robust Measure of Inter-annotator Agreement for Continuous Signals (2020) (3)
Incorporating discourse context in spoken language translation through dialog acts (2008) (3)
Velum-oral timing and its variability in Korean nasal consonants (2020) (3)
A Distribution Free Formulation of the Total Variability Model (2017) (3)
Complexity of vocal tract shaping in glossectomy patients and typical speakers: A principal component analysis. (2021) (3)
Analyzing the interplay between spoken language and gestural cues in conversational child-machine interactions in pre/early literate age groups (2004) (3)
Semi-FedSER: Semi-supervised Learning for Speech Emotion Recognition On Federated Learning using Multiview Pseudo-Labeling (2022) (3)
Causal Indicators for Assessing the Truthfulness of Child Speech in Forensic Interviews (2021) (3)
Automatically assessing the ABCs: Verification of children's spoken letter-names and letter-sounds (2011) (3)
A robust harmony structure modeling scheme for classical music opus identification (2009) (3)
FedAudio: A Federated Learning Benchmark for Audio Tasks (2022) (3)
Temporal analysis of articulatory speech errors using direct image analysis of real time magnetic resonance imaging. (2010) (3)
Using real time magnetic resonance imaging to measure changes in articulatory behavior due to partial glossectomy (2017) (3)
Interspeaker variability in relative tongue size and vowel production (2013) (3)
Perceptual Lateralization of Coda Rhotic Production in Puerto Rican Spanish (2016) (3)
Multimodal detection of salient behaviors of approach-avoidance in dyadic interactions (2012) (3)
Quantifying labial, palatal, and pharyngeal contributions to third formant lowering in American English /ɹ/ (2017) (3)
Using Active Speaker Faces for Diarization in TV shows (2022) (3)
Energy-constrained minimum variance response filter for robust vowel spectral estimation (2014) (3)
Analyzing the structure of parent-moderated narratives from children with ASD using an entity-based approach (2013) (3)
Perceptual-based deep-learning denoiser as a defense against adversarial attacks on ASR systems (2021) (3)
Semi-automatic modeling of tongue surfaces using volumetric structural MRI (2011) (3)
Bringing in the Outliers: A Sparse Subspace Clustering Approach to Learn a Dictionary of Mouse Ultrasonic Vocalizations (2020) (3)
A knowledge-driven framework for ECG representation and interpretation for wearable applications (2017) (3)
Hull detection based on largest empty sector angle with application to analysis of realtime MR images (2014) (3)
Knowledge and Attitudes Toward an Artificial Intelligence-Based Fidelity Measurement in Community Cognitive Behavioral Therapy Supervision (2021) (3)
Detection of Non-Native Named Entities Using Prosodic Features for Improved Speech Recognition and Translation (2006) (3)
The ELISA Situation Frame extraction for low resource languages pipeline for LoReHLT’2016 (2018) (3)
Speech Synthesis Systems in Ambient Intelligence Environments (2010) (3)
Characterizing post-glossectomy speech using real-time magnetic resonance imaging (2013) (3)
DeepPurple: Lexical, String and Affective Feature Fusion for Sentence-Level Semantic Similarity Estimation (2013) (3)
Transfer Learning Between Concepts for Human Behavior Modeling: An Application to Sincerity and Deception Prediction (2017) (3)
Non-Iterative Parameter Estimation for Total Variability Model Using Randomized Singular Value Decomposition (2016) (3)
Modeling Behavioral Consistency in Large-Scale Wearable Recordings of Human Bio-Behavioral Signals (2020) (3)
A Young Patient with Stroke and Primary Tuberculosis (2018) (3)
A Knowledge Driven Structural Segmentation Approach for Play-Talk Classification During Autism Assessment (2018) (3)
Imaging applications in speech production research (1996) (3)
Participatory methods to support team science development for predictive analytics in health (2018) (3)
Robust Character Labeling in Movie Videos: Data Resources and Self-Supervised Feature Adaptation (2020) (3)
Audio visual character profiles for detecting background characters in entertainment media (2022) (3)
L2 Acquisition and Production of the English Rhotic Pharyngeal Gesture (2016) (3)
Modeling Vocal Entrainment in Conversational Speech Using Deep Unsupervised Learning (2020) (3)
Weighted geodesic flow kernel for interpersonal mutual influence modeling and emotion recognition in dyadic interactions (2017) (3)
RNN Based Incremental Online Spoken Language Understanding (2021) (3)
A generative model for scoring children's reading comprehension (2008) (3)
Automated Empathy Detection for Oncology Encounters (2020) (2)
Signature cluster model selection for incremental Gaussian mixture cluster modeling in agglomerative hierarchical speaker clustering (2009) (2)
Speaker verification using Lasso based sparse total variability supervector with PLDA modeling (2012) (2)
Features for comparing tune similarity of songs across different languages (2012) (2)
Generalized Ambiguity Decomposition for Understanding Ensemble Diversity (2013) (2)
Articulatory compensation strategies employed by an aglossic speaker (2017) (2)
A Label Proportions Estimation Technique for Adversarial Domain Adaptation in Text Classification (2020) (2)
On distinguishing articulatory configurations and articulatory tasks: Tamil retroflex consonants (2013) (2)
Noise Aware and Combined Noise Models for Speech Denoising in Unknown Noise Conditions (2016) (2)
Root-Word Analysis of Turkish Emotional Language (2012) (2)
Denoising and Raw-waveform Networks for Weakly-Supervised Gender Identification on Noisy Speech (2018) (2)
Continuous models of affect from text using n-grams (2013) (2)
Towards unsupervised training of the classifier-based speech translator (2008) (2)
Predicting children's reading ability using evaluator-informed features (2009) (2)
Emotion to emotion speech conversion in phoneme level (2004) (2)
Incremental Online Spoken Language Understanding (2019) (2)
Automatic Estimation of Perceived Sincerity from Spoken Language (2016) (2)
Automatic identification of stable modes and fluctuations in a repetitive task using real-time MRI (2007) (2)
Audiovisual-based adaptive speaker identification (2003) (2)
Improved real‐time tagged MRI using REALTAG (2019) (2)
Can Transcranial Direct Current Stimulation Over the Dorsolateral Prefrontal Cortex Enhance Proprioception? (2019) (2)
Airfoil thickness effects on flow and acoustic characteristics (2021) (2)
Experimental assessment of the tongue incompressibility hypothesis during speech production (2015) (2)
Optimal time-resource allocation for activity-detection via multimodal sensing (2009) (2)
Leveraging Social Networks for the Assessment and Management of Neurological Patients (2022) (2)
A Dynamic Programming Algorithm for Computing N-gram Posteriors from Lattices (2015) (2)
A spoken dialogue system for conference/workshop services (2000) (2)
Investigation of the inter‐articulator correlation in acoustic‐to‐articulatory inversion using generalized smoothness criterion. (2010) (2)
Exploring the Relationship between Conic Affinity of NMF Dictionaries and Speech Enhancement Metrics (2018) (2)
Group-specific models of healthcare workers’ well-being using iterative participant clustering (2020) (2)
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics (2009) (2)
Learning multiple concepts with incremental diverse density (2014) (2)
Comparison of dictionary-based approaches to automatic repeating melody extraction (2001) (2)
Automatic acoustic synthesis of human-like laughter (2017) (2)
A Perplexity Based Cover Song Matching System for Short Length Queries (2011) (2)
On the road to Autonomous Maritime Transport: A conceptual framework to meet training needs for future ship operations (2022) (2)
Developmental aspects of American English diphthong trajectories in the formant space (2013) (2)
Automating Detection of Papilledema in Pediatric Fundus Images with Explainable Machine Learning (2022) (2)
Cross-Modal Coordination of Face-Directed Gaze and Emotional Speech Production in School-Aged Children and Adolescents with ASD (2019) (2)
Information theoretic acoustic feature selection for acoustic-to-articulatory inversion (2013) (2)
Characterizing dynamically varying acoustic scenes from egocentric audio recordings in workplace setting (2019) (2)
Motor control primitives arising from a learned dynamical systems model of speech articulation (2014) (2)
An attribute-based approach to audio description applied to segmenting vocal sections in popular music songs (2006) (2)
Local dynamic mode of Cognitive Behavioral Therapy (2022) (2)
Mitigation of Data Sparsity in Classifier-Based Translation (2008) (2)
Introduction to the special issue on speech and language processing of children's speech for child-machine interaction applications (2011) (2)
Efficient scalable encoding for distributed speech recognition (2006) (2)
On instantaneous vocal tract length estimation from formant frequencies (2013) (2)
Strange attractors and chaotic dynamics in the production of voiced and voiceless fricatives (1993) (2)
Key factors impacting women seafarers’ participation in the evolving workplace: A qualitative exploration (2023) (2)
Quid Pro Quo Nature of Leadership Trust Formation – A Monadic Study from the Subordinate’s Perspective (2012) (2)
Prediction of Psychological Flexibility with multi-scale Heart Rate Variability and Breathing Features in an “in-the-wild” Setting (2019) (2)
Using Oliver API for emotion-aware movie content characterization (2019) (2)
Automatic movie index generation based on multimodal information (2001) (2)
Providing the ARCHER community with adjoint modelling tools for high-performance oceanographic and cryospheric computation (2016) (2)
A study of intra-speaker and inter-speaker affective variability using electroglottograph and inverse filtered glottal waveforms (2010) (2)
Role Annotated Speech Recognition for Conversational Interactions (2018) (2)
A discriminative reliability-aware classification model with applications to intelligibility classification in pathological speech (2015) (2)
A system for the 2019 Sentiment, Emotion and Cognitive State Task of DARPA's LORELEI project (2019) (2)
Handling real-time scheduling exceptions using decision support systems (2003) (2)
Velum Control for Oral Sounds (2016) (2)
Behavior Gated Language Models (2019) (2)
Trapezoidal Segmented Regression: A Novel Continuous-scale Real-time Annotation Approximation Algorithm (2019) (2)
User-Level Differential Privacy against Attribute Inference Attack of Speech Emotion Recognition in Federated Learning (2022) (2)
An Automated Quality Evaluation Framework of Psychotherapy Conversations with Local Quality Estimates (2021) (2)
ATQAM/MAST'20: Joint Workshop on Aesthetic and Technical Quality Assessment of Multimedia and Media Analytics for Societal Trends (2020) (2)
Acted vs. Improvised: Domain Adaptation for Elicitation Approaches in Audio-Visual Emotion Recognition (2021) (2)
EDA-gram: designing electrodermal activity fingerprints for visualization and feature extraction. (2016) (2)
A study of bias mitigation strategies for speaker recognition (2022) (2)
Developing Neural Representations for Robust Child-Adult Diarization (2021) (2)
An articulatory analysis of phonological transfer using real-time MRI (2009) (1)
Imaging and quantification of glottal kinematics with ultrasound during speech (2011) (1)
Variability in individual constriction contributions to third formant values in American English /ɹ/. (2020) (1)
Combining window predictions efficiently - A new imputation approach for noise robust automatic speech recognition (2013) (1)
SARCASM RECOGNITION FOR SPOKEN DIALOGUE SYSTEMS (2021) (1)
Audiovisual-based adaptive speaker identification (2003) (1)
A personal visual comfort model: predict individual’s visual comfort using occupant eye pupil size and machine learning (2019) (1)
Data-Driven Unsupervised Adaptation of Acoustic-Prosodic Models (2008) (1)
Fatigue-related medical conditions affecting seafarers: an exploratory case-study of Indian seafarers (2017) (1)
Studying Clicks Using Real-Time MRI (2020) (1)
Resonance tuning in soprano singing and vocal tract shaping: Comparison of sung and spoken vowels (2006) (1)
Computational Audio Analysis (2014) (1)
Variable Span disfluency detection in ASR transcripts (2014) (1)
Enhancing audio source separability using spectro-temporal regularization with NMF (2014) (1)
Nonsyndromic multiple dentigerous cyst: A rare clinical presentation (2016) (1)
Understanding of Emotion Perception from Art (2021) (1)
Improving Semi-Supervised Classification for Low-Resource Speech Interaction Applications (2018) (1)
Interpersonal synchrony across vocal and lexical modalities in interactions involving children with autism spectrum disorder (2022) (1)
How an aglossic speaker produces an alveolar-like percept without a functional tongue tip. (2020) (1)
Joint Multi-Dimensional Model for Global and Time-Series Annotations (2020) (1)
The Role of Annotation Fusion Methods in the Study of Human-Reported Emotion Experience During Music Listening (2020) (1)
Tracking larynx movement in real-time MRI data (2017) (1)
Variation in compensatory strategies as a function of target constriction degree in post-glossectomy speech (2022) (1)
Probing the relationship between qualitative and quantitative performance measures for voice-enabled telecommunication services (1998) (1)
Hierarchical classification for speech-to-speech translation (2010) (1)
Enriching the understanding of glottalic consonant production: Vertical larynx movement in Hausa ejectives and implosives (2018) (1)
Some articulatory details of emotional speech (2005) (1)
Using Shared Vector Representations of Words and Chords in Music for Genre Classification (2019) (1)
Vocal tract contour analysis of emotional speech by the functional data curve representation (2010) (1)
Automatic Recognition of Emotions from the Acoustic Speech Signal (2003) (1)
Normalization Before Shaking Toward Learning Symmetrically Distributed Representation Without Margin in Speech Emotion Recognition (2018) (1)
Detailed study of articulatory kinematics of critical articulators and dependent articulators of emotional speech (2011) (1)
Leadership Styles and Knowledge Management Strategy in Malaysian SMEs (2020) (1)
Visualization of Vocal Tract Shape Using Interleaved Real-Time MRI of Multiple Scan Planes (2011) (1)
Generating Labels for Regression of Subjective Constructs using Triplet Embeddings (2019) (1)
Tracking developmental changes in articulatory strategy during childhood (2017) (1)
Auditory-like filterbank: An optimal speech processor for efficient human speech communication (2011) (1)
Phone Duration Modeling for Speaker Age Estimation in Children (2021) (1)
Statistical estimation of speech kinematics from real-time MRI data (2011) (1)
Novel filler acoustic models for connected digit recognition (1997) (1)
An analysis of observation length requirements in spoken language for machine understanding of human behaviors (2019) (1)
Privacy and Utility Preserving Data Transformation for Speech Emotion Recognition (2021) (1)
An Analysis of Range Difference Based Target Localization in Uniformly Distributed Sensor Field (2005) (1)
Editorial Emotion and Mental State Recognition from Speech (2011) (1)
COTS Integrations: Effort Estimation Best Practices (2010) (1)
Investigating Group-Specific Models of Hospital Workers' Well-Being: Implications for Algorithmic Bias (2020) (1)
An articulatory study of lexicalized and epenthetic schwa using real time magnetic resonance imaging. (2009) (1)
Attention-gated convolutional neural networks for off-resonance correction of spiral real-time MRI (2021) (1)
Unsupervised active speaker detection in media content using cross-modal information (2022) (1)
USC-TIMIT: A database of multimodal speech production data (2013) (1)
Analysis of Inter-Articulator Correlation in Acoustic-to-Articulatory Inversion Using Generalized Smoothness Criterion (2011) (1)
Towards parameter-free classification of sound effects in movies (2005) (1)
Towards Dynamic 3D MRI of Speech (2011) (1)
A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images (2021) (1)
Using Emotion Embeddings to Transfer Knowledge Between Emotions, Languages, and Annotation Formats (2022) (1)
EMO20Q Questioner Agent (2011) (1)
Lattice-based lexical cues for word fragment detection in conversational speech (2009) (1)
Morbidity, mortality, and emerging drug resistance in Device-associated infections (DAIs) in intensive care patients at a 1000-bedded tertiary care teaching hospital. (2021) (1)
An analysis‐by‐synthesis approach to modeling real‐time MRI articulatory data using the task dynamic application framework. (2009) (1)
The USC CreativeIT database of multimodal dyadic interactions: from speech and full body motion capture to continuous emotional annotations (2015) (1)
A developmental acoustic characterization of English diphthongs (2004) (1)
Gaussian Mixture Model Based Methods for Virtual Microphone Signal Synthesis (2002) (1)
Multimodal neuroimaging data from a 5-week heart rate variability biofeedback randomized clinical trial (2022) (1)
Victim or Perpetrator? Analysis of Violent Characters Portrayals from Movie Scripts (2020) (1)
Modeling Human Movement Behavior Among Nursing Profession (2020) (1)
Database management and analysis for spoken dialog systems: methodology and tools (1997) (1)
On the computation of document frequency statistics from spoken corpora using factor automata (2013) (1)
TILES-2019: A longitudinal physiologic and behavioral data set of medical residents in an intensive care unit (2022) (1)
Mitigating the Bias of Heterogeneous Human Behavior in Affective Computing (2021) (1)
Derivation of Fitts' law from the Task Dynamics model of speech production (2020) (1)
Developmental acoustic study of American English diphthongs (2014) (1)
Experimental evaluation of the constant tongue volume hypothesis (2014) (1)
Unsupervised Speech Representation Learning for Behavior Modeling using Triplet Enhanced Contextualized Networks (2021) (1)
Learning a speech manifold for signal subspace speech denoising (2015) (1)
Smooth Gmm Based Multi-Talker Spectral Conversion for Spectrally Degraded Speech (2006) (1)
MovieCLIP: Visual Scene Recognition in Movies (2022) (1)
An analysis of the relationship between signal-derived vocal arousal score and human emotion production and perception (2015) (1)
Speed Accuracy Tradeoffs in Speech Production (2017) (1)
User modeling in a speech translation driven mediated interaction setting (2006) (1)
Speech and language processing for mental health research and care (2016) (1)
On data-driven histogram-based estimation for mutual information (2010) (1)
Sensitivity of Quantitative RT-MRI Metrics of Vocal Tract Dynamics to Image Reconstruction Settings (2016) (1)
A computational framework for exploring the role of speech production in speech processing from a communication system perspective (2011) (1)
A Computational Tool to Study Vocal Participation of Women in UN-ITU Meetings (2021) (1)
Design and Control of Vehicle Trailer with Onboard Power Supply (2015) (1)
Gestural coordination of the velum in singing can be different from coordination in speech (2014) (1)
Analyzing eye-voice coordination in rapid automatized naming (2013) (1)
Intra-topic latency as an automated behavioral marker of treatment response in autism spectrum disorder (2022) (1)
Dynamical Systems Modeling of Acoustic and Physiological Arousal in Young Couples (2016) (1)
Motor control primitives arising from a dynamical systems model of vocal tract articulation (2013) (1)
Predicting Affect in Music Using Regression Methods on Low Level Features (2015) (1)
A comparative cross-linguistic study of vocal tract shaping in sibilant fricatives in English, Serbian and Mandarin using real-time magnetic resonance imaging (2013) (1)
Accelerating Real-time MRI of speech using spiral through-time GRAPPA (2016) (1)
Analysis and synthesis of laughter (2004) (1)
How Are You Doing ? How Are You Doing ? a (2019) (1)
The 2022 Far-field Speaker Verification Challenge: Exploring domain mismatch and semi-supervised learning under the far-field scenario (2022) (1)
Unsupervised HMM adaptation based on speech-silence discrimination (1997) (1)
Statistical analysis of constriction task and articulatory posture variables during speech and pausing intervals using real-time magnetic resonance imaging (2011) (1)
Acoustic Analysis of Preschoo (2003) (1)
Relations between prominence and articulatory-prosodic cues in emotional speech (2016) (1)
Articulatory settings facilitate mechanically advantageous motor control of vocal tract articulators (2013) (1)
Language Aided Speaker Diarization Using Speaker Role Information (2019) (1)
Mel Frequency Spectral Domain Defenses against Adversarial Attacks on Speech Recognition Systems (2022) (1)
ASSOCIATION OF CORONAVIRUS DISEASE 2019 (COVID-19) AND STROKES IN YOUNG AND MIDDLE-AGED ADULTS (2021) (0)
Relationship satisfaction, feelings of closeness and annoyance, and linkage in electrodermal activity. (2023) (0)
ACTING-OUT AND WORKING-THROUGH MODELS OF TRAUMA IN THE LIFE OF ELIE WIESEL (2020) (0)
Development of a parametric basis for vocal tract area function representation from a large speech production database (2014) (0)
Novel imaging tools for supporting the teaching of singing and spoken performance (2017) (0)
Robust unsupervised extraction of vocal tract variables from midsagittal real‐time magnetic resonance image sequences using region segmentation (2007) (0)
It's not what you said, it's how you said it: An analysis of therapist vocal features during psychotherapy. (2021) (0)
Web-based monitoring, logging and reporting tools for multi-service multi-modal systems (2000) (0)
An examination of the articulatory characteristics of prominence in function and content words using real-time magnetic resonance imaging (2013) (0)
Pitch Contour Stylization Using an Optimal (2009) (0)
Supplement for “Rate my therapist”: Automated detection of empathy in drug and alcohol counseling via speech and language processing (2015) (0)
Towards natural child-computer interaction: recognizing spoken communicative styles (2006) (0)
Acoustic frame selection for acoustic‐to‐articulatory inversion. (2010) (0)
Applying Machine Learning to Facilitate Autism Diagnostics: Pitfalls and Promises (2014) (0)
Automatic Analysis of Asymmetry in Facial Paralysis Patients Using Landmark-Based Measures. (2022) (0)
A divide-and-conquer approach to Latent Perceptual Indexing of audio for large Web 2.0 applications (2009) (0)
Recognition and characterization of unstructured environmental sounds (2011) (0)
Data Driven Approach for Language Model Adaptation Using Stepwise Relative Entropy Minimization (2007) (0)
Representation of professions in entertainment media: Insights into frequency and sentiment trends through computational text analysis (2021) (0)
Trapezoidal Segment Sequencing: A Novel Approach for Fusion of Human-Produced Continuous Annotations (2020) (0)
Next-Generation Image and Sound Processing Strategies: Exploiting the Biological Model (2007) (0)
Amount of Information Presented in a Complex List: Effects on User Performance (2001) (0)
Rapid three‐dimensional magnetic resonance imaging of vocal tract shaping using compressed sensing. (2009) (0)
YouTube and COVID-19 vaccines: A mini scoping review. (2023) (0)
SAIL-GRS: Grammar Induction for Spoken Dialogue Systems using CF-IRF Rule Similarity (2014) (0)
Enhancing the quality of cognitive behavioral therapy in community mental health through artificial intelligence generated fidelity feedback (Project AFFECT): a study protocol (2022) (0)
Multitask Learning for Darpa Lorelei’s Situation Frame Extraction Task (2020) (0)
Enhancements to the Training Process of Classifier-Based Speech Translator via Topic Modeling (2011) (0)
Letter sound and letter name recognition for automated literacy assessment of young children (2008) (0)
Spoken name pronunciation evaluation (2004) (0)
Behavioral signal processing: computational approaches for modeling and quantifying interaction dynamics in dyadic human interactions (2012) (0)
A Multimodal Approach to Understanding Human Vocal Expressions and Beyond (2018) (0)
Effects of multilingualism on cognition among older Indian adults in the nationally representative LASI‐DAD study (2022) (0)
Conversational correlates of rapid social judgments of children and adolescents with and without ASD (2020) (0)
Separability Using Spectro-Temporal Regularization with NMF (2014) (0)
APPROXIMANTS IN AMERICAN ENGLISH (2019) (0)
Retrieving Social Images using Relevance Filtering and Diverse Selection (2015) (0)
VAuLT: Augmenting the Vision-and-Language Transformer for Sentiment Classification on Social Media (2022) (0)
Ensemble of Gaussian mixture localized neural networks with application to phone recognition (2015) (0)
POS-559 EVALUATION OF MINERAL AND BONE DISORDERS IN PATIENTS WHO HAVE SURVIVED FOR MORE THAN 2 YEARS ON MAINTENANCE HEMODIALYSIS IN INDIA (2021) (0)
Emotion Recognition (2020) (0)
Pseudo golden-ratio spiral imaging with gradient acoustic noise cancellation: application to real-time MRI of fluent speech (2011) (0)
Speaker verification based on the fusion of speech acoustics and inverted articulatory signals (2015) (0)
Erratum to: Expression of GroES TB antigen in tobacco and potato (2017) (0)
Optimal Wavelet Packets Decomposition Based on a Rate-Distortion Optimality Criterion (2007) (0)
Multipulse articulatory modeling in the Wisconsin x‐ray microbeam speech production database. (2011) (0)
Content-based Representations, Indexing and Retrieval of Music (2003) (0)
Asymmetric kinematic changes in speaking rate explored with FDA (2003) (0)
Speech / pause distinction means unguided adaptation of Hidden Markov Models (1998) (0)
Human-centered Multimodal Machine Intelligence (2020) (0)
On Role and Location of Normalization before Model-based Data Augmentation in Residual Blocks for Classification Tasks (2019) (0)
Sentence level estimation of psycholinguistic norms using joint multidimensional annotations (2020) (0)
AN INNOVATIVE APPROACH FOR A COST EFFECTIVE SOLUTION FOR R.C.C. CULVERT (1992) (0)
Combining Acoustic , Lexical , and Syn Unsupervised Prosod Sankaranarayanan Ananthakrishna Speech Analysis and Interpre (2006) (0)
SYSTEMS FOR CHILDREN (1997) (0)
Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems? (2022) (0)
An improved cluster model selection method for agglomerative hierarchical speaker clustering using incremental Gaussian mixture models (2010) (0)
Studying Large-Scale Behavioral Differences in Auschwitz-Birkenau with Simulation of Gendered Narratives (2022) (0)
On the Nature of Data-driven Primitive Representations of Speech Articulation (2013) (0)
Dynamical systems modeling of day-to-day signal-based patterns of emotional self-regulation and stress spillover in highly-demanding health professions (2020) (0)
Differential expression of carotenoid biosynthetic pathway genes in two contrasting tomato genotypes for lycopene content (2016) (0)
Planning and Execution in Soprano Singing and Speaking Behavior: an Acoustic/Articulatory Study Using Real-Time MRI (2010) (0)
A Study of the Effectiveness of Articulatory Strokes for Phonemic Recognition (2011) (0)
An Approach towards Sustainable Energy Buildings (2017) (0)
Analyzing Short Term Dynamic Speech Features for Understanding Behavioral Traits of Children with Autism Spectrum Disorder (2021) (0)
Person-organisation fit, employee voice, and knowledge productivity: the moderating role of perceived voice opportunity (2022) (0)
Multilayer vectorization to develop a deeper image feature learning model (2022) (0)
Strategies for Disseminating Information on Biomedical Research on Autism to Hispanic Parents (2015) (0)
Improved real-time MRI of oral-velar coordination using a golden-ratio spiral view order (2010) (0)
Speechlinks: Robust Cross-Lingual Tactical Communication Aids (2008) (0)
A Study of Emotional Articulation in the Framework of the Converter/distributor Model (2014) (0)
Affect Estimation with Wearable Sensors (2020) (0)
Modeling the subglottal space for American English /r/ (1998) (0)
A dataset for Audio-Visual Sound Event Detection in Movies (2023) (0)
Contents Vol. 71, 2014 (2015) (0)
You never know what you are going to get: Large-scale assessment of therapists' supportive counseling skill use. (2022) (0)
A Review of Speech-centric Trustworthy Machine Learning: Privacy, Safety, and Fairness (2022) (0)
Generalized Multiview Shared Subspace Learning Using View Bootstrapping (2020) (0)
Optimized Wavelet Packet decomposition based on Minimum Probability of Error Signal Representation (2008) (0)
Audio and ASR-based Filled Pause Detection (2022) (0)
Capturing the Structure of Electrodermal Activity with Deep Neural Networks (2016) (0)
Annotation and Evaluation of Coreference Resolution in Screenplays (2021) (0)
Just (all) the facts, ma'am (2001) (0)
A study of meta-linguistic features in spontaneous speech processing (2006) (0)
The ELISA Situation Frame extraction for low resource languages pipeline for LoReHLT’2016 (2017) (0)
Exploiting Intra-Annotator Rating Consistency Through Copeland's Method for Estimation of Ground Truth Labels in Couples' Therapy (2017) (0)
Knowledge and Attitudes toward an Artificial Intelligence-Based Fidelity Measurement in Cognitive Behavioral Therapy Supervision (2020) (0)
Designing and Evaluating Speech Emotion Recognition Systems: A reality check case study with IEMOCAP (2023) (0)
On the Role of Visual Context in Enriching Music Representations (2022) (0)
Affect Tracking with Multimodal Kalman Filters (2016) (0)
Why Not Thoracic Epidural Anesthesia in Modified Radical Mastectomy?! A Case Report (2022) (0)
Articulatory coordination in Nama click consonants (2014) (0)
Web-Karma: v2.030 Patch Release for Glue Column Command Bug Fix (2014) (0)
Leveraging Open Data and Task Augmentation to Automated Behavioral Coding of Psychotherapy Conversations in Low-Resource Scenarios (2022) (0)
Deep multiple instance learning for foreground speech localization in ambient audio from wearable devices (2021) (0)
Contextually-rich human affect perception using multimodal scene information (2023) (0)
AN APPROACH TOWARD UNDERSTANDING THE INVARIANT AND VARIANT ASPECTS OF SPEECH PRODUCTION USING LOW-RANK – SPARSE MATRIX DECOMPOSITIONS (2010) (0)
Real‐time MRI tracking of articulation during grammatical and ungrammatical pauses in speech. (2009) (0)
Study of Evoked Potentials in Central Demyelinating Disorders Versus Nondemyelinating Disorders (2013) (0)
Inferring object rankings based on noisy pairwise comparisons from multiple annotators (2016) (0)
Web-Karma: v2.030 Patch Release 2 for PyTranforms (2014) (0)
INFLUENCE OF VARYING INJECTION PRESSURE ON PERFORMANCE AND EMISSION CHARACTERISTICS OF BLEND OF SAPOTA SEED BIODIESEL IN CI ENGINE (2021) (0)
Integration of CAD-associated GWAS loci and deconvolution from human carotid plaques to study smooth muscle cell function in atherosclerosis (2022) (0)
Physical Aspects And Methodology Of Three Dimensional Conformal External Radiation Therapy (2003) (0)
Effect of folic acid and vitamin B12 on the plasma homocysteine levels and neurological function in young and middle-aged acute ischemic stroke patients with hyperhomocysteinemia (2013) (0)
Theorizing seafarers’ participation and learning in an evolving maritime workplace: an activity theory perspective (2023) (0)
Test-Retest Repeatability of Articulatory Strategies Using Real-Time Magnetic Resonance Imaging (2017) (0)
Toward data‐driven modeling of dynamic vocal‐tract data (2003) (0)
Behavioral informatics from multimodal human interaction cues (2014) (0)
Alternative Blank Node Generation (2014) (0)
Understanding individual-level speech variability: From novel speech production data to robust speaker recognition (2017) (0)
An integrated analysis of speech and gestural characteristics in conversational child–computer interactions (2003) (0)
Sudden unexpected death due to hereditary angioedema — A case report (2021) (0)
Fast and efficient techniques for motion estimation using subband analysis (1994) (0)
Confusion2Vec 2.0: Enriching ambiguous spoken language representations with subwords (2021) (0)
Multimodal Estimation of Change Points of Physiological Arousal in Drivers (2022) (0)
Editorial: Intelligent Signal Analysis for Contagious Virus Diseases (2022) (0)
Emotion and mental state recognition from speech (2012) (0)
PRONUNCIATION VERIFICATION FOR AUTOMATIC LITERA (2006) (0)
Imaging for understanding speech communication: Advances and challenges (2005) (0)
Screenplay Quality Assessment: Can We Predict Who Gets Nominated? (2020) (0)
Selection of optimal vocal tract regions using real-time magnetic resonance imaging for robust voice activity detection (2014) (0)
A syllable-based approach for improved recognition of spoken names (2019) (0)
Toward cross-speaker articulatory modeling (2019) (0)
v2.030 Release (2014) (0)
Effects of Emotion on the Lower Lip Movements at Phrase Boundaries (2012) (0)
The geometry of planar linear flows (2022) (0)
Para-Linguistic Mechanisms of Production in Human ‘Beatboxing’: a Real-time Magnetic Resonance Imaging Study (2010) (0)
SYSTEMAND METHOD FOR BLENDING 3 . 49 A 3 (2017) (0)
Modeling of emotional expression in HMM-based speech synthesis (synthesis, prosody, generation, general) (2003) (0)
Emotion and Mental State Recognition from Speech: Special Issue of EURASIP Journals on Advances in Signal Processing vol 15 (2012) (0)
Boys don’t cry (or kiss or dance): A computational linguistic lens into gendered actions in film (2022) (0)
Modeling speaker-specific vocal tract kinematics from gestural scores (2021) (0)
Multimodal Clustering with Role Induced Constraints for Speaker Diarization (2022) (0)
The 2022 Far-field Speaker Verification Challenge: Exploring domain mismatch and semi-supervised learning under the far-field scenario (2022) (0)
Information theoretic analysis of direct and estimated articulatory features for phonetic discrimination (2010) (0)
Three‐dimensional tongue shapes of sibilant fricatives (1994) (0)
Design and Development of Free Flow Vertical Axis Wind Turbine (2014) (0)
Exploring Workplace Behaviors through Speaking Patterns using Large-scale Multimodal Wearable Recordings: A Study of Healthcare Providers (2022) (0)
Deep Crowd Analysis to Spot Social Distancing Violations in Post-COVID 19 Lifestyle (2022) (0)
A Holistic Qualitative Approach to Software Reliability (2013) (0)
v2.028 Release for Sprint 2 (2014) (0)
Detection of Musical Event Drop from Crowdsourced Annotations Using a Noisy Channel Model (2014) (0)
A novel framework paradigm for EMR management cloud system authentication using blockchain security network (2023) (0)
Speaker and Listener Variations in Emotion Assessment (2005) (0)
Motion-Capture Patterns of Voluntarily Mimicked Dynamic Facial Expressions in Children and Adolescents With and Without ASD (2018) (0)
Fitts’ law in tongue and lip movements of repetitive speech (2020) (0)
Towards Adapting NMF Dictionaries Using Total Variability Modeling for Noise-Robust Acoustic Features (2019) (0)
Audio-Visual Activity Guided Cross-Modal Identity Association for Active Speaker Detection (2022) (0)
Virtual parts catalog and component sourcing with VRML (1997) (0)
Analyzing the Multimodal Behaviors of Users of a Speech-to-Speech Translation Device by using Concept Matching Scores (2007) (0)
Automatic speech recognition for mobile hand‐held devices (2000) (0)
Using real-time MRI to assess the development of jaw contribution in constriction formation synergies during early adolescence (2020) (0)
Statistical humming recognition and theme finder for query by humming systems (2003) (0)
Exploiting Conic Affinity Measures to Design Speech Enhancement Systems Operating in Unseen Noise Conditions (2020) (0)
A comparison study of emotional speech articulations using the principal component analysis method (2014) (0)
Speech Encoder Speech Decoder Feature Extraction Recognizer Client Server Encoder Decoder Feature Extraction Recognizer ServerClient (2004) (0)
Classification of clean and noisy bilingual movie audio for speech-to-speech translation corpora design (2014) (0)
Ten questions concerning the impact of environmental stress on office workers (2022) (0)
Cross Domain Emotion Recognition using Few Shot Knowledge Transfer (2021) (0)
Extending the Beta divergence to complex values (2020) (0)
v2.026 Release (2014) (0)
Efficient multichannel audio resynthesis by subband-based spectral conversion (2002) (0)
Creating Human-centric Expressive Interfaces: Linking Perceptual evaluations and Engineering Design of Synthetic Multimodal Communication (2009) (0)
The Silent Treatment? Changes in patient emotional expression after silence (2022) (0)
Beyond acoustic data: Characterizing disordered speech using direct articulatory evidence from real time imaging. (2009) (0)
v2.027 Release (2014) (0)
A Context-Aware Computational Approach for Measuring Vocal Entrainment in Dyadic Conversations (2022) (0)
Study of acoustic correlates associate with emotional speech (2004) (0)
Web-Karma: v2.030 Patch for PyTransforms (2014) (0)
Method of using a natural language interface to retrieve information from one or more data resources (1999) (0)
INTERSPEECH 2006-ICSLP T TIME CONTRASTS FOR USE IN SESSMENT (2006) (0)
Model Quantization for Unsupervised Speaker Indexing (2004) (0)
Inclusive machine intelligence and its promise for speech-centered societal application (2021) (0)
Recognition of voice onset time for use in pronunciation modeling (2005) (0)
A TRACE OF PANTHEISM IN THE SELECTED POEMS ON NATURAL PHENOMENA (2020) (0)
Leveraging Real-Time MRI for Illuminating Linguistic Velum Action (2021) (0)
A study on the nature and impact of work culture in colleges (2006) (0)
Task-dependence of articulator synergies (2019) (0)
Articulatory analysis of foreign-accented speech using real-time MRI. (2009) (0)
Vākyapadīya : sphoṭa, jāti and dravya (2012) (0)
Multimodal Human and Environmental Sensing for Longitudinal Behavioral Studies in Naturalistic Settings: Framework for Sensor Selection, Deployment, and Management (Preprint) (2018) (0)
Technology Issues in Implementation of E-Governance (2005) (0)
Computational Audio Analysis (Dagstuhl Seminar 13451) (2013) (0)
A distributed speech recognition system in multi-user environments (2004) (0)
Web-Karma: v2.030 Patch 3 - Handles missing columns (2014) (0)
Keynote speech 4: Extraction of linguistic and paralinguistic information from audio-visual data (2015) (0)
Motion-capture patterns of voluntarily mimicked dynamic facial expressions in children and adolescents with and without ASD (2018) (0)
Annotation and classification of Political advertisements (2013) (0)
Does articulatory setting provide some mechanical advantage for speech motor action (2013) (0)
A real-time magnetic resonance imaging study of cross-speaker variability in the production of /ɹ/ (2019) (0)
Utilization of machine learning approaches on multimodal and ambulatory data to predict individualized symptom course in adults with obsessive-compulsive disorder. (2023) (0)
A near-optimal (minimax) tree-structured partition for mutual information estimation (2010) (0)
Dark tone quality and vocal tract shaping in soprano song production: Insights from real-time MRI (2021) (0)
On-Line Speaker Indexing (2004) (0)
Front-end Diarization for Percussion Separation in Taniavartanam of Carnatic Music Concerts (2021) (0)
COMBINATION OF MULTIPLE MODALITIES FOR RECOGNITION AND ANALYSIS OF EMOTIONAL EXPRESSION (2009) (0)
Active data acquisition for building language models for speech recognition (2007) (0)
Statistical Modeling and Retrieval of (2007) (0)
A transmission‐line model of the lateral approximants (1996) (0)
Signal Processing Grand Challenge 2023 - e-Prevention: Sleep Behavior as an Indicator of Relapses in Psychotic Patients (2023) (0)
Using the KNOWME Networks Mobile Biomonitoring System to Characterize Physical Activity in Overweight Hispanic Youth: 2033 (2010) (0)
Knowledge as a Constraint on Uncertainty for Unsupervised Classification: A Study in Part-of-Speech Tagging (2008) (0)
TILES-2018, a longitudinal physiologic and behavioral data set of hospital workers (2020) (0)
Heat stress During the Early Flowering Stage Did Not Affect Seed Fatty acid Contents in Conventional Oleic Peanut Varieties (2022) (0)
Representations of electromagnetic articulography data for tongue shaping and vocal tract configuration (2016) (0)
Joint filtering and factorization for recovering latent structure from noisy speech data (2014) (0)
Analyzing the Physiological Synchrony of Children with Autism and their Parents with Signal Processing Techniques (2012) (0)
Simulations of sound change resulting from a production-recovery loop (2013) (0)
Indexing tongue profile narrowing for English lateral consonants using 3D volumetric MR imaging (2017) (0)
Co-registration of articulographic and real-time magnetic resonance imaging data for multimodal analysis of rapid speech (2012) (0)
Quantitative Observational Practice in Family Studies: The case of reactivity (2013) (0)
Caught sleeping: recording of snoring during a real-time MRI scan (2012) (0)
Acoustic Analysis and Auto of Spontaneous Child (2006) (0)
On voicing activity under the control of emotion and loudness (2007) (0)

Other Resources About Shrikanth Narayanan

en.wikipedia.org

What Schools Are Affiliated With Shrikanth Narayanan?

Shrikanth Narayanan is affiliated with the following schools:
