Björn Schuller

Q: What Schools Are Affiliated With Björn Schuller

Björn Schuller is affiliated with the following schools: University of Passau, University of Augsburg, Imperial College London, Tianjin Normal University, Technical University of Munich

Björn Schuller's AcademicInfluence.com Rankings

Björn Schuller

Engineering

#1260

World Rank

#1897

Historical Rank

Electrical Engineering

#186

World Rank

#215

Historical Rank

engineering Degrees

Download Badge

Engineering

Why Is Björn Schuller Influential?

(Suggest an Edit or Addition)

According to Wikipedia, Björn Wolfgang Schuller is a scientist of electrical engineering, information technology and computer science as well as entrepreneur. He is professor of artificial intelligence at Imperial College London., UK, and holds the chair of embedded intelligence for healthcare and wellbeing at the University of Augsburg in Germany. He was a university professor and holder of the chair of complex and intelligent systems at the University of Passau in Germany. He is also co-founder and managing director as well as the current chief scientific officer of audEERING GmbH, Germany, as well as permanent visiting professor at the Harbin Institute of Technology in the People's Republic of China and associate of CISA at the University of Geneva in French-speaking Switzerland.

(See a Problem?)

Björn Schuller's Published Works

Number of citations in a given year to any of this author's works

Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author

Published Works

Opensmile: the munich versatile and fast open-source audio feature extractor (2010) (2162)
Recent developments in openSMILE, the munich open-source multimedia feature extractor (2013) (1112)
New Avenues in Opinion Mining and Sentiment Analysis (2013) (1065)
The Geneva Minimalistic Acoustic Parameter Set (GeMAPS) for Voice Research and Affective Computing (2016) (984)
The INTERSPEECH 2009 emotion challenge (2009) (909)
Adieu features? End-to-end speech emotion recognition using a deep convolutional recurrent network (2016) (700)
Introduction (2015) (676)
Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge (2011) (663)
The INTERSPEECH 2013 computational paralinguistics challenge: social signals, conflict, emotion, autism (2013) (649)
Hidden Markov model-based speech emotion recognition (2003) (596)
The INTERSPEECH 2010 paralinguistic challenge (2010) (529)
Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR (2015) (525)
AVEC 2016: Depression, Mood, and Emotion Recognition Workshop and Challenge (2016) (478)
Speech emotion recognition combining acoustic features and linguistic information in a hybrid support vector machine-belief network architecture (2004) (445)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS Publication Information (2020) (415)
OpenEAR — Introducing the munich open-source emotion and affect recognition toolkit (2009) (415)
AVEC 2013: the continuous audio/visual emotion and depression recognition challenge (2013) (410)
End-to-End Multimodal Emotion Recognition Using Deep Neural Networks (2017) (397)
Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies (2010) (329)
AVEC 2014: 3D Dimensional Affect and Depression Recognition Challenge (2014) (323)
Categorical and dimensional affect analysis in continuous input: Current trends and future directions (2013) (317)
A survey of multimodal sentiment analysis (2017) (317)
YouTube Movie Reviews: Sentiment Analysis in an Audio-Visual Context (2013) (311)
Abandoning emotion classes - towards continuous emotion recognition with modelling of long-range dependencies (2008) (305)
Deep Affect Prediction in-the-Wild: Aff-Wild Database and Challenge, Deep Architectures, and Beyond (2018) (302)
AVEC 2011-The First International Audio/Visual Emotion Challenge (2011) (295)
Paralinguistics in speech and language - State-of-the-art and the challenge (2013) (295)
Sparse Autoencoder-Based Feature Transfer Learning for Speech Emotion Recognition (2013) (293)
SenticNet 4: A Semantic Resource for Sentiment Analysis Based on Conceptual Primitives (2016) (291)
Emotion representation, analysis and synthesis in continuous space: A survey (2011) (289)
Discriminatively trained recurrent neural networks for single-channel speech separation (2014) (285)
Acoustic emotion recognition: A benchmark comparison of performances (2009) (261)
Speech emotion recognition (2018) (255)
Deep neural networks for acoustic emotion recognition: Raising the benchmarks (2011) (253)
AVEC 2012: the continuous audio/visual emotion challenge (2012) (249)
LSTM-Modeling of continuous emotions in an audiovisual affect recognition framework (2013) (249)
The INTERSPEECH 2012 Speaker Trait Challenge (2012) (248)
AVEC 2017: Real-life Depression, and Affect Recognition Workshop and Challenge (2017) (240)
The INTERSPEECH 2011 Speaker State Challenge (2011) (229)
The relevance of feature type for the automatic classification of emotional user states: low level descriptors and functionals (2007) (228)
Autoencoder-based Unsupervised Domain Adaptation for Speech Emotion Recognition (2014) (227)
A Deep Matrix Factorization Method for Learning Attribute Representations (2015) (224)
The INTERSPEECH 2016 Computational Paralinguistics Challenge: Deception, Sincerity & Native Language (2016) (224)
An investigation of the ‘female camouflage effect’ in autism using a computerized ADOS-2 and a test of sex/gender differences (2016) (218)
Real-life voice activity detection with LSTM Recurrent Neural Networks and an application to Hollywood movies (2013) (217)
Computational Paralinguistics (2013) (216)
Personalized machine learning for robot perception of affect and engagement in autism therapy (2018) (212)
Snore Sound Classification Using Image-Based Deep Spectrum Features (2017) (212)
On the Acoustics of Emotion in Audio: What Speech, Music, and Sound have in Common (2013) (211)
Building Autonomous Sensitive Artificial Listeners (2012) (204)
Being bored? Recognising natural interest by extensive audiovisual integration for real-life application (2009) (197)
A novel approach for automatic acoustic novelty detection using a denoising autoencoder with bidirectional LSTM neural networks (2015) (197)
Introducing CURRENNT: the munich open-source CUDA recurrent neural network toolkit (2015) (196)
Prediction of asynchronous dimensional emotion ratings from audiovisual and physiological data (2015) (188)
The TUM Gait from Audio, Image and Depth (GAID) database: Multimodal recognition of subjects and traits (2014) (178)
Context-sensitive multimodal emotion recognition from speech and facial expression using bidirectional LSTM modeling (2010) (176)
AV+EC 2015: The First Affect Recognition Challenge Bridging Across Audio, Video, and Physiological Data (2015) (172)
Emotion on the Road - Necessity, Acceptance, and Feasibility of Affective Computing in the Car (2010) (164)
A Deep Semi-NMF Model for Learning Hidden Representations (2014) (163)
Context-Sensitive Learning for Enhanced Audiovisual Emotion Classification (2012) (162)
Whodunnit - Searching for the most important feature types signalling emotion-related user states in speech (2011) (160)
Combining Long Short-Term Memory and Dynamic Bayesian Networks for Incremental Emotion-Sensitive Artificial Listening (2010) (160)
Towards More Reality in the Recognition of Emotional Speech (2007) (159)
openXBOW - Introducing the Passau Open-Source Crossmodal Bag-of-Words Toolkit (2016) (155)
Speaker Independent Speech Emotion Recognition by Ensemble Classification (2005) (154)
Speaker independent emotion recognition by early fusion of acoustic and linguistic features within ensembles (2005) (152)
Combining Efforts for Improving Automatic Classification of Emotional User States (2006) (150)
Universal Onset Detection with Bidirectional Long Short-Term Memory Neural Networks (2010) (150)
End-to-End Speech Emotion Recognition Using Deep Neural Networks (2018) (149)
AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition (2019) (148)
The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load (2014) (147)
Online Driver Distraction Detection Using Long Short-Term Memory (2011) (140)
The INTERSPEECH 2017 Computational Paralinguistics Challenge: Addressee, Cold & Snoring (2017) (140)
The INTERSPEECH 2015 computational paralinguistics challenge: nativeness, parkinson's & eating condition (2015) (139)
Knowledge-Based Approaches to Concept-Level Sentiment Analysis (2013) (135)
Deep Learning for Environmentally Robust Speech Recognition (2017) (130)
On-line emotion recognition in a 3-D activation-valence-time continuum using acoustic and linguistic cues (2010) (127)
At the Border of Acoustics and Linguistics: Bag-of-Audio-Words for the Recognition of Emotions in Speech (2016) (125)
Speech Emotion Classification Using Attention-Based LSTM (2019) (122)
Large-scale audio feature extraction and SVM for acoustic scene classification (2013) (122)
AVEC 2018 Workshop and Challenge: Bipolar Disorder and Cross-Cultural Affect Recognition (2018) (121)
Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments (2017) (121)
Single-channel speech separation with memory-enhanced recurrent neural networks (2014) (121)
An Image-based Deep Spectrum Feature Representation for the Recognition of Emotional Speech (2017) (120)
auDeep: Unsupervised Learning of Representations from Audio with Deep Recurrent Neural Networks (2017) (120)
Emotion Recognition in the Noise Applying Large Acoustic Feature Sets (2006) (119)
SEWA DB: A Rich Database for Audio-Visual Emotion and Sentiment Research in the Wild (2019) (119)
Convolutional RNN: An enhanced model for extracting features from sequential data (2016) (113)
Cooperative Learning and its Application to Emotion Recognition from Speech (2015) (113)
The INTERSPEECH 2018 Computational Paralinguistics Challenge: Atypical & Self-Assessed Affect, Crying & Heart Beats (2018) (112)
Unsupervised learning in cross-corpus acoustic emotion recognition (2011) (108)
Affective Computing and Intelligent Interaction (2011) (108)
Frame vs. Turn-Level: Emotion Recognition from Speech Considering Static and Dynamic Processing (2007) (108)
On the Necessity and Feasibility of Detecting a Driver's Emotional State While Driving (2007) (104)
Universum Autoencoder-Based Domain Adaptation for Speech Emotion Recognition (2017) (99)
COVID-19 and Computer Audition: An Overview on What Speech & Sound Analysis Could Contribute in the SARS-CoV-2 Corona Crisis (2020) (98)
Deep Structured Learning for Facial Action Unit Intensity Estimation (2017) (97)
Timing levels in segment-based speech emotion recognition (2006) (96)
Semisupervised Autoencoders for Speech Emotion Recognition (2018) (96)
Speech analysis for health: Current state-of-the-art and the increasing impact of deep learning. (2018) (94)
The INTERSPEECH 2020 Computational Paralinguistics Challenge: Elderly Emotion, Breathing & Masks (2020) (91)
Summary for AVEC 2016: Depression, Mood, and Emotion Recognition Workshop and Challenge (2016) (91)
A multidimensional dynamic time warping algorithm for efficient multimodal fusion of asynchronous data streams (2009) (89)
Segmenting into Adequate Units for Automatic Recognition of Emotion-Related Episodes: A Speech-Based Approach (2010) (87)
Audiovisual recognition of spontaneous interest within conversations (2007) (87)
Deep Scalogram Representations for Acoustic Scene Classification (2018) (87)
Deep recurrent de-noising auto-encoder and blind de-reverberation for reverberated speech recognition (2014) (86)
The INTERSPEECH 2021 Computational Paralinguistics Challenge: COVID-19 Cough, COVID-19 Speech, Escalation & Primates (2021) (84)
Multi-resolution linear prediction based features for audio onset detection with bidirectional LSTM neural networks (2014) (83)
Statistical Approaches to Concept-Level Sentiment Analysis (2013) (83)
Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection (2017) (83)
An Early Study on Intelligent Analysis of Speech under COVID-19: Severity, Sleep Quality, Fatigue, and Anxiety (2020) (81)
Measuring Engagement in Robot-Assisted Autism Therapy: A Cross-Cultural Study (2017) (80)
Combining frame and turn-level information for robust recognition of emotions within speech (2007) (79)
AVEC 2012: the continuous audio/visual emotion challenge - an introduction (2012) (79)
Meta-classifiers in acoustic and linguistic feature fusion-based affect recognition (2005) (79)
Sequence to Sequence Autoencoders for Unsupervised Representation Learning from Audio (2017) (78)
Contextual Bidirectional Long Short-Term Memory Recurrent Neural Network Language Models: A Generative Approach to Sentiment Analysis (2017) (77)
Audiovisual Behavior Modeling by Combined Feature Spaces (2007) (76)
Bidirectional LSTM Networks for Context-Sensitive Keyword Detection in a Cognitive Virtual Agent Framework (2010) (73)
Non-linear prediction with LSTM recurrent neural networks for acoustic novelty detection (2015) (72)
Introducing shared-hidden-layer autoencoders for transfer learning and their application in acoustic emotion recognition (2014) (72)
Brute-forcing hierarchical functionals for paralinguistics: A waste of feature space? (2008) (71)
Using Multiple Databases for Training in Emotion Recognition: To Unite or to Vote? (2011) (70)
Robust speech recognition using long short-term memory recurrent neural networks for hybrid acoustic modelling (2014) (70)
A Survey on perceived speaker traits: Personality, likability, pathology, and the first challenge (2015) (70)
Robust discriminative keyword spotting for emotionally colored spontaneous speech using bidirectional LSTM networks (2009) (69)
Real-Time Speech Separation by Semi-supervised Nonnegative Matrix Factorization (2012) (69)
Feature enhancement by deep LSTM networks for ASR in reverberant multisource environments (2014) (68)
The Handbook of Multimodal-Multisensor Interfaces: Foundations, User Modeling, and Common Modality Combinations - Volume 1 (2017) (68)
A Novel Way to Measure and Predict Development: A Heuristic Approach to Facilitate the Early Detection of Neurodevelopmental Disorders (2017) (68)
Recognizing Affect from Linguistic Information in 3D Continuous Space (2011) (67)
Cross-Corpus Classification of Realistic Emotions - Some Pilot Experiments (2010) (66)
Exploring Deep Spectrum Representations via Attention-Based Recurrent and Convolutional Neural Networks for Speech Emotion Recognition (2019) (66)
The INTERSPEECH 2019 Computational Paralinguistics Challenge: Styrian Dialects, Continuous Sleepiness, Baby Sounds & Orca Activity (2019) (66)
Augment to Prevent: Short-Text Data Augmentation in Deep Learning for Hate-Speech Classification (2019) (66)
openSMILE:): the Munich open-source large-scale multimedia feature extractor (2015) (65)
End-to-end convolutional neural network enables COVID-19 detection from breath and cough audio: a pilot study (2021) (64)
Affective Image Content Analysis: A Comprehensive Survey (2018) (64)
Audio Source Separation (2013) (64)
The Automatic Recognition of Emotions in Speech (2011) (64)
Deep Representation Learning in Speech Processing: Challenges, Recent Advances, and Future Trends (2020) (63)
The Voice of Leadership: Models and Performances of Automatic Analysis in Online Speeches (2012) (62)
Audio recognition in the wild: Static and dynamic classification on a real-world database of animal vocalizations (2011) (62)
An Overview on Audio, Signal, Speech, & Language Processing for COVID-19 (2020) (62)
Sentiment Analysis and Topic Recognition in Video Transcriptions (2021) (61)
Evolutionary Feature Generation in Speech Emotion Recognition (2006) (61)
Static and Dynamic Modelling for the Recognition of Non-verbal Vocalisations in Conversational Speech (2008) (60)
From Hard to Soft: Towards more Human-like Emotion Recognition by Modelling the Perception Uncertainty (2017) (60)
On-line continuous-time music mood regression with deep recurrent neural networks (2014) (60)
Applying multiple classifiers and non-linear dynamics features for detecting sleepiness from speech (2012) (60)
Enhanced semi-supervised learning for multimodal emotion recognition (2016) (60)
CCA based feature selection with application to continuous depression recognition from acoustic speech features (2014) (59)
Semi-supervised learning helps in sound event classification (2012) (59)
Social signal classification using deep blstm recurrent neural networks (2014) (58)
Discriminatively Trained Recurrent Neural Networks for Continuous Dimensional Emotion Recognition from Audio (2016) (57)
Attention-augmented End-to-end Multi-task Learning for Emotion Prediction from Speech (2019) (57)
Cross-language acoustic emotion recognition: An overview and some tendencies (2015) (57)
Adversarial Training in Affective Computing and Sentiment Analysis: Recent Advances and Perspectives [Review Article] (2018) (56)
Low-Level Fusion of Audio, Video Feature for Multi-Modal Emotion Recognition (2008) (56)
A multitask approach to continuous five-dimensional affect sensing in natural speech (2012) (55)
Emotion recognition from speech: Putting ASR in the loop (2009) (55)
iHEARu-PLAY: Introducing a game for crowdsourced data collection for affective computing (2015) (54)
Multimodal emotion recognition in audiovisual communication (2002) (54)
AVEC 2015: The 5th International Audio/Visual Emotion Challenge and Workshop (2015) (53)
The Munich 2011 CHiME Challenge Contribution: NMF-BLSTM Speech Enhancement and Recognition for Reverberated Multisource Environments (2011) (53)
From speech to letters - using a novel neural network architecture for grapheme based ASR (2009) (53)
What Should a Generic Emotion Markup Language Be Able to Represent? (2007) (52)
Medium-term speaker states - A review on intoxication, sleepiness and the first challenge (2014) (52)
Attention-based convolutional neural networks for acoustic scene classification (2018) (52)
‘Mister D.J., Cheer Me Up!’: Musical and Textual Features for Automatic Mood Classification (2010) (52)
String-based audiovisual fusion of behavioural events for the assessment of dimensional affect (2011) (52)
Semi-Supervised Active Learning for Sound Classification in Hybrid Learning Environments (2016) (52)
Data-driven clustering in emotional space for affect recognition using discriminatively trained LSTM networks (2009) (51)
A multi-stream ASR framework for BLSTM modeling of conversational speech (2011) (51)
Multi-Task Semi-Supervised Adversarial Autoencoding for Speech Emotion Recognition (2019) (51)
Efficient Recognition of Authentic Dynamic Facial Expressions on the Feedtum Database (2006) (51)
Deep Canonical Time Warping for Simultaneous Alignment and Representation Learning of Sequences (2018) (51)
Recognizing Emotions From Whispered Speech Based on Acoustic Feature Transfer Learning (2017) (50)
Recognition of Noisy Speech: A Comparative Survey of Robust Model Architecture and Feature Enhancement (2009) (50)
Classification of the Excitation Location of Snore Sounds in the Upper Airway by Acoustic Multifeature Analysis (2017) (50)
Advanced Data Exploitation in Speech Analysis: An overview (2017) (50)
Non-negative matrix factorization as noise-robust feature extractor for speech recognition (2010) (50)
The Age of Artificial Emotional Intelligence (2018) (49)
Intelligent Audio Analysis (2013) (49)
Determination of Nonprototypical Valence and Arousal in Popular Music: Features and Performances (2010) (48)
"Would You Buy a Car from Me?" - On the Likability of Telephone Voices (2011) (48)
The Computational Paralinguistics Challenge (2012) (48)
Dawn of the transformer era in speech emotion recognition: closing the valence gap (2022) (47)
Emotional expression in psychiatric conditions: New technology for clinicians (2018) (47)
Multi-task deep neural network with shared hidden layers: Breaking down the wall between emotion representations (2017) (46)
Cross-lingual Zero- and Few-shot Hate Speech Detection Utilising Frozen Transformer Language Models and AXEL (2020) (46)
Keyword spotting exploiting Long Short-Term Memory (2013) (46)
CultureNet: A Deep Learning Approach for Engagement Intensity Estimation from Face Images of Children with Autism (2018) (46)
Facing Realism in Spontaneous Emotion Recognition from Speech: Feature Enhancement by Autoencoder with LSTM Neural Networks (2016) (46)
Applying multi layer homography for multi camera person tracking (2008) (45)
Learning Image-based Representations for Heart Sound Classification (2018) (45)
The MERL/MELCO/TUM system for the REVERB Challenge using Deep Recurrent Neural Network Feature Enhancement (2014) (45)
Non-negative matrix factorization for highly noise-robust ASR: To enhance or to recognize? (2012) (45)
Feature enhancement by bidirectional LSTM networks for conversational speech recognition in highly non-stationary noise (2013) (44)
New avenues in knowledge bases for natural language processing (2016) (44)
End2You - The Imperial Toolkit for Multimodal Profiling by End-to-End Learning (2018) (44)
Patterns, prototypes, performance: classifying emotional user states (2008) (44)
Acoustic Gait-based Person Identification using Hidden Markov Models (2014) (44)
Combining a parallel 2D CNN with a self-attention Dilated Residual Network for CTC-based discrete speech emotion recognition (2021) (44)
Affective and behavioural computing: Lessons learnt from the First Computational Paralinguistics Challenge (2019) (43)
Strength modelling for real-worldautomatic continuous affect recognition from audiovisual signals (2017) (43)
Towards responsive Sensitive Artificial Listeners (2008) (43)
Automatic recognition of physiological parameters in the human voice: Heart rate and skin conductance (2013) (43)
Prediction-based learning for continuous emotion recognition in speech (2017) (43)
Selecting Training Data for Cross-Corpus Speech Emotion Recognition: Prototypicality vs. Generalization (2011) (42)
I Hear You Eat and Speak: Automatic Recognition of Eating Condition and Food Type, Use-Cases, and Impact on ASR Performance (2016) (42)
Attention-based Atrous Convolutional Neural Networks: Visualisation and Understanding Perspectives of Acoustic Scenes (2019) (42)
Comparing one and two-stage acoustic modeling in the recognition of emotion in speech (2007) (42)
The hinterland of emotions: Facing the open-microphone challenge (2009) (41)
Continuous Estimation of Emotions in Speech by Dynamic Cooperative Speaker Models (2017) (41)
Modeling gender information for emotion recognition using Denoising autoencoder (2014) (41)
Automatic recognition of emotion evoked by general sound events (2012) (41)
Recognising interest in conversational speech - comparing bag of frames and supra-segmental features (2009) (40)
Combining speech recognition and acoustic word emotion models for robust text-independent emotion recognition (2008) (40)
Attention-Enhanced Connectionist Temporal Classification for Discrete Speech Emotion Recognition (2019) (40)
Deep Canonical Time Warping (2016) (39)
Active Learning by Sparse Instance Tracking and Classifier Confidence in Acoustic Emotion Recognition (2012) (39)
Detecting COVID-19 from Breathing and Coughing Sounds using Deep Neural Networks (2020) (39)
DeepCoder: Semi-Parametric Variational Autoencoders for Automatic Facial Action Coding (2017) (39)
The voice of COVID-19: Acoustic correlates of infection in sustained vowelsa) (2020) (39)
Affect recognition in real-life acoustic conditions - a new perspective on feature selection (2013) (39)
A Bag-of-Audio-Words Approach for Snore Sounds' Excitation Localisation (2016) (39)
Driver Frustration Detection from Audio and Video in the Wild (2016) (39)
Detecting overlapping speech with long short-term memory recurrent neural networks (2013) (39)
Cross lingual speech emotion recognition using canonical correlation analysis on principal component subspace (2016) (39)
Transfer learning emotion manifestation across music and speech (2014) (38)
The MuSe 2021 Multimodal Sentiment Analysis Challenge: Sentiment, Emotion, Physiological-Emotion, and Stress (2021) (38)
Automatic Assessment of Depression From Speech via a Hierarchical Attention Transfer Network and Attention Autoencoders (2020) (37)
Validity of machine learning in biology and medicine increased through collaborations across fields of expertise (2020) (37)
The Computational Paralinguistics Challenge [Social Sciences] (2012) (37)
End-to-end multimodal affect recognition in real-world environments (2021) (37)
Earlier Identification of Children with Autism Spectrum Disorder: An Automatic Vocalisation-Based Approach (2017) (36)
Affect-Robust Speech Recognition by Dynamic Emotional Adaptation (2006) (36)
Tango or Waltz?: Putting Ballroom Dance Style into Tempo Detection (2008) (36)
A real-time system for hand gesture controlled operation of in-car devices (2003) (36)
End-to-end learning for dimensional emotion recognition from physiological signals (2017) (36)
Typicality and emotion in the voice of children with autism spectrum condition: evidence across three languages (2015) (36)
MEC 2016: The Multimodal Emotion Recognition Challenge of CCPR 2016 (2016) (36)
Hidden Conditional Random Fields for Meeting Segmentation (2007) (36)
DEMoS: an Italian emotional speech corpus (2020) (36)
Multimodal Bag-of-Words for Cross Domains Sentiment Analysis (2018) (35)
The Munich Biovoice Corpus: Effects of Physical Exercising, Heart Rate, and Skin Conductance on Human Speech Production (2014) (35)
Speech-based Diagnosis of Autism Spectrum Condition by Generative Adversarial Network Representations (2017) (35)
Towards Temporal Modelling of Categorical Speech Emotion Recognition (2018) (35)
Detecting Road Surface Wetness from Audio: A Deep Learning Approach (2015) (35)
Learning with synthesized speech for automatic emotion recognition (2010) (34)
Enhancing Speech-Based Depression Detection Through Gender Dependent Vowel-Level Formant Features (2017) (34)
Memory-Enhanced Neural Networks and NMF for Robust ASR (2014) (34)
Deep Sequential Image Features on Acoustic Scene Classification (2017) (34)
Detection of negative emotions in speech signals using bags-of-audio-words (2015) (34)
Sentiment analysis and opinion mining: on optimal parameters and performances (2015) (34)
ASC-Inclusion: Interactive Emotion Games for Social Inclusion of Children with Autism Spectrum Conditions (2013) (34)
Voice and Speech Analysis in Search of States and Traits (2011) (34)
Sentiment analysis using image-based deep spectrum features (2017) (33)
Supervised and semi-supervised suppression of background music in monaural speech recordings (2012) (33)
Active learning for bird sound classification via a kernel-based extreme learning machine. (2017) (33)
Recognition of Nonprototypical Emotions in Reverberated and Noisy Speech by Nonnegative Matrix Factorization (2011) (33)
On the Impact of Children's Emotional Speech on Acoustic and Language Models (2010) (33)
Emotion Recognition in the Wild: Incorporating Voice and Lip Activity in Multimodal Decision-Level Fusion (2014) (33)
Stargan for Emotional Speech Conversion: Validated by Data Augmentation of End-To-End Emotion Recognition (2020) (33)
Analyzing the memory of BLSTM Neural Networks for enhanced emotion classification in dyadic spoken interactions (2012) (33)
Recent developments and results of ASC-Inclusion: An Integrated Internet-Based Environment for Social Inclusion of Children with Autism Spectrum Conditions (2015) (32)
Towards Robust Speech Emotion Recognition Using Deep Residual Networks for Speech Enhancement (2019) (32)
Automatic Classification of Autistic Child Vocalisations: A Novel Database and Results (2017) (32)
Snoring classified: The Munich-Passau Snore Sound Corpus (2018) (32)
Emotion recognition using imperfect speech recognition (2010) (32)
The state of play of ASC-Inclusion: An Integrated Internet-Based Environment for Social Inclusion of Children with Autism Spectrum Conditions (2014) (31)
Exploitation of Phase-Based Features for Whispered Speech Emotion Recognition (2016) (31)
A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition (2020) (31)
Dynamic Difficulty Awareness Training for Continuous Emotion Prediction (2018) (30)
Speech overlap detection and attribution using convolutive non-negative sparse coding (2012) (30)
Snore-GANs: Improving Automatic Snore Sound Classification With Synthesized Data (2019) (30)
The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements (2021) (30)
Mothers, adults, children, pets — towards the acoustics of intimacy (2008) (29)
MuSe 2020 Challenge and Workshop: Multimodal Sentiment Analysis, Emotion-target Engagement and Trustworthiness Detection in Real-life Media: Emotional Car Reviews in-the-wild (2020) (29)
Connecting Subspace Learning and Extreme Learning Machine in Speech Emotion Recognition (2019) (29)
Emotion sensitive speech control for human-robot interaction in minimal invasive surgery (2008) (29)
Emotion Recognition in Naturalistic Speech and Language—A Survey (2015) (29)
Affective Video Retrieval: Violence Detection in Hollywood Movies by Large-Scale Segmental Feature Extraction (2013) (29)
A Hierarchical Attention Network-Based Approach for Depression Detection from Transcribed Clinical Interviews (2019) (29)
Emotion in the speech of children with autism spectrum conditions: prosody and everything else (2012) (29)
Learning and Knowledge-Based Sentiment Analysis in Movie Review Key Excerpts (2010) (29)
Spectral and Cepstral Audio Noise Reduction Techniques in Speech Emotion Recognition (2016) (29)
Localization of non-linguistic events in spontaneous speech by Non-Negative Matrix Factorization and Long Short-Term Memory (2011) (29)
Automatic Analysis of Typical and Atypical Encoding of Spontaneous Emotion in the Voice of Children (2016) (29)
Emotion in the singing voice—a deeperlook at acoustic features in the light ofautomatic classification (2015) (29)
Detection of security related affect and behaviour in passenger transport (2008) (29)
Optimization and Parallelization of Monaural Source Separation Algorithms in the openBliSSART Toolkit (2012) (29)
Recognition of Spontaneous Emotions by Speech within Automotive Environment (2006) (29)
Audio for Audio is Better? An Investigation on Transfer Learning Models for Heart Sound Classification (2020) (29)
AVEC 2014: the 4th international audio/visual emotion challenge and workshop (2014) (28)
Calibrated Prediction Intervals for Neural Network Regressors (2018) (28)
Fisher Kernels on Phase-Based Features for Speech Emotion Recognition (2016) (28)
COVID-19 detection from audio: seven grains of salt (2021) (28)
Reconstruction-error-based learning for continuous emotion recognition in speech (2017) (28)
AI-Based human audio processing for COVID-19: A comprehensive overview (2021) (28)
Personalized Estimation of Engagement From Videos Using Active Learning With Deep Reinforcement Learning (2019) (28)
Leveraging Unlabeled Data for Emotion Recognition With Enhanced Collaborative Semi-Supervised Learning (2018) (28)
Confidence Measures in Speech Emotion Recognition Based on Semi-supervised Learning (2012) (28)
The University of Passau Open Emotion Recognition System for the Multimodal Emotion Challenge (2016) (27)
A Review on Five Recent and Near-Future Developments in Computational Processing of Emotion in the Human Voice (2020) (27)
Emotional Analysis of Music: A Comparison of Methods (2014) (27)
Likability Classification - A Not so Deep Neural Network Approach (2012) (27)
Automatic Assessment of Singer Traits in Popular Music: Gender, Age, Height and Race (2011) (27)
Wavelet features for classification of vote snore sounds (2016) (27)
Classification of the Excitation Location of Snore Sounds in the Upper Airway by Acoustic Multi-Feature Analysis. (2016) (27)
Continuous Emotion Recognition in Speech - Do We Need Recurrence? (2019) (27)
Serious Gaming for Behavior Change: The State of Play (2013) (27)
CAST a database: Rapid targeted large-scale big data acquisition via small-world modelling of social media platforms (2017) (26)
Linked Source and Target Domain Subspace Feature Transfer Learning -- Exemplified by Speech Emotion Recognition (2014) (26)
Classification of Music Genres Based on Music Separation into Harmonic and Drum Components (2015) (26)
Classification of Lung Nodules Based on Deep Residual Networks and Migration Learning (2020) (26)
Bag-of-Deep-Features: Noise-Robust Deep Feature Representations for Audio Analysis (2018) (26)
Introducing the Weighted Trustability Evaluator for Crowdsourcing Exemplified by Speaker Likability Classification (2016) (26)
Categorical vs Dimensional Perception of Italian Emotional Speech (2018) (26)
An 'End-to-Evolution' Hybrid Approach for Snore Sound Classification (2017) (26)
Discrimination of speech and non-linguistic vocalizations by Non-Negative Matrix Factorization (2010) (26)
A Comparison of Online Automatic Speech Recognition Systems and the Nonverbal Responses to Unintelligible Speech (2019) (26)
Wavelets Revisited for the Classification of Acoustic Scenes (2017) (25)
Bird sounds classification by large scale acoustic features and extreme learning machine (2015) (25)
A discriminative approach to polyphonic piano note transcription using supervised non-negative matrix factorization (2013) (25)
Average Jane, Where Art Thou? – Recent Avenues in Efficient Machine Learning Under Subjectivity Uncertainty (2020) (25)
Suspicious Behavior Detection in Public Transport by Fusion of Low-Level Video Descriptors (2007) (25)
RECOGNISING ACOUSTIC SCENES WITH LARGE-SCALE AUDIO FEATURE EXTRACTION AND SVM (2013) (25)
Syllabification of conversational speech using Bidirectional Long-Short-Term Memory Neural Networks (2011) (25)
The MuSe 2022 Multimodal Sentiment Analysis Challenge: Humor, Emotional Reactions, and Stress (2022) (25)
Multi-Camera Person Tracking and Left Luggage Detection Applying Homographic Transformation (2007) (25)
A comparative study on sparsity penalties for NMF-based speech separation: Beyond LP-norms (2013) (25)
“The Godfather” vs. “Chaos”: Comparing Linguistic Analysis Based on On-line Knowledge Sources and Bags-of-N-Grams for Movie Review Valence Estimation (2009) (25)
Convolutional Neural Networks with Data Augmentation for Classifying Speakers' Native Language (2016) (25)
Words that Fascinate the Listener: Predicting Affective Ratings of On-Line Lectures (2013) (25)
Early Vocal Development in Autism Spectrum Disorder, Rett Syndrome, and Fragile X Syndrome: Insights from Studies Using Retrospective Video Analysis (2018) (25)
Pairwise Decomposition with Deep Neural Networks and Multiscale Kernel Subspace Learning for Acoustic Scene Classification (2016) (25)
Confidence Measures for Speech Emotion Recognition: A Start (2012) (25)
The Munich LSTM-RNN Approach to the MediaEval 2014 "Emotion in Music'" Task (2014) (25)
MixedEmotions: An Open-Source Toolbox for Multimodal Emotion Analysis (2018) (24)
Audiovisual classification of vocal outbursts in human conversation using Long-Short-Term Memory networks (2011) (24)
Emotion-Awareness for Intelligent Vehicle Assistants: A Research Agenda (2018) (24)
Hierarchical Attention Transfer Networks for Depression Assessment from Speech (2020) (24)
Towards Intelligent Crowdsourcing for Audio Data Annotation: Integrating Active Learning in the Real World (2017) (24)
Fast and Robust Meter and Tempo Recognition for the Automatic Discrimination of Ballroom Dance Styles (2007) (24)
Ask Alice: an artificial retrieval of information agent (2016) (23)
Blind Enhancement of the Rhythmic and Harmonic Sections by NMF: Does it help? (2009) (23)
OpenBliSSART: Design and evaluation of a research toolkit for Blind Source Separation in Audio Recognition Tasks (2011) (23)
Incremental acoustic valence recognition: an inter-corpus perspective on features, matching, and performance in a gating paradigm (2010) (23)
Investigating NMF speech enhancement for neural network based acoustic models (2014) (23)
Exploiting time-frequency patterns with LSTM-RNNs for low-bitrate audio restoration (2019) (23)
Enhancing Multilingual Recognition of Emotion in Speech by Language Identification (2016) (23)
Robust in-car spelling recognition - a tandem BLSTM-HMM approach (2009) (23)
Co-training succeeds in Computational Paralinguistics (2013) (23)
Speech, Emotion, Age, Language, Task, and Typicality: Trying to Disentangle Performance and Feature Relevance (2012) (23)
Deep Bidirectional Long Short-Term Memory Recurrent Neural Networks for Grapheme-to-Phoneme Conversion Utilizing Complex Many-to-Many Alignments (2016) (23)
Recognition of interest in human conversational speech (2006) (23)
The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts (2022) (23)
Stacked denoising autoencoders for sentiment analysis: a review (2017) (23)
Exploring Nonnegative Matrix Factorization for Audio Classification: Application to Speaker Recognition (2012) (22)
Audio, Speech, Language, & Signal Processing for COVID-19: A Comprehensive Overview (2020) (22)
Five Crucial Challenges in Digital Health (2020) (22)
Active learning by label uncertainty for acoustic emotion recognition (2013) (22)
HEAR: Holistic Evaluation of Audio Representations (2022) (22)
Deep speaker conditioning for speech emotion recognition (2021) (22)
Deep Unsupervised Representation Learning for Abnormal Heart Sound Classification (2018) (22)
eXplainable Cooperative Machine Learning with NOVA (2020) (22)
THE UP SYSTEM FOR THE 2016 DCASE CHALLENGE USING DEEP RECURRENT NEURAL NETWORK AND MULTISCALE KERNEL SUBSPACE LEARNING (2016) (22)
Synthesized speech for model training in cross-corpus recognition of human emotion (2012) (22)
Audio self-supervised learning: A survey (2022) (22)
Multi-modal Active Learning From Human Data: A Deep Reinforcement Learning Approach (2019) (22)
Automatic Emotion Recognition by the Speech Signal (2002) (22)
EmoBed: Strengthening Monomodal Emotion Recognition via Training with Crossmodal Emotion Embeddings (2019) (21)
A demonstration of audiovisual sensitive artificial listeners (2009) (21)
Automatic Transcription of Recorded Music (2012) (21)
Feature Selection and Stacking for Robust Discrimination of Speech, Monophonic Singing, and Polyphonic Music (2005) (21)
the Munich open Speech and Music Interpretation by Large Space Extraction toolkit (2010) (21)
Channel mapping using bidirectional long short-term memory for dereverberation in hands-free voice controlled devices (2014) (21)
A machine learning based system for the automatic evaluation of aphasia speech (2017) (21)
Using multimodal interaction to navigate in arbitrary virtual VRML worlds (2001) (21)
Balancing spoken content adaptation and unit length in the recognition of emotion and interest (2008) (21)
A Tandem BLSTM-DBN Architecture for Keyword Spotting with Enhanced Context Modeling (2009) (21)
Noise robust ASR in reverberated multisource environments applying convolutive NMF and Long Short-Term Memory (2013) (21)
Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge (2014) (21)
Automatically Estimating Emotion in Music with Deep Long-Short Term Memory Recurrent Neural Networks (2015) (21)
Towards a standard set of acoustic features for the processing of emotion in speech. (2010) (21)
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition (2021) (21)
MuSe-Toolbox: The Multimodal Sentiment Analysis Continuous Annotation Fusion and Discrete Class Transformation Toolbox (2021) (21)
THE IMPACT OF F0 EXTRACTION ERRORS ON THE CLASSIFICATION OF PROMINENCE AND EMOTION (2007) (21)
Recognition of spontaneous conversational speech using long short-term memory phoneme predictions (2010) (21)
On the Influence of Phonetic Content Variation for Acoustic Emotion Recognition (2008) (20)
The Munich Feature Enhancement Approach to the 2013 CHiME Challenge Using BLSTM Recurrent Neural Networks (2013) (20)
YouTube Movie Reviews: In, Cross, and Open-domain Sentiment Analysis in an Audiovisual Context (2013) (20)
Can Deep Generative Audio be Emotional? Towards an Approach for Personalised Emotional Audio Generation (2019) (20)
The Perception and Analysis of the Likeability and Human Likeness of Synthesized Speech (2018) (20)
Affective Image Content Analysis: Two Decades Review and New Perspectives (2021) (20)
Can Machine Learning Assist Locating the Excitation of Snore Sound? A Review (2020) (20)
Artificial Intelligence Internet of Things for the Elderly: From Assisted Living to Health-Care Monitoring (2021) (20)
MAPTRAITS 2014 - The First Audio/Visual Mapping Personality Traits Challenge - An Introduction: Perceived Personality and Social Dimensions (2014) (20)
Hidden Markov model-based speech emotion recognition (2003) (20)
Is Deception Emotional? An Emotion-Driven Predictive Approach (2016) (20)
Shared acoustic codes underlie emotional communication in music and speech—Evidence from deep transfer learning (2017) (20)
Convolutive Non-Negative Sparse Coding and New Features for Speech Overlap Handling in Speaker Diarization (2012) (20)
Enhancing LSTM RNN-Based Speech Overlap Detection by Artificially Mixed Data (2017) (19)
LiRA: Learning Visual Speech Representations from Audio through Self-supervision (2021) (19)
Robust Acoustic Speech Emotion Recognition by Ensembles of Classifiers (2005) (19)
Distributing Recognition in Computational Paralinguistics (2014) (19)
Multi-Modal Non-Prototypical Music Mood Analysis in Continuous Space: Reliability and Performances (2011) (19)
An Online Robot Collision Detection and Identification Scheme by Supervised Learning and Bayesian Decision Theory (2021) (19)
Robust vocabulary independent keyword spotting with graphical models (2009) (19)
Does She Speak RTT? Towards an Earlier Identification of Rett Syndrome Through Intelligent Pre-Linguistic Vocalisation Analysis (2016) (19)
Generating and Protecting Against Adversarial Attacks for Deep Speech-Based Emotion Recognition Models (2020) (19)
CINEMO - A French Spoken Language Resource for Complex Emotions: Facts and Baselines (2010) (19)
Dynamic Active Learning Based on Agreement and Applied to Emotion Recognition in Spoken Interactions (2015) (19)
Speaker, Noise, and Acoustic Space Adaptation for Emotion Recognition in the Automotive Environment (2011) (19)
The TUM+TUT+KUL approach to the CHiME challenge 2013: Multi-stream ASR exploiting BLSTM networks and sparse NMF (2013) (19)
Gait-based person identification by spectral, cepstral and energy-related audio features (2013) (19)
Novel Metrics of Speech Rhythm for the Assessment of Emotion (2012) (18)
The ASC-Inclusion Perceptual Serious Gaming Platform for Autistic Children (2019) (18)
Towards Conditional Adversarial Training for Predicting Emotions from Speech (2018) (18)
The Detection of Parkinson's Disease From Speech Using Voice Source Information (2021) (18)
Multi-stream LSTM-HMM decoding and histogram equalization for noise robust keyword spotting (2011) (18)
A Bag of Wavelet Features for Snore Sound Classification (2019) (18)
The TUM+TUT+KUL Approach to the 2nd CHiME Challenge: Multi-Stream ASR Exploiting BLSTM Networks and Sparse NMF (2013) (18)
State of Mind: Classification through Self-reported Affect and Word Use in Speech (2018) (18)
CAA-Net: Conditional Atrous CNNs With Attention for Explainable Device-Robust Acoustic Scene Classification (2020) (18)
The TUM Approach to the MediaEval Music Emotion Task Using Generic Affective Audio Features (2013) (18)
Implementing Gender-Dependent Vowel-Level Analysis for Boosting Speech-Based Depression Recognition (2017) (18)
Wearable Assistance for the Ballroom-Dance Hobbyist - Holistic Rhythm Analysis and Dance-Style Classification (2007) (18)
Computer Audition for Healthcare: Opportunities and Challenges (2020) (17)
Augmenting Generative Adversarial Networks for Speech Emotion Recognition (2020) (17)
An Agreement and Sparseness-based Learning Instance Selection and its Application to Subjective Speech Phenomena (2014) (17)
Towards measuring similarity between emotional corpora (2010) (17)
Chapter 8 Voice-enabled assistive robots for handling autism spectrum conditions : an examination of the role of prosody (2014) (17)
MAPTRAITS 2014: The First Audio/Visual Mapping Personality Traits Challenge (2014) (17)
Fitbeat: COVID-19 estimation based on wristband heart rate using a contrastive convolutional auto-encoder (2021) (17)
Biosensors and Internet of Things in smart healthcare applications: challenges and opportunities (2020) (17)
Effects of In-Car Noise-Conditions on the Recognition of Emotion within Speech (2007) (17)
The ICL-TUM-PASSAU Approach for the MediaEval 2015 "Affective Impact of Movies" Task (2015) (17)
A Two-Dimensional Framework of Multiple Kernel Subspace Learning for Recognizing Emotion in Speech (2017) (16)
“Are You Playing a Shooter Again?!” Deep Representation Learning for Audio-Based Video Game Genre Recognition (2020) (16)
Low Level Texture Features for Snore Sound Discrimination (2018) (16)
The Handbook of Multimodal-Multisensor Interfaces: Signal Processing, Architectures, and Detection of Emotion and Cognition - Volume 2 (2018) (16)
Implicit Fusion by Joint Audiovisual Training for Emotion Recognition in Mono Modality (2019) (16)
Spoken term detection with Connectionist Temporal Classification: A novel hybrid CTC-DBN decoder (2010) (16)
A Fusion of Deep Convolutional Generative Adversarial Networks and Sequence to Sequence Autoencoders for Acoustic Scene Classification (2018) (16)
Automatische Emotionserkennung aus sprachlicher und manueller Interaktion (2005) (16)
Video Based Online Behavior Detection Using Probabilistic Multi Stream Fusion (2005) (16)
Poisson CNN: Convolutional neural networks for the solution of the Poisson equation on a Cartesian mesh (2019) (16)
Machine Listening for Heart Status Monitoring: Introducing and Benchmarking HSS—The Heart Sounds Shenzhen Corpus (2019) (16)
Snore sound recognition: On wavelets and classifiers from deep nets to kernels (2017) (16)
Computer Audition for Fighting the SARS-CoV-2 Corona Crisis—Introducing the Multitask Speech Corpus for COVID-19 (2021) (16)
Deep recurrent music writer: Memory-enhanced variational autoencoder-based musical score composition and an objective measure (2017) (16)
Affective speaker state analysis in the presence of reverberation (2011) (16)
Perception of Paralinguistic Traits in Synthesized Voices (2017) (15)
End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks (2021) (15)
On rater reliability and agreement based dynamic active learning (2015) (15)
Tandem decoding of children's speech for keyword detection in a child-robot interaction scenario (2011) (15)
Music Information Retrieval: An Inspirational Guide to Transfer from Related Disciplines (2012) (15)
Real-Time Tracking of Speakers' Emotions, States, and Traits on Mobile Platforms (2016) (15)
Music Theoretic and Perception-based Features for Audio Key Determination (2012) (15)
Hierarchical neural networks and enhanced class posteriors for social signal classification (2013) (15)
The effect of personality trait, age, and gender on the performance of automatic speech valence recognition (2017) (15)
Voice Emotion Games: Language and Emotion in the Voice of Children with Autism Spectrum Conditio (2015) (15)
Ethics and Good Practice in Computational Paralinguistics (2022) (15)
Emotional Speech of Mentally and Physically Disabled Individuals: Introducing the EmotAsS Database and First Findings (2017) (15)
Vocalist Gender Recognition in Recorded Popular Music (2010) (15)
Multiple Camera Person Tracking in Multiple Layers Combining 2D and 3D Information (2008) (15)
Towards intuitive speech interaction by the integration of emotional aspects (2002) (15)
Dimensionality reduction for speech emotion features by multiscale kernels (2015) (15)
Ten Recent Trends in Computational Paralinguistics (2011) (15)
Recognition of Echolalic Autistic Child Vocalisations Utilising Convolutional Recurrent Neural Networks (2018) (15)
Deep Architecture Enhancing Robustness to Noise, Adversarial Attacks, and Cross-corpus Setting for Speech Emotion Recognition (2020) (15)
DeepSpectrumLite: A Power-Efficient Transfer Learning Framework for Embedded Speech and Audio Processing From Decentralized Data (2021) (15)
Acoustic Geo-Sensing: Recognising cyclists' route, route direction, and route progress from cell-phone audio (2013) (15)
The Perception of Vocal Traits in Synthesized Voices: Age, Gender, and Human Likeness (2018) (15)
Recent Advances in Computer Audition for Diagnosing COVID-19: An Overview (2020) (15)
Semantic Speech Tagging: Towards Combined Analysis of Speaker Traits (2011) (14)
Prediction on Mechanical Properties of Non-Equiatomic High-Entropy Alloy by Atomistic Simulation and Machine Learning (2020) (14)
Towards automatic airborne pollen monitoring: From commercial devices to operational by mitigating class-imbalance in a deep learning approach. (2021) (14)
A Novel Attention-Based Gated Recurrent Unit and its Efficacy in Speech Emotion Recognition (2021) (14)
Typical vs. atypical: Combining auditory Gestalt perception and acoustic analysis of early vocalisations in Rett syndrome. (2018) (14)
Representation transfer learning from deep end-to-end speech recognition networks for the classification of health states from speech (2021) (14)
Active Learning for Bird Sounds Classification (2017) (14)
Score-Informed Leading Voice Separation from Monaural Audio (2012) (14)
On Many-to-Many Mapping Between Concordance Correlation Coefficient and Mean Square Error (2019) (14)
Acoustic-Linguistic Recognition of Interest in Speech with Bottleneck-BLSTM Nets (2011) (14)
Towards Cross-lingual Automatic Diagnosis of Autism Spectrum Condition in Children's Voices (2016) (14)
Teaching Machines on Snoring: A Benchmark on Computer Audition for Snore Sound Excitation Localisation (2018) (14)
Towards distributed recognition of emotion from speech (2012) (14)
Feature Frame Stacking in RNN-Based Tandem ASR Systems - Learned vs. Predefined Context (2011) (14)
A novel bottleneck-BLSTM front-end for feature-level context modeling in conversational speech recognition (2011) (14)
Speech Communication and Multimodal Interfaces (2006) (14)
A Paralinguistic Approach To Speaker Diarisation: Using Age, Gender, Voice Likability and Personality Traits (2017) (14)
Word Accent and Emotion (2010) (14)
Proceedings of the 3rd ACM international workshop on Audio/visual emotion challenge (2013) (13)
A Broadcast News Corpus for Evaluation and Tuning of German LVCSR Systems (2014) (13)
Self-attention transfer networks for speech emotion recognition (2021) (13)
Real-world automatic continuous affect recognition from audiovisual signals (2019) (13)
A Real-Time Speech Enhancement Framework in Noisy and Reverberated Acoustic Scenarios (2013) (13)
Face reading from speech - predicting facial action units from audio cues (2015) (13)
Asynchronous and Event-Based Fusion Systems for Affect Recognition on Naturalistic Data in Comparison to Conventional Approaches (2018) (13)
The ACM Multimedia 2022 Computational Paralinguistics Challenge: Vocalisations, Stuttering, Activity, & Mosquitoes (2022) (13)
Semi-autonomous data enrichment based on cross-task labelling of missing targets for holistic speech analysis (2016) (13)
Audiovisual Analysis for Recognising Frustration during Game-Play: Introducing the Multimodal Game Frustration Database (2019) (13)
Combining monaural source separation with Long Short-Term Memory for increased robustness in vocalist gender recognition (2011) (13)
Towards cross-modal pre-training and learning tempo-spatial characteristics for audio recognition with convolutional and recurrent neural networks (2020) (13)
Scaling Speech Enhancement in Unseen Environments with Noise Embeddings (2018) (13)
Using Speech to Predict Sequentially Measured Cortisol Levels During a Trier Social Stress Test (2019) (13)
Deep Recurrent Neural Networks for Emotion Recognition in Speech (2018) (13)
End-2-End COVID-19 Detection from Breath & Cough Audio (2021) (13)
Musical Signal Type Discrimination based on Large Open Feature Sets (2006) (13)
How Did You like 2017? Detection of Language Markers of Depression and Narcissism in Personal Narratives (2018) (13)
On-line Driver Distraction Detection using Long Short-Term Memory (2011) (13)
Multimodal Sentiment Analysis in the Wild: Ethical considerations on Data Collection, Annotation, and Exploitation (2016) (13)
Real-time robust recognition of speakers' emotions and characteristics on mobile platforms (2015) (13)
Poisson CNN: Convolutional Neural Networks for the Solution of the Poisson Equation with Varying Meshes and Dirichlet Boundary Conditions (2019) (13)
Learning Higher Representations from Pre-Trained Deep Models with Data Augmentation for the COMPARE 2020 Challenge Mask Task (2020) (12)
Emotion recognition in the manual interaction with graphical user interfaces (2004) (12)
Applying Cooperative Machine Learning to Speed Up the Annotation of Social Signals in Large Multi-modal Corpora (2018) (12)
Deep neural networks for anger detection from real life speech data (2017) (12)
Evaluating the COVID-19 Identification ResNet (CIdeR) on the INTERSPEECH COVID-19 From Audio Challenges (2021) (12)
Adventitious Respiratory Classification Using Attentive Residual Neural Networks (2020) (12)
Multimodal Affect Recognition for Naturalistic Human-Computer and Human-Robot Interactions (2015) (12)
Bags in Bag: Generating Context-Aware Bags for Tracking Emotions from Speech (2018) (12)
Elements of an EmotionML 1.0, W3C Incubator Group Report, 20 Nov. 2008 (2008) (12)
Speech Emotion Recognition Exploiting Acoustic and Linguistic Information Sources (2005) (12)
EAT -: The ICMI 2018 Eating Analysis and Tracking Challenge (2018) (12)
Late fusion of individual engines for improved recognition of negative emotion in speech - learning vs. democratic vote (2010) (12)
Synthesising 3D Facial Motion from “In-the-Wild” Speech (2019) (12)
CovNet: A Transfer Learning Framework for Automatic COVID-19 Detection From Crowd-Sourced Cough Sounds (2022) (12)
Analysing communication requirements for crowd sourced backend generation of HD Maps used in automated driving (2018) (12)
The Perception of Emotions in Noisified Nonsense Speech (2017) (12)
Automated Classification of Airborne Pollen using Neural Networks (2019) (12)
Personalized Federated Deep Learning for Pain Estimation From Face Images (2021) (12)
The ACII 2022 Affective Vocal Bursts Workshop & Competition (2022) (12)
AVEC'19: Audio/Visual Emotion Challenge and Workshop (2019) (12)
Multimodal music retrieval for large databases (2004) (12)
MuSe 2022 Challenge: Multimodal Humour, Emotional Reactions, and Stress (2022) (11)
MEC 2017: Multimodal Emotion Recognition Challenge (2018) (11)
An Evaluation of Speech-Based Recognition of Emotional and Physiological Markers of Stress (2021) (11)
Audiovisual vocal outburst classification in noisy acoustic conditions (2012) (11)
Domain Invariant Feature Learning for Speaker-Independent Speech Emotion Recognition (2022) (11)
Likability of human voices: A feature analysis and a neural network regression approach to automatic likability estimation (2013) (11)
Prosodic , Spectral or Voice Quality ? Feature Type Relevance for the Discrimination of Emotion Pairs (2008) (11)
Speech recognition in noisy environments using a switching linear dynamic model for feature enhancement (2008) (11)
Learning Audio Sequence Representations for Acoustic Event Classification (2017) (11)
Emotion Recognition in Speech with Latent Discriminative Representations Learning (2018) (11)
Uncertainty-Aware Machine Support for Paper Reviewing on the Interspeech 2019 Submission Corpus (2020) (11)
The TUM system for the REVERB Challenge: Recognition of Reverberated Speech using Multi-Channel Correlation Shaping Dereverberation and BLSTM Recurrent Neural Networks (2014) (11)
Group-level Speech Emotion Recognition Utilising Deep Spectrum Features (2020) (11)
Cough-Based COVID-19 Detection with Contextual Attention Convolutional Neural Networks and Gender Information (2021) (11)
What is my Dog Trying to Tell Me? the Automatic Recognition of the Context and Perceived Emotion of Dog Barks (2018) (11)
Context Modelling Using Hierarchical Attention Networks for Sentiment and Self-assessed Emotion Detection in Spoken Narratives (2019) (11)
The acoustics of eye contact: detecting visual attention from conversational audio cues (2013) (11)
Emotion Intensity and its Control for Emotional Voice Conversion (2022) (11)
The Inclusion of Gamification Solutions to Enhance User Enjoyment on Crowdsourcing Platforms (2018) (11)
A Combined LSTM-RNN - HMM - Approach for Meeting Event Segmentation and Recognition (2006) (11)
Affective neural networks and cognitive learning systems for big data analysis (2014) (11)
Hybrid Network Feature Extraction for Depression Assessment from Speech (2020) (11)
Does affect affect automatic recognition of children2s speech? (2008) (11)
X-AWARE: ConteXt-AWARE Human-Environment Attention Fusion for Driver Gaze Prediction in the Wild (2020) (11)
Multi-instance Learning for Bipolar Disorder Diagnosis using Weakly Labelled Speech Data (2019) (11)
Emotion-augmented machine learning: Overview of an emerging domain (2017) (11)
MuSe 2020 - The First International Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop (2020) (11)
Enhancing Transferability of Black-Box Adversarial Attacks via Lifelong Learning for Speech Emotion Recognition Models (2020) (11)
Speech-Based Non-Prototypical Affect Recognition for Child-Robot Interaction in Reverberated Environments (2011) (10)
AVEC 2014 – The Three Dimensional Affect and Depression Challenge (2014) (10)
Active learning for dimensional speech emotion recognition (2013) (10)
“You sound ill, take the day off”: Automatic recognition of speech affected by upper respiratory tract infection (2017) (10)
Integrating noise estimation and factorization-based speech separation: A novel hybrid approach (2013) (10)
Be at Odds? Deep and Hierarchical Neural Networks for Classification and Regression of Conflict in Speech (2015) (10)
Audio-based Recognition of Bipolar Disorder Utilising Capsule Networks (2019) (10)
Sincerity and Deception in Speech: Two Sides of the Same Coin? A Transfer- and Multi-Task Learning Perspective (2016) (10)
[Acoustic information in snoring noises]. (2017) (10)
An emotional modulation model as signature for the identification of children developmental disorders (2018) (10)
Emotion Recognition in Public Speaking Scenarios Utilising An LSTM-RNN Approach with Attention (2021) (10)
Exploring Perception Uncertainty for Emotion Recognition in Dyadic Conversation and Music Listening (2020) (10)
Robust feature extraction for automatic recognition of vibrato singing in recorded polyphonic music (2012) (10)
A hybrid music retrieval system using belief networks to integrate multimodal queries and contextual knowledge (2003) (10)
Speech Analysis in the Big Data Era (2015) (10)
Learning New Acoustic Events in an HMM-Based System Using MAP Adaptation (2011) (10)
Weakly Supervised One-Shot Detection with Attention Siamese Networks (2018) (10)
Summary of MuSe 2020: Multimodal Sentiment Analysis, Emotion-target Engagement and Trustworthiness Detection in Real-life Media (2020) (10)
Summary for AVEC 2017: Real-life Depression and Affect Challenge and Workshop (2017) (10)
Audio onset detection: A wavelet packet based approach with recurrent neural networks (2014) (10)
A Generic Human–Machine Annotation Framework Based on Dynamic Cooperative Learning (2020) (10)
An array of physical sensors and an adaptive regression strategy for emotion recognition in a noisy scenario (2017) (10)
HMM-based music retrieval using stereophonic feature information and framelength adaptation (2003) (10)
Using linguistic information to detect overlapping speech (2013) (10)
The TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search Task (2012) (10)
Predicting Biological Signals from Speech: Introducing a Novel Multimodal Dataset and Results (2019) (10)
Workshop summary for the 3rd international audio/visual emotion challenge and workshop (AVEC'13) (2013) (10)
Discrimination of speech and monophonic singing in continuous audio streams applying multi-layer support vector machines (2004) (9)
"Did you laugh enough today?" - Deep Neural Networks for Mobile and Wearable Laughter Trackers (2017) (9)
Acoustic Emotion Recognition in Car Environment Using a 3D Emotion Space Approach (2007) (9)
Manual versus Automated: The Challenging Routine of Infant Vocalisation Segmentation in Home Videos to Study Neuro(mal)development (2016) (9)
The Handbook of Multimodal-Multisensor Interfaces, Volume 2: Signal Processing, Architectures, and Detection of Emotion and Cognition (2018) (9)
Speech Emotion Recognition Using Semantic Information (2021) (9)
Modelling Sample Informativeness for Deep Affective Computing (2019) (9)
Passive monitoring and geo-based prediction of mobile network vehicle-to-server communication (2018) (9)
Segmentation and Recognition of Meeting Events using a Two-Layered HMM and a Combined MLP-HMM Approach (2006) (9)
The DiCOVA 2021 Challenge - An Encoder-Decoder Approach for COVID-19 Recognition from Coughing Audio (2021) (9)
Emotion and Themes Recognition in Music Utilising Convolutional and Recurrent Neural Networks (2019) (9)
Tunable Sensitivity to Large Errors in Neural Network Training (2016) (9)
Applying Bayesian belief networks in approximate string matching for robust keyword-based retrieval (2004) (9)
Improving Keyword Spotting with a Tandem BLSTM-DBN Architecture (2009) (9)
Intelligent systems’ Holistic Evolving Analysis of Real-life Universal speaker characteristics (2014) (9)
Submotions for Hidden Markov Model Based Dynamic Facial Action Recognition (2006) (9)
Violent Scenes Detection with Large, Brute-forced Acoustic and Visual Feature Sets (2012) (9)
Acquisition of Affect (2017) (9)
Synchronization in Interpersonal Speech (2019) (9)
From Speech to Facial Activity: Towards Cross-modal Sequence-to-Sequence Attention Networks (2019) (9)
Akustische Informationen von Schnarchgeräuschen (2017) (9)
A new technique for adjusting distraction moments in multitasking non-field usability tests (2002) (9)
The Role of Task and Acoustic Similarity in Audio Transfer Learning: Insights from the Speech Emotion Recognition Case (2021) (9)
Face mask recognition from audio: The MASC database and an overview on the mask challenge (2021) (9)
Deep learning for multisensorial and multimodal interaction (2018) (9)
Squeeze for Sneeze: Compact Neural Networks for Cold and Flu Recognition (2020) (9)
Fast Single-Class Classification and the Principle of Logit Separation (2017) (9)
Influence of Low-Level Features Extracted from Rhythmic and Harmonic Sections on Music Genre Classification (2013) (9)
Robust Multi-stream Keyword and Non-linguistic Vocalization Detection for Computationally Intelligent Virtual Agents (2011) (9)
A Diplomatic Edition of Il Lauro Secco: Ground Truth for OMR of White Mensural Notation (2019) (9)
On Laughter and Speech-Laugh, Based on Observations of Child-Robot Interaction (2019) (9)
The First Audio/Visual Emotion Challenge and Workshop - An Introduction (2011) (9)
A Deep Learning Approach for Location Independent Throughput Prediction (2019) (9)
HEAR 2021: Holistic Evaluation of Audio Representations (2022) (9)
Stimulation of psychological listener experiences by semi-automatically composed electroacoustic environments (2017) (8)
Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets (2016) (8)
Improving generalisation and robustness of acoustic affect recognition (2012) (8)
Identifying Emotions in Opera Singing: Implications of Adverse Acoustic Conditions (2018) (8)
SVTS: Scalable Video-to-Speech Synthesis (2022) (8)
Robot-Based Intervention for Children with Autism Spectrum Disorder: A Systematic Literature Review (2021) (8)
Feature selection in multimodal continuous emotion prediction (2017) (8)
Evidence of emotion-antecedent appraisal checks in electroencephalography and facial electromyography (2018) (8)
The Role of Prosody in Affective Speech, Linguistic Insights, Studies in Language and Communication (2009) (8)
I see it in your eyes: Training the shallowest-possible CNN to recognise emotions and pain from muted web-assisted in-the-wild video-chats in real-time (2020) (8)
The Effect of Narrow-Band Transmission on Recognition of Paralinguistic Information From Human Vocalizations (2016) (8)
Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge (2017) (8)
Experimental evaluation of user errors at the skill-based level in an automative environment (2002) (8)
Off-line refinement of audio-to-score alignment by observation template adaptation (2013) (8)
Large-scale Data Collection and Analysis via a Gamified Intelligent Crowdsourcing Platform (2019) (8)
Capturing dynamics of post-earnings-announcement drift using a genetic algorithm-optimized XGBoost (2021) (8)
Cognitive and Emotional Information Processing for Human–Machine Interaction (2012) (8)
On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era (2021) (8)
ConcealNet: An End-to-end Neural Network for Packet Loss Concealment in Deep Speech Emotion Recognition (2020) (8)
MARVEL: Multimodal Extreme Scale Data Analytics for Smart Cities Environments (2021) (8)
Internet of emotional people: Towards continual affective computing cross cultures via audiovisual signals (2021) (8)
Ordinal Learning for Emotion Recognition in Customer Service Calls (2020) (8)
Considerations for a More Ethical Approach to Data in AI: On Data Representation and Infrastructure (2020) (8)
An Evolutionary-based Generative Approach for Audio Data Augmentation (2020) (8)
End-to-end Audio Classification with Small Datasets – Making It Work (2019) (8)
Audio chord labeling by musiological modeling and beat-synchronization (2009) (8)
I see it in your eyes: Training the shallowest-possible CNN to recognise emotions and pain from muted web-assisted in-the-wild video-chats in real-time (2020) (8)
Developing a digital game to support cultural learning amongst immigrants (2013) (8)
A Cnn-Gru Approach to Capture Time-Frequency Pattern Interdependence for Snore Sound Classification (2018) (7)
Deep attention-based neural networks for explainable heart sound classification (2022) (7)
Time-series Clustering with Jointly Learning Deep Representations, Clusters and Temporal Boundaries (2019) (7)
Long short-term memory networks for noise robust speech recognition (2010) (7)
An Enhanced Adversarial Network with Combined Latent Features for Spatio-Temporal Facial Affect Estimation in the Wild (2021) (7)
Introduction: scope, trends, and paradigm shift in the field of computer interfaces (2017) (7)
A Multimodal Listener Behaviour Driven by Audio Input (2010) (7)
Probabilistic speech feature extraction with context-sensitive Bottleneck neural networks (2014) (7)
Automatic Processing of Clinical Aphasia Data collected during Diagnosis Sessions: Challenges and Prospects (2018) (7)
Detecting problems in spoken child-computer interaction (2008) (7)
Switching Linear Dynamic Models for Noise Robust In-Car Speech Recognition (2008) (7)
Enhancing Spontaneous Speech Recognition with BLSTM Features (2011) (7)
Switching Linear Dynamic Models for Recognition of Emotionally Colored and Noisy Speech (2010) (7)
Toward Silent Paralinguistics: Speech-to-EMG - Retrieving Articulatory Muscle Activity from Speech (2020) (7)
Combining Bottleneck-BLSTM and Semi-Supervised Sparse NMF for Recognition of Conversational Speech in Highly Instationary Noise (2012) (7)
Trustability-Based Dynamic Active Learning for Crowdsourced Labelling of Emotional Audio Data (2018) (7)
Navigation in virtual worlds via natural speech (2001) (7)
Go-CaRD - Generic, Optical Car Part Recognition and Detection: Collection, Insights, and Applications (2020) (7)
Speech overlap detection using convolutive non-negative sparse coding: New improvements and insights (2012) (7)
A hierarchical approach for visual suspicious behavior detection in aircrafts (2009) (7)
Fully Automatic Audiovisual Emotion Recognition: Voice, Words, and the Face (2012) (7)
Evaluation of the Pain Level from Speech: Introducing a Novel Pain Database and Benchmarks (2018) (7)
Route and Stopping Intent Prediction at Intersections From Car Fleet Data (2016) (7)
Speech in Minimal Invasive Surgery - Towards an Affective Language Resource of Real-life Medical Operations (2010) (7)
Latent-Based Adversarial Neural Networks for Facial Affect Estimations (2020) (7)
Enhanced Robustness in Speech Emotion Recognition Combining Acoustic and Semantic Analyses (2004) (7)
Deep Learning for Mobile Mental Health: Challenges and recent advances (2021) (7)
An Evaluation of the Effect of Anxiety on Speech - Computational Prediction of Anxiety from Sustained Vowels (2020) (7)
Cross-Domain Classification of Drowsiness in Speech: The Case of Alcohol Intoxication and Sleep Deprivation (2017) (7)
AI Hears Your Health: Computer Audition for Health Monitoring (2021) (7)
Pre-training in Deep Reinforcement Learning for Automatic Speech Recognition (2019) (6)
Data Augmentation and Deep Learning for Hate Speech Detection (2018) (6)
harAGE: A Novel Multimodal Smartwatch-based Dataset for Human Activity Recognition (2021) (6)
The Handbook of Multimodal-Multisensor Interfaces: Language Processing, Software, Commercialization, and Emerging Directions - Volume 3 (2019) (6)
Can Affective Computing Save Lives? Meet Mobile Health (2017) (6)
Real-Time Activity Detection in a Multi-Talker Reverberated Environment (2012) (6)
Computational Analysis of Vocal Expression of Affect: Trends and Challenges (2017) (6)
Summary for AVEC 2018: Bipolar Disorder and Cross-Cultural Affect Recognition (2018) (6)
Automatic Detection of Major Depressive Disorder via a Bag-of-Behaviour-Words Approach (2019) (6)
Autonomous Emotion Learning in Speech: A View of Zero-Shot Speech Emotion Recognition (2019) (6)
audEERING's approach to the One-Minute-Gradual Emotion Challenge (2018) (6)
Towards automation of usability studies (2002) (6)
Resolving partial occlusions in crowded environments utilizing range data and video cameras (2009) (6)
Real Time Person Tracking and Behavior Interpretation in Multi Camera Scenarios Applying Homography and Coupled HMMs (2010) (6)
DeepCoder: Semi-parametric Variational Autoencoders for Facial Action Unit Intensity Estimation (2017) (6)
Computational Emotion Analysis From Images: Recent Advances and Future Directions (2021) (6)
Outer Product-Based Fusion of Smartwatch Sensor Data for Human Activity Recognition (2022) (6)
Beat-Synchronous Data-Driven Automatic Chord Labeling (2008) (6)
Continuous-Time Audiovisual Fusion with Recurrence vs. Attention for In-The-Wild Affect Recognition (2022) (6)
Real-Time Speech Recognition in a Multi-talker Reverberated Acoustic Scenario (2011) (6)
The Perceived Emotion of Isolated Synthetic Audio: The EmoSynth Dataset and Results (2018) (6)
A Novel Policy for Pre-trained Deep Reinforcement Learning for Speech Emotion Recognition (2021) (6)
Deep Reinforcement Learning with Pre-training for Time-efficient Training of Automatic Speech Recognition (2020) (6)
Speaker trait characterization in web videos: Uniting speech, language, and facial features (2013) (6)
Towards Silent Paralinguistics: Deriving Speaking Mode and Speaker ID from Electromyographic Signals (2020) (6)
Towards Sonification in Multimodal and User-friendlyExplainable Artificial Intelligence (2021) (6)
Deep Wavelets for Heart Sound Classification (2019) (6)
Hierarchical Attention-Based Temporal Convolutional Networks for Eeg-Based Emotion Recognition (2021) (6)
Modelling User Affect and Sentiment in Intelligent User Interfaces: A Tutorial Overview (2015) (6)
Recognising Covid-19 from Coughing Using Ensembles of SVMs and LSTMs with Handcrafted and Deep Audio Features (2021) (6)
Seeking the SuperStar: Automatic assessment of perceived singing quality (2017) (6)
Agreement-based Dynamic Active Learning with Least and Medium Certainty Query Strategy (2015) (6)
Learning Multimodal Representations for Drowsiness Detection (2022) (6)
MuSe 2021 Challenge: Multimodal Emotion, Sentiment, Physiological-Emotion, and Stress Detection (2021) (6)
The Challenge of Automatic Eating Behaviour Analysis and Tracking (2019) (6)
N-HANS: A neural network-based toolkit for in-the-wild audio enhancement (2021) (6)
The SEILS Dataset: Symbolically Encoded Scores in Modern-Early Notation for Computational Musicology (2017) (6)
The ICSTM+TUM+UP Approach to the 3rd CHIME Challenge: Single-Channel LSTM Speech Enhancement with Multi-Channel Correlation Shaping Dereverberation and LSTM Language Models (2015) (6)
Audio watermarking based on empirical mode decomposition and beat detection (2016) (5)
High-Fidelity Audio Generation and Representation Learning With Guided Adversarial Autoencoder (2020) (5)
Boosting multi-modal camera selection with semantic features (2009) (5)
Frustration recognition from speech during game interaction using wide residual networks (2021) (5)
Multimodal multimodel emotion analysis as linked data (2017) (5)
Speaking Corona? Human and Machine Recognition of COVID-19 from Voice (2021) (5)
Computational Assessment of Interest in Speech—Facing the Real-Life Challenge (2011) (5)
Social and Affective Robotics Tutorial (2016) (5)
Interacting with Emotional Virtual Agents (2011) (5)
Redundancy Reduction Twins Network: A Training framework for Multi-output Emotion Regression (2022) (5)
Aspekte effizienten Usability Engineerings (Aspects of Efficient Usability Engineering) (2002) (5)
A Deep Adaptation Network for Speech Enhancement: Combining a Relativistic Discriminator With Multi-Kernel Maximum Mean Discrepancy (2021) (5)
Emotion and mental state recognition from speech (2012) (5)
Deep Learning Our Everyday Emotions (2015) (5)
Deep Convolutional Recurrent Neural Network for Rare Acoustic Event Detection (2018) (5)
Evolving Learning for Analysing Mood-Related Infant Vocalisation (2018) (5)
A Personalised Approach to Audiovisual Humour Recognition and its Individual-level Fairness (2022) (5)
Sound and the City: Current Perspectives on Acoustic Geo-Sensing in Urban Environment (2019) (5)
Self Supervised Adversarial Domain Adaptation for Cross-Corpus and Cross-Language Speech Emotion Recognition (2022) (5)
Using Computer Intelligence for Depression Diagnosis and Crowdsourcing (2016) (5)
Cross-corpus acoustic emotion recognition: Variances and strategies (Extended abstract) (2015) (5)
Spotting Social Signals in Conversational Speech over IP: A Deep Learning Perspective (2017) (5)
Fatigue Prediction in Outdoor Running Conditions using Audio Data (2022) (5)
Deep Attentive End-to-End Continuous Breath Sensing from Speech (2020) (5)
Tendencies regarding the effect of emotional intensity in inter corpus phoneme-level speech emotion modelling (2016) (5)
Three recent trends in Paralinguistics on the way to omniscient machine intelligence (2018) (5)
Teaching Machines to Know Your Depressive State: On Physical Activity in Health and Major Depressive Disorder (2019) (5)
Neural Networks and Learning Systems Come Together (2012) (5)
A summary of the ComParE COVID-19 challenges (2022) (5)
Computer Audition for Continuous Rainforest Occupancy Monitoring: The Case of Bornean Gibbons' Call Detection (2020) (5)
Customized ViNeRS Method for Video Neuro-Advertising of Green Housing (2020) (5)
Detecting Vocal Irony (2017) (5)
More Than Fifty Years of Speech Processing – The Rise of Computational Paralinguistics and Ethical Demands (2014) (5)
Recognising Guitar Effects - Which Acoustic Features Really Matter? (2017) (5)
Fine-tuning HMMS for nonverbal vocalizations in spontaneous speech: A multicorpus perspective (2012) (5)
Artificial intelligence to aid the detection of mood disorders (2020) (5)
Probing Speech Emotion Recognition Transformers for Linguistic Knowledge (2022) (5)
Language proficiency assessment of English L2 speakers based on joint analysis of prosody and native language (2016) (5)
3d gesture recognition applying long short-term memory and contextual knowledge in a CAVE (2010) (5)
Automatic multi-lingual arousal detection from voice applied to real product testing applications (2017) (5)
Humane Anthropomorphic Agents : the Quest for the Outcome Measure (2019) (5)
VCMNet: Weakly Supervised Learning for Automatic Infant Vocalisation Maturity Analysis (2019) (5)
One Day in Half an Hour: Music Thumbnailing Incorporating Harmony- and Rhythm Structure (2008) (5)
Can Appliances Understand the Behavior of Elderly Via Machine Learning? A Feasibility Study (2021) (5)
Speech control in surgery: A field analysis and strategies (2009) (5)
Music Thumbnailing Incorporating Harmony- and Rhythm Structure (2008) (5)
Will Affective Computing Emerge from Foundation Models and General AI? A First Evaluation on ChatGPT (2023) (5)
Noise Invariant Frame Selection: A Simple Method to Address the Background Noise Problem for Text-independent Speaker Verification (2018) (4)
A Comparison of AI-Based Throughput Prediction for Cellular Vehicle-To-Server Communication (2019) (4)
Automatic Analysis of Aesthetics: Human Beauty, Attractiveness, and Likability (2017) (4)
Empirical Mode Decomposition : A Data-Enrichment Perspective on Speech Emotion Recognition (2016) (4)
Dimensional and continuous analysis of emotions for multimedia applications: a tutorial overview (2012) (4)
Applying Bayes Markov chains for the detection of ATM related scenarios (2009) (4)
Improving Recognition of Speaker States and Traits by Cumulative Evidence: Intoxication, Sleepiness, Age and Gender (2012) (4)
The Filtering Effect of Face Masks in their Detection from Speech (2021) (4)
Continuous Monitoring of Emotions by a Multimodal Cooperative Sensor System (2015) (4)
The perception of emotional cues by children in artificial background noise (2020) (4)
DEMoS: an Italian emotional speech corpus (2019) (4)
Robust Federated Learning Against Adversarial Attacks for Speech Emotion Recognition (2022) (4)
Probabilistic asr feature extraction applying context-sensitive connectionist temporal classification networks (2013) (4)
VoiLA: An Online Intelligent Speech Analysis and Collection Platform (2018) (4)
Continuous Sleepiness , Baby Sounds & Orca Activity (2019) (4)
Temporal and Situational Context Modeling for Improved Dominance Recognition in Meetings (2012) (4)
Applications in Intelligent Music Analysis (2013) (4)
Unsupervised Representation Learning with Attention and Sequence to Sequence Autoencoders to Predict Sleepiness From Speech (2020) (4)
Editorial: IEEE Transactions on Affective Computing - Challenges and Chances (2017) (4)
Multimodal Affect Databases (2015) (4)
Multimodal user state and trait recognition: an overview (2018) (4)
COVID-19 Detection with a Novel Multi-Type Deep Fusion Method using Breathing and Coughing Information (2021) (4)
A football player rating system (2020) (4)
A football player rating system (2020) (4)
Humans Inside: Cooperative Big Multimedia Data Mining (2019) (4)
Sincerity in Acted Speech: Presenting the Sincere Apology Corpus and Results (2019) (4)
COVID-19 Biomarkers in Speech: On Source and Filter Components (2021) (4)
Hierarchical Component-attention Based Speaker Turn Embedding for Emotion Recognition (2020) (4)
Come and have an emotional workout with sensitive artificial listeners! (2011) (4)
Efficient Collection and Representation of Preverbal Data in Typical and Atypical Development (2020) (4)
Predictable Robots for Autistic Children—Variance in Robot Behaviour, Idiosyncrasies in Autistic Children’s Characteristics, and Child–Robot Engagement (2021) (4)
Laughter as a Controller in a Stress Buster Game (2020) (4)
Vision Based Online Multi-Stream Behavior Detection Applying Bayesian Networks (2005) (4)
Guided Generative Adversarial Neural Network for Representation Learning and High Fidelity Audio Generation using Fewer Labelled Audio Data (2020) (4)
Discrimination of Linguistic and Non-Linguistic Vocalizations in Spontaneous Speech: Intra- and Inter-Corpus Perspectives (2012) (4)
Tracking Authentic and In-the-wild Emotions Using Speech (2018) (4)
Conversational Agent as Trustworthy Autonomous System (Trust-CA) (Dagstuhl Seminar 21381) (2021) (4)
Example-based Explanations with Adversarial Attacks for Respiratory Sound Analysis (2022) (4)
Remote smartphone-based speech collection: acceptance and barriers in individuals with major depressive disorder (2021) (4)
A Novel Fusion of Attention and Sequence to Sequence Autoencoders to Predict Sleepiness From Speech (2020) (4)
Multi-Attentive Detection of the Spider Monkey Whinny in the (Actual) Wild (2021) (3)
Is Speech the New Blood? Recent Progress in AI-Based Disease Detection From Audio in a Nutshell (2022) (3)
Exploring Zero-Shot Emotion Recognition in Speech Using Semantic-Embedding Prototypes (2022) (3)
Compact Bilinear Deep Features For Environmental Sound Recognition (2018) (3)
[VOTE versus ACLTE: comparison of two snoring noise classifications using machine learning methods]. (2019) (3)
Multistage linguistic conditioning of convolutional layers for speech emotion recognition (2021) (3)
Towards a Common Linked Data Model for Sentiment and Emotion Analysis (3)
Prosodic and spectral features within segment-based acoustic modeling (2008) (3)
How Good Is Your Model 'Really'? On 'Wildness' of the In-the-Wild Speech-Based Affect Recognisers (2018) (3)
VoicePlay — An affective sports game operated by speech emotion recognition based on the component process model (2017) (3)
A closed-form solution to the graph total variation problem for continuous emotion profiling in noisy environment (2018) (3)
The Principle of Logit Separation (2018) (3)
Robust Speech Recognition for Human-Robot Interaction in Minimal Invasive Surgery (2008) (3)
Vocalisation Repertoire at the End of the First Year of Life: An Exploratory Comparison of Rett Syndrome and Typical Development (2022) (3)
Transformer-based CNNs: Mining Temporal Context Information for Multi-sound COVID-19 Diagnosis (2021) (3)
An Overview & Analysis of Sequence-to-Sequence Emotional Voice Conversion (2022) (3)
The effect of music in anxiety reduction: A psychological and physiological assessment (2020) (3)
The effect of music in anxiety reduction: A psychological and physiological assessment (2020) (3)
Preserving actual dynamic trend of emotion in dimensional speech emotion recognition (2012) (3)
Voice Analysis for Neurological Disorder Recognition–A Systematic Review and Perspective on Emerging Trends (2022) (3)
Supporting Multi Camera Tracking by Monocular Deformable Graph Tracking (2009) (3)
An Investigation of Cross-Cultural Semi-Supervised Learning for Continuous Affect Recognition (2020) (3)
MEDAS: an open-source platform as a service to help break the walls between medicine and informatics (2020) (3)
Integratives Konzept zur prototypischen Implementierung multimodaler Benutzerschnittstellen - Integrative rapid-prototyping for multimodal user interfaces (2002) (3)
A large-scale and PCR-referenced vocal audio dataset for COVID-19 (2022) (3)
Building autonomous sensitive artificial listeners (Extended abstract) (2015) (3)
A Survey on Client Throughput Prediction Algorithms in Wired and Wireless Networks (2021) (3)
Matching Monophonic Audio Clips to Polyphonic Recordings (2005) (3)
Emotion and Sentiment Analysis (2016) (3)
Dynamic Restrained Uncertainty Weighting Loss for Multitask Learning of Vocal Expression (2022) (3)
Automatic vocalisation-based detection of fragile X syndrome and Rett syndrome (2022) (3)
On the Influence of Alcohol Intoxication on Speaker Recognition (2014) (3)
N-HANS: Introducing the Augsburg Neuro-Holistic Audio-eNhancement System (2019) (3)
Big Data Multimedia Mining: Feature Extraction Facing Volume, Velocity, and Variety (2019) (3)
Ethical Awareness in Paralinguistics: A Taxonomy of Applications (2022) (3)
The Acoustic Dissection of Cough: Diving Into Machine Listening-based COVID-19 Analysis and Detection (2022) (3)
Dominance Detection in a Reverberated Acoustic Scenario (2012) (3)
Exploring the Importance of Individual Differences to the Automatic Estimation of Emotions Induced by Music (2015) (3)
Emotion Modelling via Speech Content and Prosody: In Computer Games and Elsewhere (2016) (3)
Towards intoxicated speech recognition (2017) (3)
Learning Multi-Resolution Representations for Acoustic Scene Classification via Neural Networks (2019) (3)
IDGEI 2014: 2nd international workshop on intelligent digital games for empowerment and inclusion (2014) (3)
An Improved StarGAN for Emotional Voice Conversion: Enhancing Voice Quality and Data Augmentation (2021) (3)
Snoring - An Acoustic Definition (2019) (3)
Robust Spelling and Digit Recognition in the Car: Switching Models and Their Like (2008) (3)
Exploring speaker enrolment for few-shot personalisation in emotional vocalisation prediction (2022) (3)
Intelligent user interfaces in digital games for empowerment and inclusion (2015) (3)
Normalise for Fairness: A Simple Normalisation Technique for Fairness in Regression Machine Learning Problems (2022) (3)
Time-Continuous Audiovisual Fusion with Recurrence vs Attention for In-The-Wild Affect Recognition (2022) (3)
Editorial Proc of The Second International Audio/Visual Emotion Challenge and Workshop - An Introduction (2012) (3)
Sparse, Hierarchical and Semi-Supervised Base Learning for Monaural Enhancement of Conversational Speech (2012) (3)
Adversarial-based neural networks for affect estimations in the wild (2020) (3)
Audio-based AI classifiers show no evidence of improved COVID-19 screening over simple symptoms checkers (2022) (3)
Accelerating Biomedical Signal Processing Using GPU: A Case Study of Snore Sound Feature Extraction (2017) (3)
Annotator Trustability-based Cooperative Learning Solutions for Intelligent Audio Analysis (2018) (3)
Exploring A New Method for Food Likability Rating Based on DT-CWT Theory (2018) (3)
Affective Computing and Intelligent Interaction: Fourth International Conference, ACII 2011, Memphis,TN, USA, October 9-12, 2011; Proceedings, Part II ... Vision, Pattern Recognition, and Graphics) (2011) (3)
Responsible and Representative Multimodal Data Acquisition and Analysis: On Auditability, Benchmarking, Confidence, Data-Reliance & Explainability (2019) (3)
A Physiologically-adapted Gold Standard for Arousal During a Stress Induced Scenario (2021) (2)
A Ranking-based Emotion Annotation Scheme and Real-life Speech Database (2012) (2)
Validity of machine learning in biology and medicine increased through collaborations across fields of expertise (2020) (2)
Deep Unsupervised Representation Learning for Audio-Based Medical Applications (2020) (2)
Advanced Man-Machine Interaction (2006) (2)
Towards Speech Robustness for Acoustic Scene Classification (2020) (2)
ASC-Inclusion a Virtual Environment Teaching Children with ASC to Understand and Express Emotions (2014) (2)
On Deep Speech Packet Loss Concealment: A Mini-Survey (2020) (2)
"I have vxxx bxx connexxxn!": Facing Packet Loss in Deep Speech Emotion Recognition (2020) (2)
Bias and privacy in AI's cough-based COVID-19 recognition – Authors' reply (2021) (2)
I Know How you Feel Now, and Here's why!: Demystifying Time-Continuous High Resolution Text-Based Affect Predictions in the Wild (2019) (2)
Self-Learning Acoustic Feature Generation and Selection for the Discrimination of Musical Signals (2006) (2)
Introduction to the Special Issue on Broadening the View on Speaker Analysis (2014) (2)
Unsupervised Graph-based Topic Modeling from Video Transcriptions (2021) (2)
Robust Speech Emotion Recognition Under Different Encoding Conditions (2019) (2)
Weakly Supervised One-Shot Detection with Attention Similarity Networks (2018) (2)
Editorial: Transactions on Affective Computing-Good Reasons for Joy and Excitement (2018) (2)
Nkululeko: A Tool For Rapid Speaker Characteristics Detection (2022) (2)
Intelligent Digital Games for Empowerment and Inclusion – An Introduction (2013) (2)
Deep End-to-End Representation Learning for Food Type Recognition from Speech (2018) (2)
Deformable Dilated Faster R-CNN for Universal Lesion Detection in CT Images (2021) (2)
Guest Editorial: Special Section on Naturalistic Affect Resources for System Building and Evaluation (2012) (2)
Evaluating the Role of Speech Technology in Medical Case Management (H. Patil and M. Kulshreshtha, eds.) (2014) (2)
VOTE versus ACLTE: Vergleich zweier Schnarchgeräuschklassifikationen mit Methoden des maschinellen Lernens (2019) (2)
IDGEI 2015: 3rd International Workshop on Intelligent Digital Games for Empowerment and Inclusion (2015) (2)
Patterns, Prototypes, Performance (2008) (2)
Analysis of loss functions for fast single-class classification (2019) (2)
At the Border of Acoustics and Linguistics : BagofAudioWords BagofAudioWords BagofAudioWords ofAudioWords ofAudioWords AudioWords AudioWords for the Recognition of Emotions in Speech (2016) (2)
Evaluating the Impact of Voice Activity Detection on Speech Emotion Recognition for Autistic Children (2022) (2)
A Discriminative Approach to Polyphonic Piano Note Transcription using Nonnegative Matrix Factorization (2013) (2)
FROM SPEECH : PUTTING ASR IN THE LOOP (2009) (2)
An Estimation of Online Video User Engagement from Features of Continuous Emotions (2021) (2)
MORE THAN FIFTY YEARS OF SPEECH AND LANGUAGE PROCESSING-THE RISE OF COMPUTATIONAL PARALINGUISTICS AND ETHICAL DEMANDS (2014) (2)
Onset Detection Exploiting Adaptive Linear Prediction Filtering in DWT Domain with Bidirectional Long Short-Term Memory Neural Networks (2013) (2)
A Real-Time Speech Enhancement Framework for Multi-party Meetings (2011) (2)
Multiscale kernel locally penalised discriminant analysis exemplified by emotion recognition in speech (2016) (2)
Coughing-Based Recognition of Covid-19 with Spatial Attentive ConvLSTM Recurrent Neural Networks (2021) (2)
The Perception of Emotion in the Singing Voice: The Understanding of Music Mood for Music Organisation (2017) (2)
Intelligent Signal Processing for Affective Computing [From the Guest Editors] (2021) (2)
What Affective Computing Reveals about Autistic Children's Facial Expressions of Joy or Fear (2018) (2)
deepSELF: An Open Source Deep Self End-to-End Learning Framework (2020) (2)
Personalised depression forecasting using mobile sensor data and ecological momentary assessment (2022) (2)
Automated Classification of Children's Linguistic versus Non-Linguistic Vocalisations (2018) (2)
Improving Exertion and Wellbeing Prediction in Outdoor Running Conditions using Audio-based Surface Recognition (2022) (2)
Introduction to the Special Issue on Next Generation Computational Paralinguistics (2012) (2)
Single-Channel Speech Separation with Auxiliary Speaker Embeddings (2019) (2)
Evaluating Misinterpretations during Human-Machine Communication in Automotive Environments (2002) (2)
Musical-Linguistic Annotations of Il Lauro Secco (2018) (2)
A Two-Layer Graphical Model for Combined Video Shot and Scene Boundary Detection (2006) (2)
Does my speech rock? automatic assessment of public speaking skills (2015) (2)
ERM4CT 2015: Workshop on Emotion Representations and Modelling for Companion Systems (2015) (2)
Fairness and underspecification in acoustic scene classification: The case for disaggregated evaluations (2021) (2)
Machine-Based Decoding of Paralinguistic Vocal Features (2018) (2)
Distinguishing between pre- and post-treatment in the speech of patients with chronic obstructive pulmonary disease (2022) (2)
Correction: Shared acoustic codes underlie emotional communication in music and speech—Evidence from deep transfer learning (2018) (2)
Dual Attention and Element Recalibration Networks for Automatic Depression Level Prediction (2022) (2)
Statistical Design and Analysis for Robust Machine Learning: A Case Study from COVID-19 (2022) (2)
Audio-Visual Gated-Sequenced Neural Networks for Affect Recognition (2022) (2)
Introduction to the special issue on sensing emotion and affect - Facing realism in speech processing (2011) (2)
Chain of Audio Processing (2013) (2)
Proceedings of the International Workshop on Emotion Representations and Modelling for Companion Technologies (2015) (2)
User Experience for Multi-Device Ecosystems: Challenges and Opportunities (2021) (2)
Responding to uncertainty in emotion recognition (2019) (2)
Latest Advances in Computational Speech Analysis for Mobile Sensing (2019) (2)
COLD Fusion: Calibrated and Ordinal Latent Distribution Fusion for Uncertainty-Aware Multimodal Emotion Recognition (2022) (2)
Performance Analysis of Unimodal and Multimodal Models in Valence-Based Empathy Recognition (2019) (2)
A Mini Review on Current Clinical and Research Findings for Children Suffering from COVID-19 (2020) (2)
GMs in On-Line Handwritten Whiteboard Note Recognition: The Influence of Implementation and Modeling (2009) (2)
MUSIC CLASSIFICATION WITH THE MUNICH OPENSMILE TOOLKIT ( MIREX 2010 SUBMISSION ) (2010) (2)
An Overview of the FIRST ICASSP Special Session on Computer Audition for Healthcare (2022) (2)
Introduction to the special issue on Paralinguistics in Naturalistic Speech and Language (2013) (2)
Uncertainty Aware Review Hallucination for Science Article Classification (2021) (2)
A Cross-Corpus Speech-Based Analysis of Escalating Negative Interactions (2022) (1)
Psychological Field Versus Physiological Field: From Qualitative Analysis to Quantitative Modeling of the Mental Status (2022) (1)
A MULTIMODAL WAVETRANSFORMER ARCHITECTURE CONDITIONED ON OPENL3 EMBEDDINGS FOR AUDIO-VISUAL SCENE CLASSIFICATION Technical Report (2021) (1)
GraphTMT: Unsupervised Graph-based Topic Modeling from Video Transcripts (2021) (1)
EIHW-MTG: Second DiCOVA Challenge System Report (2021) (1)
Health Technologies and Innovations to Effectively Respond to the COVID-19 Pandemic (2022) (1)
Rethinking Auditory Affective Descriptors Through Zero-Shot Emotion Recognition in Speech (2022) (1)
Automatic Detection of Textual Triggers of Reader Emotion in Short Stories (2016) (1)
Intelligent Audio Analysis: A Definition (2013) (1)
Introducing an Emotion-Driven Assistance System for Cognitively Impaired Individuals (2018) (1)
Supra‐segmental Features (2013) (1)
Essential Principles to Make Multimodal Sentiment Analysis Work in the Wild (2016) (1)
The age of data analytics: converting biomedical data into actionable insights. (2018) (1)
Perspectives on predictive power of multimodal deep learning: surprises and future directions (2018) (1)
sustAGE 1.0 – First Prototype, Use Cases, and Usability Evaluation (2022) (1)
A Federated Learning Paradigm for Heart Sound Classification (2022) (1)
Journaling Data for Daily PHQ-2 Depression Prediction and Forecasting (2022) (1)
The Inverse Problems for Computational Psychophysiology: Opinions and Insights (2022) (1)
Signal- und Mustererkennung (2011) (1)
ERM4HCI 2013: the 1st workshop on emotion representation and modelling in human-computer-interaction-systems (2013) (1)
Einmal Schmerzen – immer Schmerzen? Ergebnisse einer bevölkerungsbezogenen Längsschnittstudie zum Verlauf chronischer Rückenschmerzen (2006) (1)
Multi-Type Outer Product-Based Fusion of Respiratory Sounds for Detecting COVID-19 (2022) (1)
Estimating biosignals using the human voice (2016) (1)
SyntAct: A Synthesized Database of Basic Emotions (2022) (1)
Self-Supervised Attention Networks and Uncertainty Loss Weighting for Multi-Task Emotion Recognition on Vocal Bursts (2022) (1)
Sensing the Sounds of Silence: A Pilot Study on the Detection of Model Mice of Autism Spectrum Disorder from Ultrasonic Vocalisations (2021) (1)
Are 3D Face Shapes Expressive Enough for Recognising Continuous Emotions and Action Unit Intensities? (2022) (1)
Emotion and Theme Recognition in Music Using Attention-Based Methods (2020) (1)
motilitAI: A machine learning framework for automatic prediction of human sperm motility (2022) (1)
GPU-based training of autoencoders for bird sound data processing (2017) (1)
Handbook of Affective Computing, Rafael A. Calvo and Sidney D’Mello and Jonathan Gratch and Arvid Kappas (eds.) (2013) (1)
Graphical models for multi-modal automatic video editing in meetings (2009) (1)
Voice command generation using Progressive Wavegans (2019) (1)
Multitask Learning from Augmented Auxiliary Data for Improving Speech Emotion Recognition (2022) (1)
Intelligent Audio Analysis for Continuous Rainforest Occupancy Monitoring (2018) (1)
Personalised Deep Learning for Monitoring Depressed Mood from Speech (2022) (1)
Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning (2022) (1)
Predicting Group Work Performance from Physical Handwriting Features in a Smart English Classroom (2021) (1)
Automatic Guitar String Detection by String-Inverse Frequency Estimation (2017) (1)
Interaction with the Soundscape: Exploring Emotional Audio Generation for Improved Individual Wellbeing (2020) (1)
Introduction to the Special Section on Multimedia Computing and Applications of Socio-Affective Behaviors in the Wild (2018) (1)
You Sound Like Your Counterpart: Interpersonal Speech Analysis (2018) (1)
A Curriculum Learning Approach for Pain Intensity Recognition from Facial Expressions (2020) (1)
Towards Automatic Intoxication Detection from Speech in Real-Life Acoustic Environments (2012) (1)
Introduction To The Special Issue On Affect Analysis In Continuous Input (2013) (1)
Laughter in Child-Robot Interaction (2009) (1)
Automatic Recognition of Texture in Renaissance Music (2021) (1)
Guided Generative Adversarial Neural Network for Representation Learning and Audio Generation Using Fewer Labelled Audio Data (2021) (1)
Intelligent Audio Analysis for Continuous Rainforest Occupancy Monitoring (2018) (1)
On-Line NMF-Based Stereo Up-Mixing of Speech Improves Perceived Reduction of Non-Stationary Noise (2014) (1)
Acoustic Sounds for Wellbeing: A Novel Dataset and Baseline Results. (2019) (1)
GPU-based fast signal processing for large amounts of snore sound data (2016) (1)
Computational Audio Analysis (2014) (1)
AI-Based Emotion Recognition: Promise, Peril, and Prescriptions for Prosocial Path (2022) (1)
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era (2022) (1)
Belief Networks in Natural Language Processing for Improves Speech Motion Recognition (2004) (1)
Identifying surgical-mask speech using deep neural networks on low-level aggregation (2021) (1)
Deep Learning for Sentiment Analysis (2021) (1)
A Physiologically-Adapted Gold Standard for Arousal during Stress (2021) (1)
Multimodal Prediction of Spontaneous Humour: A Novel Dataset and First Results (2022) (1)
Large-scale Data Collection and Analysis via a Gamified Intelligent Crowdsourcing Platform (2019) (1)
Aspekte effizienten Usability Engineerings (2002) (1)
COVID-19's Impact on Mental Health - The Hour of Computational Aid? (2022) (1)
System Requirements Specification for Unmanned Aerial Vehicle (UAV) to Server Communication (2021) (1)
Editorial: Transactions on Affective Computing - Changes and Continuance (2016) (1)
Robust Key-Word Spotting in Field Noise for Open-Microphone Surgeon-Robot Interaction (2009) (1)
Data Augmentation for Dementia Detection in Spoken Language (2022) (1)
Reading the Author and Speaker: Towards a Holistic and Deep Approach on Automatic Assessment of What is in One's Words (2017) (1)
Convoluational Transformer With Adaptive Position Embedding For Covid-19 Detection From Cough Sounds (2022) (1)
COVYT: Introducing the Coronavirus YouTube and TikTok speech dataset featuring the same speakers with and without infection (2022) (1)
Analysing and Inferring of Intimacy Based on fNIRS Signals and Peripheral Physiological Signals (2019) (1)
Triplet Loss-Based Models for COVID-19 Detection from Vocal Sounds (2022) (1)
Depression Diagnosis and Forecast based on Mobile Phone Sensor Data (2022) (1)
Human Affect Recognition: Audio-Based Methods (2015) (1)
Bird Sound Classification Individual Project (2016) (1)
Heart Sound Classification based on Fractional Fourier Transformation Entropy (2022) (1)
Insights on Modelling Physiological, Appraisal, and Affective Indicators of Stress using Audio Features (2022) (1)
Onset Detection Exploiting Wavelet Transform with Bidirectional Long Short-Term Memory Neural Networks (2013) (1)
UNICORE Data Management: Recent Advancements (2011) (1)
A Prototypical Network Approach for Evaluating Generated Emotional Speech (2021) (1)
Automatic Analysis of Social Emotions (2017) (1)
Big Data, Deep Learning - At the Edge of X-Ray Speaker Analysis (2017) (1)
Guest editorial: Multimodal sentiment analysis and mining in the wild (2017) (1)
Introduction to the Special Issue on MMAC: Multimodal Affective Computing of Large-Scale Multimedia Data (2021) (1)
SPOKEN LANGUAGE IDENTIFICATION BY MEANS OF ACOUSTIC MID-LEVEL DESCRIPTORS (2020) (1)
Deep Attention-based Representation Learning for Heart Sound Classification (2021) (1)
IEEE Transactions on Affective Computing-On Novelty and Valence (2019) (1)
Guest Editorial Special Issue on Adversarial Learning in Computational Intelligence (2020) (0)
Advances in Emotion Recognition (A. Konar and A. Chakraborty, eds.) (2013) (0)
Conversational Speech Recognition in Non-stationary Reverberated Environments (2011) (0)
The ACM Multimedia 2023 Computational Paralinguistics Challenge: Emotion Share & Requests (2023) (0)
Online Personalisation of Deep Mobile Activity Recognisers (2022) (0)
Keynote Lecture 1: NLP in Tomorrow’s Profiling - Words May Fail You (2017) (0)
Fast Yet Effective Speech Emotion Recognition with Self-distillation (2022) (0)
Motivation, Aims, and Solutions (2013) (0)
Session details: Oral Session 2: Multimodal Fusion (2014) (0)
Convolutional Neural Networks for the Solution of the 2D Poisson Equation with Arbitrary Dirichlet Boundary Conditions, Mesh Sizes and Grid Spacings (2019) (0)
Propagating Variational Model Uncertainty for Bioacoustic Call Label Smoothing (2022) (0)
Grundbegriffe der Statistik (2011) (0)
Supervised Contrastive Learning for Game-Play Frustration Detection from Speech (2021) (0)
Deep Learning Post-Earnings-Announcement Drift (2021) (0)
Three recent trends in Paralinguistics on the way to omniscient machine intelligence (2018) (0)
Robust Laughter Detection for Wearable Wellbeing Sensing (2018) (0)
Masking Speech Contents by Random Splicing: is Emotional Expression Preserved? (2023) (0)
The First Personality Mapping Challenge (2014) (0)
Real-Time Activity Detection in a Multi-Talker Reverberated Environment (2012) (0)
Towards Heart Rate Categorisation from Speech in Outdoor Running Conditions (2022) (0)
SU‐GG‐J‐175: Target Registration Error Analysis Via KV Imaging and Conebeam CT in Accelerated Partial Breast Irradiation (2008) (0)
MULTIMODAL SEMI-SUPERVISED LEARNING FOR EMOTION RECOGNITION (2016) (0)
WEARABLE ASSISTANCEFOR THE BALLROOM-DANCE HOBBYIST- HOLISTICRHYTHM ANALYSISAND DANCE-STYLECLASSIFICATION (2007) (0)
MULTI-STREAM BEHAVIOR DETECTION APPLYING BAYESIAN NEWORKS (2005) (0)
5. Functional Aspects (2013) (0)
Large-Scale Nonverbal Vocalization Detection Using Transformers (2023) (0)
A Temporal-oriented Broadcast ResNet for COVID-19 Detection (2022) (0)
A Comprehensive Survey on Heart Sound Analysis in the Deep Learning Era (2023) (0)
Heart Sound Classification based on Residual Shrinkage Networks (2022) (0)
Embracing and Exploiting Annotator Emotional Subjectivity: An Affective Rater Ensemble Model (2021) (0)
Facial Emotion Recognition using Deep Residual Networks in Real-World Environments (2021) (0)
Acknowledgment to reviewers (1993) (0)
ASMMC-MMAC 2018: The Joint Workshop of 4th the Workshop on Affective Social Multimedia Computing and first Multi-Modal Affective Computing of Large-Scale Multimedia Data Workshop (2018) (0)
Machine learning in digital health, recent trends, and ongoing challenges (2020) (0)
Proceedings of the 2014 Workshop on Mapping Personality Traits Challenge and Workshop, MAPTRAITS@ICMI 2014, Istanbul, Turkey, November 12, 2014 (2014) (0)
Learning of units and knowledge representation (2013) (0)
Evaluating Deep Music Generation Methods Using Data Augmentation (2021) (0)
Proceedings of the 4th international conference on Affective computing and intelligent interaction - Volume Part II (2011) (0)
6. Corpus Engineering (2013) (0)
Hierarchical Network with Decoupled Knowledge Distillation for Speech Emotion Recognition (2023) (0)
6th International Symposium on Attention in Cognitive Systems 2013 (2013) (0)
Retrospektive Analyse frühkindlicher Lautäußerungen in "Home- Videos": Ein signalanalytischer Ansatz zur Früherkennung von Entwicklungsstörungen (2016) (0)
Emotional Expressions and Daily Cognitive Functions (2015) (0)
Guest Editorial Intelligence in Serious Games (2019) (0)
Learning of units and knowledge representation (2013) (0)
Mensch, Maschine, Emotion: Erkennung aus sprachlicher und manueller Interaktion (2007) (0)
A Deep Audiovisual Approach for Human Confidence Classification (2021) (0)
Adaptive Multimedia Retrieval, Revised Selected and Invited Papers of the 6th Workshop on Adaptive Multimedia Retrieval, (AMR 2008) (2009) (0)
Will Affective Computing Emerge From Foundation Models and General Artificial Intelligence? A First Evaluation of ChatGPT (2023) (0)
Climate Change & Computer Audition: A Call to Action and Overview on Audio Intelligence to Help Save the Planet (2022) (0)
The Sincerity Sub-Challenge: The Data (2016) (0)
The Native Language Sub-Challenge: The Data (2016) (0)
Learning the Acoustics of Autism-Spectrum Emotional Expressions - A Children’s Game? (2012) (0)
9. Linguistic Features (2013) (0)
Automatic Estimation of Biosignals From the Human Voice (2015) (0)
Editorial: Ethical Machine Learning and Artificial Intelligence (2021) (0)
EIHW-MTG DiCOVA 2021 Challenge System Report (2021) (0)
ICMI 2014 chairs' welcome (2014) (0)
Transferring Cross-Corpus Knowledge: An Investigation on Data Augmentation for Heart Sound Classification (2021) (0)
EMI Security Architecture (2013) (0)
A Walkthrough for the Principle of Logit Separation (2019) (0)
FASTAND ROBUSTMETER AND TEMPO RECOGNITIONFOR THE AUTOMATIC DISCRIMINATIONOFBALLROOM DANCE STYLES (2007) (0)
Accelerating Biomedical Signal Processing Using GPU: A Case Study of Snore Sound Feature Extraction (2017) (0)
Learning complementary representations via attention-based ensemble learning for cough-based COVID-19 recognition (2022) (0)
Exploiting time-frequency patterns with LSTM-RNNs for low-bitrate audio restoration (2019) (0)
audb - Sharing and Versioning of Audio and Annotation Data in Python (2023) (0)
Guest Editorial Special Issue on Computational Intelligence for End-to-End Audio Processing (2018) (0)
A System Structure for Multimodal Emotion Recognition in Meeting Environments (2005) (0)
Audio-based Eating Analysis and Tracking Utilising Deep Spectrum Features (2019) (0)
Computational Charisma - A Brick by Brick Blueprint for Building Charismatic Artificial Intelligence (2022) (0)
The First Audio / Visual Mapping Personality Traits Challenge Perceived Personality and Social Dimensions (2014) (0)
A Decade of Encouraging Speech Processing "Outside of the Box" - A Foreword (2016) (0)
Audiovisual Affect Assessment and Autonomous Automobiles: Applications (2022) (0)
Daily Mental Health Monitoring from Speech: A Real-World Japanese Dataset and Multitask Learning Analysis (2023) (0)
Speech Augmentation via Speaker-Specific Noise in Unseen Environment (2019) (0)
A Machine Learning Framework for Automatic Prediction of Human Semen Motility (2021) (0)
MuSe 2020 Chairs' Welcome (2020) (0)
Domain Adapting Deep Reinforcement Learning for Real-world Speech Emotion Recognition (2022) (0)
Presenting the Acoustic Sounds for Wellbeing Dataset and Baseline Classification Results (2019) (0)
Cognitive and Emotional Information Processing for Human–Machine Interaction (2012) (0)
Automatic speaker analysis 2.0: Hearing the bigger picture (2017) (0)
Prosody and phonemes (2013) (0)
Machine‐Based Modelling (2013) (0)
Ist Stimme das neue Blut? KI und Stimmbiomarker zu früheren Diagnose - für jedermann, überall und jederzeit (2022) (0)
Guest Editorial Special Issue on Concept-Level Opinion and Sentiment Analysis (2012) (0)
Onset Detection : A Wavelet Packet Based Approach with Recurrent Neural Networks (2014) (0)
Proceedings of the 2014 workshop on Emotion Representation and Modelling in Human-Computer-Interaction-Systems, ERM4HCI@ICMI 2014, Istanbul, Turkey, November 16, 2014 (2014) (0)
ASC-Inclusion – Interactive Software to Help Children with ASC Understand and Express Emotions (2013) (0)
Do Computers Have Personality? (2015) (0)
Aspects of Modelling (2013) (0)
Natural Language Processing and Attentional-Based Fusion Strategies for Multimodal Sentiment Analysis (2018) (0)
Assessing the Feasibility of a Text-Based Conversational Agent for Asthma Support: Protocol for a Mixed Methods Observational Study (2022) (0)
ERM4Proc. Int. Conf. on Human-Computer Interaction HCI 2013 - The 1st Workshop on Emotion Representation and Modelling in Human-Computer-Interaction-Systems (2013) (0)
The EIHW-GLAM Deep Attentive Multi-model Fusion System for Cough-based COVID-19 Recognition in the DiCOVA 2021 Challenge (2021) (0)
System Integration and Application (2013) (0)
Eyben, F. and Petridis, S. and Schuller, Björn and Tzimiropoulos, Georgios and Zafeiriou, Stefanos and Pantic, Maja (2011) Audiovisual classification of vocal outbursts in human conversation using long-short-term (2016) (0)
Multimedia Information Extraction (2009) (0)
Emotion in the singing voice—a deeperlook at acoustic features in the light ofautomatic classification (2015) (0)
What can we learn from massive music archives (2013) (0)
Socially Aware Many-to-Machine Communication (2012) (0)
Guest Editorial: Introduction to the Special Section on Efficient Network Design for Convergence of Deep Learning and Edge Computing (2022) (0)
Speaker Identification - Comparing Linear Regression Based Adaptation and Acoustic High-Level Feature (2005) (0)
Audio Barlow Twins: Self-Supervised Audio Representation Learning (2022) (0)
AUDIOVISUAL VOCAL OUTBURST RECOGNITION IN NOISY ACOUSTIC CONDITIONS (2011) (0)
Quantifying Cognitive Load from Voice using Transformer-Based Models and a Cross-Dataset Evaluation (2022) (0)
ICMI 2013 chairs' welcome (2013) (0)
Automatic Emotion Modelling in Written Stories (2022) (0)
Towards an Efficient Deep Learning Model for Emotion and Theme Recognition in Music (2021) (0)
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning (2023) (0)
Computational Modelling of Paralinguistics: Overview (2013) (0)
Timing Levels in Segment-Based S (2006) (0)
Zero-Shot Speech Emotion Recognition Using Generative Learning with Reconstructed Prototypes (2023) (0)
Exploring Spatial-Temporal Representations for fNIRS-based Intimacy Detection via an Attention-enhanced Cascade Convolutional Recurrent Neural Network (2021) (0)
Introduction Affective neural networks and cognitive learning systems for big data analysis (2014) (0)
Editor Proceedings of 4th International LREC Workshop on Emotion Sentiment and Social Signals 2012 Istanbul (2012) (0)
Automatic recognition of emotional dimensions in singing (2015) (0)
ERM4HCI 2014: The 2nd Workshop on Emotion Representation and Modelling in Human-Computer-Interaction-Systems (2014) (0)
Computational Audio Analysis (Dagstuhl Seminar 13451) (2013) (0)
Applications in Intelligent Speech Analysis (2013) (0)
Next Gen Music Analysis: Some Inspirations from Speech (2011) (0)
Classification of Stuttering – the Compare Challenge and Beyond (2023) (0)
Analysis of loss functions for fast single-class classification (2019) (0)
The Deception Sub-Challenge: The Data (2016) (0)
An emotional modulation model as signature for the identification of children developmental disorders (2018) (0)
A Real-Time Speech Enhancement Framework in Noisy and Reverberated Acoustic Scenarios (2012) (0)
ASMMC21: The 6th International Workshop on Affective Social Multimedia Computing (2021) (0)
Perception and classification of emotions in nonsense speech: Humans versus machines (2023) (0)
Session details: Challenge 1: 2nd international audio/visual emotion challenge and workshop - AVEC 2012 (2012) (0)
A Bag of Wavelet Features for Snore Sound Classification (2019) (0)
Video-Driven Speech Reconstruction - Show & Tell Demo (2020) (0)
Computational Methods for Affect Detection from Natural Language (2020) (0)
‘Hands‐On’: Existing Toolkits and Practical Tutorial (2013) (0)
Contributing to the early identification of neurodevelopmental disorders: The retrospective analysis of pre-linguistic vocalisations in home video material (2016) (0)
The Phonetics of Laughing (J. Trouvain and N. Campbell, eds.) (2012) (0)
The Influence of Pleasant and Unpleasant Odours on the Acoustics of Speech (2022) (0)
Towards cross-modal pre-training and learning tempo-spatial characteristics for audio recognition with convolutional and recurrent neural networks (2020) (0)
Optimization and Parallelization of Monaural Source Separation Algorithms in the openBliSSART Toolkit (2012) (0)
Exploring Perception Uncertainty for Emotion Recognition in Dyadic Conversation and Music Listening (2020) (0)
COVID-19 Detection Exploiting Self-Supervised Learning Representations of Respiratory Sounds (2022) (0)
Recognition of Interest in Human (2006) (0)
Comparison of Automatic Speech Recognition Systems (2021) (0)
Deliverable D10.4 Psychological Experiments and Evaluation with Adult and Child Players Project Acronym Asc-inclusion Project Title Integrated Internet-based Environment for Social Inclusion of Children with Autism Spectrum Conditions Deliverable Title Psychological Experiments and Evaluation with A (0)
Emotion and mental state recognition from speech (2012) (0)
eXplainable Cooperative Machine Learning with NOVA (2020) (0)
New Avenues in Audio Intelligence: Towards Holistic Real-life Audio Understanding (2021) (0)
Synthesized speech for model training in cross-corpus recognition of human emotion (2012) (0)
Proceedings of the ACII Affective Vocal Bursts Workshop and Competition 2022 (A-VB): Understanding a critically understudied modality of emotional expression (2022) (0)
Zero-Shot Audio Classification Using Synthesised Classifiers and Pre-Trained Models (2022) (0)
Special issue of IEEE Transactions on Affective Computing ' Naturalistic Affect Resources (vol 3,1) (2012) (0)
Predicting Sex and Stroke Success - Computer-aided Player Grunt Analysis in Tennis Matches (2022) (0)
Cross-Layer Similarity Knowledge Distillation for Speech Enhancement (2022) (0)
The perception of emotional cues by children in artificial background noise (2020) (0)
How to build a machine that people enjoy talking to (2010) (0)
Capturing dynamics of post-earnings-announcement drift using genetic algorithm-optimised supervised learning (2020) (0)
Emotion and Themes Recognition in Music with Convolutional and Recurrent Attention-Blocks (2020) (0)
Correction to: The perception of emotional cues by children in artificial background noise (2021) (0)
Stream fusion for multi-stream automatic speech recognition (2016) (0)
Detektion und Estimation (2011) (0)
Novel Insights on Induced Sparsity in Multi-Time Attention Networks (2022) (0)
Detecting somatisation disorder via speech: introducing the Shenzhen Somatisation Speech Corpus (2023) (0)
Can a Holistic View Facilitate the Development of Intelligent Traditional Chinese Medicine? A Survey (2023) (0)
AVEC 2017 chairs' welcome (2017) (0)
SUBMOTIONSFOR HIDDENMARKOV MODEL BASED DYNAMIC FACIALACTIONRECOGNITION (2006) (0)
Future-generation personality prediction from digital footprints (2022) (0)
A multi-information fusion model for short term load forecasting of an architectural complex considering spatio-temporal characteristics (2022) (0)
Positive-Pair Redundancy Reduction Regularisation for Speech-Based Asthma Diagnosis Prediction (2023) (0)
Maus- und tastaturunterstützte Detektion von Schläfrigkeitszuständen (2012) (0)
EEG Emotion Recognition Based on Self-attention Dynamic Graph Neural Networks (2022) (0)
An Estimation of Online Video User Engagement From Features of Time- and Value-Continuous, Dimensional Emotions (2022) (0)
CNN-Based Heart Sound Classification with an Imbalance-Compensating Weighted Loss Function (2022) (0)
A Novel Graphical Technique for Combinational Logic Representation and Optimization (2017) (0)
Hearttoheart: The Arts of Infant Versus Adult-Directed Speech Classification (2023) (0)
HEAR4Health: A blueprint for making computer audition a staple of modern healthcare (2023) (0)
Robust Audio Watermarking Based on Empirical Mode Decomposition and Group Differential Relations (2023) (0)
Speech Denoising and Compensation for Hearing Aids Using an FTCRN-Based Metric GAN (2023) (0)
Multimodal Machine Learning for Social Interaction with Ageing Individuals (2021) (0)
Correction to: The perception of emotional cues by children in artificial background noise (2021) (0)
Intelligent Music Intervention for Mental Disorders: Insights and Perspectives (2023) (0)
Human-Aligned Trading by Imitative Multi-Loss Reinforcement Learning (2023) (0)
Novel no-reference multi-dimensional perceptual similarity metric (2022) (0)
Applications in Intelligent Sound Analysis (2013) (0)
Guest Editorial: Special Issue on Affective Speech and Language Synthesis, Generation, and Conversion (2023) (0)
Exploring interpretable representations for heart sound abnormality detection (2023) (0)
Selective Element and Two Orders Vectorization Networks for Automatic Depression Severity Diagnosis via Facial Changes (2022) (0)
COVID-19 Detection from Speech in Noisy Conditions (2023) (0)
CoughLIME: Sonified Explanations for the Predictions of COVID-19 Cough Classifiers (2022) (0)
Child and Youth Affective Computing—Challenge Accepted (2022) (0)
Verfahren und System für das Training von Sprachverarbeitungseinrichtungen (2009) (0)
Federated Intelligent Terminals Facilitate Stuttering Monitoring (2023) (0)
MuSe-Trust: Multimodal Trustworthiness Sub-challenge (MuSe2020) (2020) (0)
Investigating Individual- and Group-Level Model Adaptation for Self-Reported Runner Exertion Prediction from Biomechanics (2022) (0)
A Glance-and-Gaze Network for Respiratory Sound Classification (2022) (0)
Microexpressions: A Chance for Computers to Beat Humans at Detecting Hidden Emotions? (2019) (0)
Emotional factors in speech based human-machine interaction in the operating room (2010) (0)
Stream fusion for multi-stream automatic speech recognition (2016) (0)
Socio-Cognitive Language Processing for Special User Groups (2021) (0)
Early Vocal Development in Autism Spectrum Disorder, Rett Syndrome, and Fragile X Syndrome: Insights from Studies Using Retrospective Video Analysis (2018) (0)
Editorial: Intelligent Signal Analysis for Contagious Virus Diseases (2022) (0)
Digital Mental Health - Breaking a Lance for Prevention (2022) (0)
Editorial Special Issue on Cognitive and Emotional Information Processing for Human-Machine Interaction (2011) (0)
The Voice of the Body: Why AI Should Listen to It and an Archive (2023) (0)
A.I. & Speech: A Silent Anthropomorphism? (2019) (0)
AUDIOVISUALBEHAVIORMODELING BY COMBINED FEATURESPACES (2007) (0)
Prosody and Phonemes: On the Influence of Speaking Style (2013) (0)
Audio Enhancement and Robustness (2013) (0)
BayesianNetworkBased MultiStreamFusion forAutomated OnlineVideo Surveillance (2005) (0)
Index, Biographies, Glossary (2017) (0)
Capturing Time Dynamics From Speech Using Neural Networks for Surgical Mask Detection (2022) (0)
Structure of the Book (2013) (0)
Classifying Emotion-Antecedent Appraisal in Brain Activity Using Machine Learning Methods (2015) (0)
Knowledge Transfer For On-Device Speech Emotion Recognition with Neural Structured Learning (2022) (0)
FEATURE SELECTION AND STACKING FOR ROBUST DISCRIMINATION OF SPEECH , MONOPHONIC SINGING , AND POLYPHONIC MUSIC ( WedPmOR 5 ) Author ( s ) : (2005) (0)
Histologische Korrelation von Portio-Biopsie und Konus in der klinischen Praxis: 2001–2008 (2009) (0)

This paper list is powered by the following services:

Other Resources About Björn Schuller

en.wikipedia.org

What Schools Are Affiliated With Björn Schuller?

Björn Schuller is affiliated with the following schools:

Björn Schuller's Academic­Influence.com Rankings