Alexandros Potamianos
#176,452
Most Influential Person Now
Alexandros Potamianos's AcademicInfluence.com Rankings
Alexandros Potamianos's Engineering Degrees
- Engineering: World Rank #8115, Historical Rank #9591
- Applied Physics: World Rank #2876, Historical Rank #2934

Alexandros Potamianos's Degrees
- PhD Electrical and Computer Engineering University of Maryland, College Park
- Masters Electrical and Computer Engineering University of Maryland, College Park
- Bachelors Electrical and Computer Engineering National Technical University of Athens
Why Is Alexandros Potamianos Influential?
According to Wikipedia, Alexandros Potamianos is an engineer at the National Technical University of Athens, Greece. He was named a Fellow of the Institute of Electrical and Electronics Engineers in 2016 for his contributions to human-centered speech and multimodal signal analysis.
Alexandros Potamianos's Published Works
[Charts: citations per year to any of this author's works; total citations to the works the author published in each year]
Published Works
- Acoustics of children's speech: developmental changes of temporal and spectral parameters. (1999) (790)
- A comparison of the energy operator and the Hilbert transform approach to signal and speech demodulation (1994) (258)
- Multimodal Saliency and Fusion for Movie Summarization Based on Aural, Visual, and Textual Attention (2013) (207)
- Robust recognition of children's speech (2003) (205)
- Creating conversational interfaces for children (2002) (166)
- Multi-band speech recognition in noisy environments (1998) (162)
- Speech formant frequency and bandwidth tracking using multiband energy demodulation (1995) (151)
- DARPA communicator dialog travel planning systems: the june 2000 data collection (2001) (129)
- A review of ASR technologies for children's speech (2009) (122)
- Fractal dimensions of speech sounds: computation and application to automatic speech recognition. (1999) (120)
- Automatic speech recognition for children (1997) (117)
- Higher order differential energy operators (1995) (115)
- Unsupervised Semantic Similarity Computation between Terms Using Web Documents (2010) (112)
- Robust AM-FM features for speech recognition (2005) (108)
- Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech Mixtures (2010) (105)
- An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models (2019) (101)
- Detecting emotional state of a child in a conversational computer game (2011) (100)
- DARPA communicator: cross-system results for the 2001 evaluation (2002) (99)
- NTUA-SLP at SemEval-2018 Task 1: Predicting Affective Content in Tweets with Deep Attentive RNNs and Transfer Learning (2018) (95)
- A supervised approach to movie emotion tracking (2011) (93)
- Multimodal Prediction of Affective Dimensions and Depression in Human-Computer Interactions (2014) (89)
- Speech analysis and synthesis using an AM-FM modulation model (1997) (88)
- Open Challenges in Modelling, Analysis and Synthesis of Human Behaviour in Human–Human and Human–Machine Interactions (2015) (88)
- Auditory Teager energy cepstrum coefficients for robust speech recognition (2005) (74)
- Video event detection and summarization using audio, visual and text saliency (2009) (69)
- Data Augmentation Using GANs for Speech Emotion Recognition (2019) (68)
- Distributional Semantic Models for Affective Text Analysis (2013) (68)
- Time-frequency distributions for automatic speech recognition (2001) (67)
- Structural Attention Neural Networks for improved sentiment analysis (2017) (67)
- Analysis of children's speech: duration, pitch and formants (1997) (66)
- SEQ^3: Differentiable Sequence-to-Sequence-to-Sequence Autoencoder for Unsupervised Abstractive Sentence Compression (2019) (65)
- A Comparison of the Squared Energy and Teager-Kaiser Operators for Short-Term Energy Estimation in Additive Noise (2009) (64)
- Improving speech recognition for children using acoustic adaptation and pronunciation modeling (2014) (63)
- Movie summarization based on audiovisual saliency detection (2008) (55)
- DARPA communicator evaluation: progress from 2000 to 2001 (2002) (54)
- Segment-based speech emotion recognition using recurrent neural networks (2017) (52)
- Kernel Models for Affective Lexicon Creation (2011) (45)
- Spoken dialog systems for children (1998) (45)
- A system for finding speech formants and modulations via energy separation (1994) (44)
- NTUA-SLP at SemEval-2018 Task 3: Tracking Ironic Tweets using Ensembles of Word and Character Level Attentive RNNs (2018) (43)
- Classification of cognitive load from speech using an i-vector framework (2014) (41)
- COGNIMUSE: a multimodal video database annotated with saliency, events, semantics and emotion with application to summarization (2017) (41)
- Dialogue management in the Bell Labs communicator system (2000) (39)
- Multimodal Processing and Interaction, Audio, Video, Text (2010) (39)
- Soft-feature decoding for speech recognition over wireless channels (2001) (37)
- Similarity computation using semantic networks created from web-harvested data (2013) (37)
- Demonstration of assembly work using augmented reality (2007) (37)
- On the Effects of Filterbank Design and Energy Computation on Robust Speech Recognition (2011) (35)
- A review of the acoustic and linguistic properties of children's speech (2007) (33)
- Detecting Politeness and frustration state of a child in a conversational computer game (2005) (32)
- Multi-band long-term signal variability features for robust voice activity detection (2013) (27)
- Auto-induced semantic classes (2004) (27)
- Adaptive language models for spoken dialogue systems (2002) (27)
- UDALM: Unsupervised Domain Adaptation through Language Modeling (2021) (27)
- Adaptive categorical understanding for spoken dialogue systems (2005) (26)
- An error-protected speech recognition system for wireless communications (2002) (26)
- Audiovisual Attention Modeling and Salient Event Detection (2008) (24)
- Blind Speech Separation Using Parafac Analysis and Integer Least Squares (2006) (23)
- Predicting audio-visual salient events based on visual, audio and text modalities for movie summarization (2015) (23)
- DESIGN PRINCIPLES AND TOOLS FOR MULTIMODAL DIALOG SYSTEMS (2000) (23)
- Statistical recursive finite state machine parsing for speech understanding (2000) (23)
- Stream Weight Computation for Multi-Stream Classifiers (2006) (23)
- Hybrid natural language generation for spoken dialogue systems (2001) (23)
- Ambiguity representation and resolution in spoken dialogue systems (2001) (22)
- Attention-based Conditioning Methods for External Knowledge Integration (2019) (22)
- Multimodal User Interface for Augmented Assembly (2007) (22)
- Spectral Moment Features Augmented by Low Order Cepstral Coefficients for Robust ASR (2010) (22)
- On combining frequency warping and spectral shaping in HMM based speech recognition (1997) (21)
- A saliency-based approach to audio event detection and summarization (2012) (21)
- UNSUPERVISED COMBINATION OF METRICS FOR SEMANTIC CLASS INDUCTION (2006) (21)
- Finding speech formants and modulations via energy separation: with application to a vocoder (1993) (20)
- SAIL: A hybrid approach to sentiment analysis (2013) (19)
- Multimodal system evaluation using modality efficiency and synergy metrics (2008) (19)
- Combining statistical similarity measures for automatic induction of semantic classes (2005) (19)
- Speaker adaptation for audio-visual speech recognition (1999) (18)
- A comparison of four metrics for auto-inducing semantic classes (2001) (17)
- Metrics for measuring domain independence of semantic classes (2001) (17)
- An investigation of vocal arousal dynamics in child-psychologist interactions using synchrony measures and a conversation-based model (2014) (17)
- Tweester at SemEval-2016 Task 4: Sentiment Analysis in Twitter Using Semantic-Affective Model Adaptation (2016) (17)
- A soft-clustering algorithm for automatic induction of semantic classes (2007) (16)
- Unsupervised Semantic Similarity Computation using Web Search Engines (2007) (16)
- Developmental acoustic study of American English diphthongs. (2014) (16)
- SAIL: Sentiment Analysis using Semantic Similarity and Contrast Features (2014) (16)
- Affective evaluation of a mobile multimodal dialogue system using brain signals (2012) (16)
- Deep Hierarchical Fusion with Application in Sentiment Analysis (2019) (16)
- Engagement detection for children with Autism Spectrum Disorder (2017) (16)
- Information Seeking Spoken Dialogue Systems— Part II: Multimodal Dialogue (2007) (15)
- Affective Lexicon Creation for the Greek Language (2016) (15)
- Integrating Recurrence Dynamics for Speech Emotion Recognition (2018) (14)
- Valence, arousal and dominance estimation for English, German, Greek, Portuguese and Spanish lexica using semantic models (2015) (13)
- Modulation features for speech recognition (2002) (13)
- Towards adapting fantasy, curiosity and challenge in multimodal dialogue systems for preschoolers (2009) (13)
- Audio salient event detection and summarization using audio and text modalities (2015) (13)
- SemSim: Resources for Normalized Semantic Similarity Computation Using Lexical Networks (2012) (12)
- A Study in Efficiency and Modality Usage in Multimodal Form Filling Systems (2008) (12)
- Language model adaptation for spoken language systems (1998) (12)
- NTUA-SLP at IEST 2018: Ensemble of Neural Transfer Methods for Implicit Emotion Classification (2018) (12)
- Toward the Automatic Extraction of Policy Networks Using Web Links and Documents (2013) (11)
- NTUA-SLP at SemEval-2018 Task 2: Predicting Emojis using RNNs with Context-aware Attention (2018) (11)
- Web data harvesting for speech understanding grammar induction (2013) (11)
- DeepPurple: Estimating Sentence Semantic Similarity using N-gram Regression Models and Web Snippets (2012) (11)
- Tweester at SemEval-2017 Task 4: Fusion of Semantic-Affective and pairwise classification models for sentiment analysis in Twitter (2017) (11)
- Region-based vocal tract length normalization for ASR (2008) (10)
- Information Seeking Spoken Dialogue Systems— Part I: Semantics and Pragmatics (2007) (10)
- Affective Conditioning on Hierarchical Networks applied to Depression Detection from Transcribed Clinical Interviews (2020) (10)
- Short-time instantaneous frequency and bandwidth features for speech recognition (2009) (10)
- Dialogue Act Semantic Representation and Classification Using Recurrent Neural Networks (2017) (10)
- Statistical analysis of amplitude modulation in speech signals using an AM-FM model (2009) (10)
- An affective evaluation tool using brain signals (2013) (9)
- EmotiWord: Affective Lexicon Creation with Application to Interaction and Multimedia Data (2011) (9)
- Affective Conditioning on Hierarchical Attention Networks Applied to Depression Detection from Transcribed Clinical Interviews (2020) (9)
- Instantaneous frequency and bandwidth estimation using filterbank arrays (2013) (9)
- Unsupervised Stream Weight Estimation using Anti-Models (2007) (9)
- Word Semantic Similarity for Morphologically Rich Languages (2014) (9)
- Fantasy, curiosity and challenge as adaptation indicators in multimodal dialogue systems for preschoolers (2009) (9)
- The SpeDial datasets: datasets for Spoken Dialogue Systems analytics (2016) (8)
- Linguistic analysis of spontaneous children speech (2008) (8)
- Neural Activation Semantic Models: Computational lexical semantic models of localized neural activations (2018) (8)
- Unsupervised Low-Rank Representations for Speech Emotion Recognition (2019) (8)
- Advanced front-end for robust speech recognition in extremely adverse environments (2007) (8)
- Multimodal systems for children: building a prototype (1999) (8)
- Cognitively Motivated Distributional Representations of Meaning (2016) (8)
- Speech understanding for spoken dialogue systems: From corpus harvesting to grammar rule induction (2018) (8)
- Hierarchical bi-directional attention-based RNNs for supporting document classification on protein–protein interactions affected by genetic mutations (2018) (8)
- Towards incorporating language morphology into statistical machine translation systems (2005) (7)
- MMLatch: Bottom-Up Top-Down Fusion For Multimodal Sentiment Analysis (2022) (7)
- Semantic Similarity Computation for Abstract and Concrete Nouns Using Network-based Distributional Semantic Models (2013) (7)
- Unsupervised Stream-Weights Computation in Classification and Recognition Tasks (2009) (7)
- Modulation and chaotic acoustic features for speech recognition (2002) (7)
- Categorical understanding using statistical ngram models (1999) (7)
- Affective language model adaptation via corpus selection (2014) (6)
- Human-Computer Interfaces to Multimedia Content a Review (2008) (6)
- Learning of Semantic Relations between Ontology Concepts using Statistical Techniques (2008) (6)
- A feature-space transformation for telephone based speech recognition (1995) (5)
- Modality tracking in the multimodal Bell Labs Communicator (2003) (5)
- M3: MultiModal Masking Applied to Sentiment Analysis (2021) (5)
- Spoken dialogue evaluation for the Bell Labs communicator system (2002) (5)
- Using lexical, syntactic and semantic features for non-terminal grammar rule induction in Spoken Dialogue Systems (2014) (5)
- On the effectiveness of PARAFAC-based estimation for blind speech separation (2008) (5)
- Quality evaluation of computational models for movie summarization (2015) (5)
- Cognitive Multimodal Processing: from Signal to Behavior (2014) (5)
- The effect of input mode on inactivity and interaction times of multimodal systems (2007) (5)
- Audio-Based Distributional Representations of Meaning Using a Fusion of Feature Encodings (2016) (4)
- Root Cause Analysis of Miscommunication Hotspots in Spoken Dialogue Systems (2016) (4)
- COBRA - Mining Web for Corporate Brand and Reputation Analysis (2007) (4)
- Mixture of Topic-Based Distributional Semantic and Affective Models (2018) (4)
- BLENDING SPEECH AND VISUAL INPUT IN MULTIMODAL DIALOGUE SYSTEMS (2006) (4)
- Fusion of knowledge-based and data-driven approaches to grammar induction (2014) (4)
- IEEE International Workshop on Multimedia Signal Processing (MMSP'07) (2007) (4)
- Lexical and affective models in early acquisition of semantics (2017) (4)
- Hierarchical bidirectional attention-based RNN in BioCreative VI precision medicine track, document triage task (2017) (4)
- Instantaneous Energy Operators: Applications to Speech Processing and Communications 1. Speech Processing Applications 2. Higher-order Energy Operators (2007) (4)
- Multiple time resolution analysis of speech signal using MCE training with application to speech recognition (2009) (4)
- On the effect of fundamental frequency on amplitude and frequency modulation patterns in speech resonances (2010) (3)
- Fusion of Compositional Network-based and Lexical Function Distributional Semantic Models (2015) (3)
- A codec for speech recognition in a wireless system (2000) (3)
- Towards Speaker and Environmental Robustness in ASR: The HIWIRE Project (2006) (3)
- Low-Dimensional Manifold Distributional Semantic Models (2014) (3)
- Speech Emotion Recognition Using Affective Saliency (2016) (3)
- Combined frequency warping and spectral shaping in HMM based speech recognition (2000) (3)
- Associative and Semantic Features Extracted From Web-Harvested Corpora (2012) (3)
- Speech recognition for wireless applications (2001) (3)
- Analysis of children’s speech. Pitch and formant frequency (1997) (3)
- DeepPurple: Lexical, String and Affective Feature Fusion for Sentence-Level Semantic Similarity Estimation (2013) (3)
- Pattern Search Multidimensional Scaling (2018) (3)
- Crossmodal Network-Based Distributional Semantic Models (2016) (2)
- On using fractal features of speech sounds in automatic speech recognition (1997) (2)
- Developmental aspects of American English diphthong trajectories in the formant space (2013) (2)
- Introduction to the special issue on speech and language processing of children's speech for child-machine interaction applications (2011) (2)
- Spoken dialogue grammar induction from crowdsourced data (2014) (2)
- Using Oliver API for emotion-aware movie content characterization (2019) (2)
- BabyExp: Constructing a Huge Multimodal Resource to Acquire Commonsense Knowledge Like Children Do (2010) (2)
- Cross-Topic Distributional Semantic Representations Via Unsupervised Mappings (2019) (2)
- Cross-domain classification using generalized domain acts (2000) (2)
- Continuous models of affect from text using n-grams (2013) (2)
- Agora: a GUI approach to multimodal user interfaces (2002) (2)
- Audio-based Distributional Semantic Models for Music Auto-tagging and Similarity Measurement (2016) (1)
- Unsupervised HMM adaptation based on speech-silence discrimination (1997) (1)
- Feeling is Understanding: From Affective to Semantic Spaces (2015) (1)
- SemEval-2014 Task 2: Grammar Induction for Spoken Dialogue Systems (2014) (1)
- Unsupervised Stream Weight Computation in a Segmentation Task: Application to Audio-Visual Speech Recognition (2007) (1)
- A Multi-Task BERT Model for Schema-Guided Dialogue State Tracking (2022) (1)
- Up from Limited Dialog Systems! (2012) (1)
- EmpBot: A T5-based Empathetic Chatbot focusing on Sentiments (2021) (1)
- End-to-end Generative Zero-shot Learning via Few-shot Learning (2021) (1)
- Developmental acoustic study of American English diphthongs (2014) (1)
- A semantic-affective compositional approach for the affective labelling of adjective-noun and noun-noun pairs (2016) (1)
- Transition features for CRF-based speech recognition and boundary detection (2009) (1)
- SeqAug: Sequential Feature Resampling as a modality agnostic augmentation method (2023) (0)
- A Dataset for Greek Traditional and Folk Music: Lyra (2022) (0)
- Pattern Search MDS (2018) (0)
- Efficient Audio Captioning Transformer with Patchout and Text Guidance (2023) (0)
- Combination of frequency distortion and spectral shaping in an HMM - based speech recognizer (1998) (0)
- (Poster) An Affective Evaluation Tool Using Brain Signals (2013) (0)
- Depression detection in social media posts using affective and social norm features (2023) (0)
- Reliability evaluation of decoded signal blocks for speech recognition over wireless transmission channels (2000) (0)
- Adapted Multimodal BERT with Layer-wise Fusion for Sentiment Analysis (2022) (0)
- COGNIMUSE: a multimodal video database annotated with saliency, events, semantics and emotion with application to summarization (2017) (0)
- Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems A case study for Modern Greek (2022) (0)
- Design Principles for Multimodal Spoken Dialogue Systems (2008) (0)
- Affective evaluation of multimodal dialogue games for preschoolers using physiological signals (2013) (0)
- Unsupervised adaptation of Hidden Markov Models by means of speech/pause discrimination (1998) (0)
- tucSage: Grammar Rule Induction for Spoken Dialogue Systems via Probabilistic Candidate Selection (2014) (0)
- SYSTEMS FOR CHILDREN (1997) (0)
- Alternating Objectives Generates Stronger PGD-Based Adversarial Attacks (2022) (0)
- Demodulation of AM–FM resonances in speech using energy separation (1994) (0)
- An unstructured distributional semantic model for morphologically rich languages: the Dutch case study (2014) (0)
- Proceedings of the 10th International Conference on Multimodal Interfaces, ICMI 2008, Chania, Crete, Greece, October 20-22, 2008 (2008) (0)
- Affective classification of generic audio clips using regression models (2013) (0)
- INVESTIGATIONS IN ARTICULATORY SYNTHESIS (2007) (0)
- Session details: Multimodal dialog (2009) (0)
- Proceedings of the 9th International Conference on Multimodal Interfaces, ICMI 2007, Nagoya, Aichi, Japan, November 12-15, 2007 (2007) (0)
- BLIND SPEECH SEPARATION ALGORITHM USING PARAFAC AND INTEGER LEAST SQUARES (0)
- Regotron: Regularizing the Tacotron2 Architecture Via Monotonic Alignment Loss (2022) (0)
- Open Challenges in Modelling, Analysis and Synthesis of Human Behaviour in Human–Human and Human–Machine Interactions (2015) (0)
- Novel features for robust speech recognition (2002) (0)
- Sensory-Aware Multimodal Fusion for Word Semantic Similarity Estimation (2017) (0)
- BabyRobot-Next Generation Social Robots (2016) (0)
- Extending Compositional Attention Networks for Social Reasoning in Videos (2022) (0)
What Schools Are Affiliated With Alexandros Potamianos?
Alexandros Potamianos is affiliated with the following schools: