Yu-long Qiao
#116,284
Most Influential Person Now
Yu-long Qiao's AcademicInfluence.com Rankings
Yu-long Qiaocomputer-science Degrees
Computer Science
#4548
World Rank
#4798
Historical Rank
Artificial Intelligence
#1148
World Rank
#1168
Historical Rank
Database
#1762
World Rank
#1847
Historical Rank

Download Badge
Computer Science
Yu-long Qiao's Degrees
- PhD Computer Science Chinese University of Hong Kong
- Bachelors Computer Science Peking University
Similar Degrees You Can Earn
Why Is Yu-long Qiao Influential?
(Suggest an Edit or Addition)Yu-long Qiao's Published Works
Number of citations in a given year to any of this author's works
Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author
Published Works
- Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks (2016) (3709)
- A Discriminative Feature Learning Approach for Deep Face Recognition (2016) (2997)
- Temporal Segment Networks: Towards Good Practices for Deep Action Recognition (2016) (2983)
- ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks (2018) (2116)
- Action recognition with trajectory-pooled deep-convolutional descriptors (2015) (1078)
- NTIRE 2017 Challenge on Single Image Super-Resolution: Methods and Results (2017) (1051)
- Detecting Text in Natural Image with Connectionist Text Proposal Network (2016) (788)
- Bag of visual words and fusion methods for action recognition: Comprehensive study and good practice (2014) (654)
- SpiderCNN: Deep Learning on Point Sets with Parameterized Convolutional Filters (2018) (578)
- Temporal Segment Networks for Action Recognition in Videos (2017) (480)
- Towards Good Practices for Very Deep Two-Stream ConvNets (2015) (415)
- FOTS: Fast Oriented Text Spotting with a Unified Network (2018) (395)
- Robust Scene Text Detection with Convolution Neural Network Induced MSER Trees (2014) (376)
- Action Recognition with Stacked Fisher Vectors (2014) (372)
- Real-Time Action Recognition with Enhanced Motion Vector CNNs (2016) (365)
- Region Attention Networks for Pose and Occlusion Robust Facial Expression Recognition (2019) (318)
- Pairwise Rotation Invariant Co-Occurrence Local Binary Pattern (2012) (315)
- Domain Generalization with MixStyle (2021) (286)
- Single Shot Text Detector with Regional Attention (2017) (281)
- Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward (2017) (256)
- Reading Scene Text in Deep Convolutional Sequences (2015) (252)
- Domain Generalization: A Survey (2021) (237)
- Suppressing Uncertainties for Large-Scale Facial Expression Recognition (2020) (236)
- RankSRGAN: Generative Adversarial Networks With Ranker for Image Super-Resolution (2019) (234)
- Text-Attentional Convolutional Neural Network for Scene Text Detection (2015) (233)
- Adaptive Pyramid Context Network for Semantic Segmentation (2019) (228)
- A Key Volume Mining Deep Framework for Action Recognition (2016) (226)
- LSTD: A Low-Shot Transfer Detector for Object Detection (2018) (212)
- Multi-view Super Vector for Action Recognition (2014) (195)
- An End-to-End TextSpotter with Explicit Alignment and Attention (2018) (179)
- Action Recognition and Detection by Combining Motion and Appearance Features (2014) (176)
- Deep auto-context convolutional neural networks for standard-dose PET image estimation from low-dose PET/MRI (2017) (172)
- Dynamic Multi-Scale Filters for Semantic Segmentation (2019) (165)
- Learning Attentive Pairwise Interaction for Fine-Grained Classification (2020) (162)
- Motionlets: Mid-level 3D Parts for Human Motion Recognition (2013) (153)
- Latent Factor Guided Convolutional Neural Networks for Age-Invariant Face Recognition (2016) (152)
- A Comparative Study of Encoding, Pooling and Normalization Methods for Action Recognition (2012) (146)
- RPAN: An End-to-End Recurrent Pose-Attention Network for Action Recognition in Videos (2017) (142)
- Video Action Detection with Relational Dynamic-Poselets (2014) (139)
- AdaCos: Adaptively Scaling Cosine Logits for Effectively Learning Deep Face Representations (2019) (131)
- Places205-VGGNet Models for Scene Recognition (2015) (130)
- CUHK & ETHZ & SIAT Submission to ActivityNet Challenge 2016 (2016) (124)
- MoFAP: A Multi-level Representation for Action Recognition (2016) (124)
- Deep embedding convolutional neural network for synthesizing CT image from T1‐Weighted MR image (2017) (115)
- Real-Time Action Recognition With Deeply Transferred Motion Vector CNNs (2018) (113)
- Recurrent Spatial-Temporal Attention Network for Action Recognition in Videos (2018) (109)
- Text-Attentional Convolutional Neural Networks for Scene Text Detection (2016) (103)
- Super-Identity Convolutional Neural Network for Face Hallucination (2018) (103)
- Adaptive Dilated Network With Self-Correction Supervision for Counting (2020) (102)
- Domain Adaptive Ensemble Learning (2020) (101)
- Frame Attention Networks for Facial Expression Recognition in Videos (2019) (99)
- PIRM Challenge on Perceptual Image Enhancement on Smartphones: Report (2018) (98)
- NTIRE 2019 Challenge on Real Image Super-Resolution: Methods and Results (2019) (96)
- FD-GAN: Generative Adversarial Networks with Fusion-discriminator for Single Image Dehazing (2020) (95)
- Knowledge Guided Disambiguation for Large-Scale Scene Classification With Multi-Resolution CNNs (2016) (93)
- Unsupervised optimal phoneme segmentation: Objectives, algorithm and comparisons (2008) (91)
- Visual Compositional Learning for Human-Object Interaction Detection (2020) (91)
- Actionness Estimation Using Hybrid Fully Convolutional Networks (2016) (90)
- A Study on Invariance of $f$-Divergence and Its Application to Speech Recognition (2010) (86)
- Mining Motion Atoms and Phrases for Complex Action Recognition (2013) (85)
- Temporal Context Aggregation Network for Temporal Action Proposal Refinement (2021) (83)
- Attention-Guided Hierarchical Structure Aggregation for Image Matting (2020) (82)
- DeepWriter: A Multi-stream Deep CNN for Text-Independent Writer Identification (2016) (81)
- Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network (2016) (79)
- AnoPCN: Video Anomaly Detection via Deep Predictive Coding Network (2019) (77)
- Weakly Supervised PatchNets: Describing and Aggregating Local Patches for Scene Recognition (2016) (74)
- Boosting VLAD with Supervised Dictionary Learning and High-Order Statistics (2014) (72)
- Gender and Smile Classification Using Deep Convolutional Neural Networks (2016) (70)
- Modulating Image Restoration With Continual Levels via Adaptive Feature Modification Layers (2019) (69)
- SmallBigNet: Integrating Core and Contextual Views for Video Classification (2020) (67)
- MetaCleaner: Learning to Hallucinate Clean Representations for Noisy-Labeled Visual Recognition (2019) (66)
- Automatic differentiation of Glaucoma visual field from non-glaucoma visual filed using deep convolutional neural network (2018) (66)
- Object-Scene Convolutional Neural Networks for event recognition in images (2015) (64)
- Domain Generalization in Vision: A Survey (2021) (58)
- PA3D: Pose-Action 3D Machine for Video Recognition (2019) (58)
- Find and Focus: Retrieve and Localize Video Events with Natural Language Queries (2018) (58)
- Group emotion recognition with individual facial emotion CNNs and global image based CNNs (2017) (58)
- Image Segmentation with Pyramid Dilated Convolution Based on ResNet and U-Net (2017) (57)
- ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic (2021) (56)
- Deep Recurrent Multi-instance Learning with Spatio-temporal Features for Engagement Intensity Prediction (2018) (56)
- A Comprehensive Study on Center Loss for Deep Face Recognition (2019) (54)
- Learning Geometry-Disentangled Representation for Complementary Understanding of 3D Object Point Cloud (2020) (53)
- Pedestrian detection with unsupervised multispectral feature learning using deep neural networks (2019) (53)
- Context-Transformer: Tackling Object Confusion for Few-Shot Detection (2020) (52)
- Geometry Sharing Network for 3D Point Cloud Classification and Segmentation (2019) (52)
- Action and Gesture Temporal Spotting with Super Vector Representation (2014) (51)
- DF2Net: A Dense-Fine-Finer Network for Detailed 3D Face Reconstruction (2019) (49)
- Transferring Deep Object and Scene Representations for Event Recognition in Still Images (2018) (49)
- Locally Supervised Deep Hybrid Model for Scene Recognition (2016) (45)
- Mutual Component Convolutional Neural Networks for Heterogeneous Face Recognition (2019) (42)
- COCAS: A Large-Scale Clothes Changing Person Dataset for Re-Identification (2020) (42)
- Common Feature Discriminant Analysis for Matching Infrared Face Images to Optical Face Images (2014) (42)
- Blind Image Super-Resolution: A Survey and Beyond (2021) (42)
- Sparse Deep Transfer Learning for Convolutional Neural Network (2017) (40)
- Refining Pseudo Labels with Clustering Consensus over Generations for Unsupervised Object Re-identification (2021) (39)
- Exploring Motion Boundary based Sampling and Spatial-Temporal Context Descriptors for Action Recognition (2013) (38)
- A Multi-task Learning Approach for Image Captioning (2018) (37)
- F-divergence Is a Generalized Invariant Measure between Distributions (2008) (36)
- Affordance Transfer Learning for Human-Object Interaction Detection (2021) (35)
- Speech Structure and Its Application to Robust Speech Processing (2010) (34)
- Multi-feature canonical correlation analysis for face photo-sketch image retrieval (2013) (33)
- Dual Learning for Cross-domain Image Captioning (2017) (33)
- Speech generation from hand gestures based on space mapping (2009) (33)
- Cascade Attention Networks For Group Emotion Recognition with Face, Body and Image Cues (2018) (32)
- Suppressing Model Overfitting for Image Super-Resolution Networks (2019) (32)
- Exploring Emotion Features and Fusion Strategies for Audio-Video Emotion Recognition (2019) (31)
- The Chinese University of Hong Kong , Hong Kong , China (2012) (31)
- The Chinese University of Hong Kong , Hong Kong , China (2012) (31)
- Multi-scale Joint Encoding of Local Binary Patterns for Texture and Material Classification (2013) (31)
- Boosting Optical Character Recognition: A Super-Resolution Approach (2015) (30)
- A Theory of Phase Singularities for Image Representation and its Applications to Object Tracking and Image Matching (2009) (27)
- Mixture of Probabilistic Linear Regressions: A unified view of GMM-based mapping techiques (2009) (27)
- Constellation of phase singularities in a speckle-like pattern for optical vortex metrology applied to biological kinematic analysis. (2008) (27)
- Detecting Human-Object Interaction via Fabricated Compositional Learning (2021) (26)
- A Local Approximation of Fundamental Measure Theory Incorporated into Three Dimensional Poisson–Nernst–Planck Equations to Account for Hard Sphere Repulsion Among Ions (2015) (26)
- WildFish: A Large Benchmark for Fish Recognition in the Wild (2018) (26)
- Prostate Segmentation using 2D Bridged U-net (2018) (26)
- P2SGrad: Refined Gradients for Optimizing Deep Face Models (2019) (25)
- CT-Net: Channel Tensorization Network for Video Classification (2021) (25)
- Large Margin Dimensionality Reduction for Action Similarity Labeling (2014) (25)
- Deep classification of vehicle makers and models: The effectiveness of pre-training and data enhancement (2015) (24)
- Motion boundary based sampling and 3D co-occurrence descriptors for action recognition (2014) (24)
- Face recognition based on gradient gabor feature and Efficient Kernel Fisher analysis (2010) (23)
- W-net: Bridged U-net for 2D Medical Image Segmentation (2018) (23)
- Marine Animal Detection and Recognition with Advanced Deep Learning Models (2017) (23)
- Joint retina segmentation and classification for early glaucoma diagnosis. (2019) (23)
- Self-supervised Multi-view Stereo via Effective Co-Segmentation and Data-Augmentation (2021) (23)
- Analysis and utilization of MLLR speaker adaptation technique for learners' pronunciation evaluation (2009) (22)
- Residual Compensation Networks for Heterogeneous Face Recognition (2019) (21)
- DEVELOPMENT OF COMPLEX SOCIETIES IN THE YILUO REGION: A GIS BASED POPULATION AND AGRICULTURAL AREA ANALYSIS (2007) (21)
- Better Exploiting OS-CNNs for Better Event Recognition in Images (2015) (21)
- An image-based intelligent system for pointer instrument reading (2014) (21)
- Affine Invariant Dynamic Time Warping and its Application to Online Rotated Handwriting Recognition (2006) (20)
- A Joint Evaluation of Dictionary Learning and Feature Encoding for Action Recognition (2014) (20)
- Bridging Music and Image via Cross-Modal Ranking Analysis (2016) (20)
- Cascade multi-head attention networks for action recognition (2020) (20)
- Depth driven people counting using deep region proposal network (2017) (19)
- Temporal Hallucinating for Action Recognition with Few Still Images (2018) (19)
- Random discriminant structure analysis for automatic recognition of connected vowels (2007) (19)
- Learning to Predict Context-adaptive Convolution for Semantic Segmentation (2020) (19)
- DeepDeblur: text image recovery from blur to sharp (2019) (18)
- FeatherCNN: Fast Inference Computation with TensorGEMM on ARM Architectures (2020) (18)
- CUHK & SIAT Submission for THUMOS 15 Action Recognition Challenge (2015) (18)
- Exploring Fisher vector and deep networks for action spotting (2015) (17)
- Pairwise Nonparametric Discriminant Analysis for Binary Plankton Image Recognition (2014) (17)
- Dense Correlation Network for Automated Multi-Label Ocular Disease Detection with Paired Color Fundus Photographs (2020) (17)
- Fast single image dehazing through Edge-Guided Interpolated Filter (2015) (17)
- Product Image Recognition with Guidance Learning and Noisy Supervision (2019) (16)
- Modeling selective ion adsorption into cylindrical nanopores (2018) (16)
- Local Multi-Grouped Binary Descriptor With Ring-Based Pooling Configuration and Optimization (2015) (16)
- Becoming Linguistically Mature: Modeling English and German Children’s Writing Development Across School Grades (2020) (15)
- An analysis of collective damage for short fatigue cracks based on equilibrium of crack numerical density (1998) (15)
- Deep rehabilitation gait learning for modeling knee joints of lower-limb exoskeleton (2016) (15)
- Bootstrap Model Ensemble and Rank Loss for Engagement Intensity Regression (2019) (14)
- Deep Relation Transformer for Diagnosing Glaucoma With Optical Coherence Tomography and Visual Field Function (2021) (13)
- Automatic music video generation: cross matching of music and image (2012) (13)
- Exploring dense trajectory feature and encoding methods for human interaction recognition (2013) (13)
- Integration of multilayer regression analysis with structure-based pronunciation assessment (2010) (13)
- Learning Category Correlations for Multi-label Image Recognition with Graph Networks (2019) (13)
- MixStyle Neural Networks for Domain Generalization and Adaptation (2021) (13)
- Affine invariant features and their application to speech recognition (2009) (13)
- Structured Triplet Learning with POS-Tag Guided Attention for Visual Question Answering (2018) (13)
- Cross matching of music and image (2012) (12)
- Unsupervised Person Re-Identification with Multi-Label Learning Guided Self-Paced Clustering (2021) (12)
- Dual-supervised attention network for deep cross-modal hashing (2019) (12)
- Regularized-MLLR speaker adaptation for computer-assisted language learning system (2010) (12)
- An analysis on overall crack-number-density of short-fatigue-cracks (1999) (12)
- SSN3D: Self-Separated Network to Align Parts for 3D Convolution in Video Person Re-Identification (2021) (11)
- Dialect-based Speaker Classification of Chinese Using Structural Representation of Pronunciation (2009) (11)
- Codebook enhancement of vlad representation for visual recognition (2016) (11)
- StripNet: Towards Topology Consistent Strip Structure Segmentation (2018) (11)
- Progressive Object Transfer Detection (2020) (11)
- On invariant structural representation for speech recognition: theoretical validation and experimental improvement (2009) (11)
- A study on Hidden Structural Model and its application to labeling sequences (2009) (10)
- Very Lightweight Photo Retouching Network with Conditional Sequential Modulation (2021) (10)
- Exploring Regularizations with Face, Body and Image Cues for Group Cohesion Prediction (2019) (10)
- Optimal event search using a structural cost function - improvement of structure to speech conversion (2009) (9)
- Structural analysis of dialects, sub-dialects and sub-sub-dialects of Chinese (2009) (9)
- RBF-Softmax: Learning Deep Representative Prototypes with Radial Basis Function Softmax (2020) (9)
- Implementation of Robust Speech Recognition by Simulating Infants' Speech Perception Based on the Invariant Sound Shape Embedded in Utterances (2009) (9)
- TTPP: Temporal Transformer with Progressive Prediction for Efficient Action Anticipation (2020) (9)
- Exploring Cross-Channel Texture Correlation for Color Texture Classification (2013) (9)
- PC-HMR: Pose Calibration for 3D Human Mesh Recovery from 2D Images/Videos (2021) (9)
- Investigate Indistinguishable Points in Semantic Segmentation of 3D Point Cloud (2021) (8)
- Simulations and experiments of stochastic characteristics for collective short fatigue cracks in steels (2002) (8)
- Boosting up Scene Text Detectors with Guided CNN (2018) (8)
- Single Shot TextSpotter with Explicit Alignment and Attention (2018) (7)
- DID: Disentangling-Imprinting-Distilling for Continuous Low-Shot Detection (2020) (7)
- Fast Texture Synthesis via Pseudo Optimizer (2020) (7)
- Self-speculation of clinical features based on knowledge distillation for accurate ocular disease classification (2021) (7)
- Multi-Dimension Modulation for Image Restoration with Dynamic Controllable Residual Learning (2019) (7)
- Deep face attributes recognition using spatial transformer network (2016) (7)
- Multiple Domain Experts Collaborative Learning: Multi-Source Domain Generalization For Person Re-Identification (2021) (7)
- Visual-Textual Sentiment Analysis in Product Reviews (2019) (6)
- Language that Captivates the Audience: Predicting Affective Ratings of TED Talks in a Multi-Label Classification Task (2021) (6)
- Transferring Object-Scene Convolutional Neural Networks for Event Recognition in Still Images (2016) (6)
- Alzheimer's Disease Detection from Spontaneous Speech through Combining Linguistic Complexity and (Dis)Fluency Features with Pretrained Language Models (2021) (6)
- Multiple Transfer Learning and Multi-label Balanced Training Strategies for Facial AU Detection In the Wild (2020) (6)
- Classification of Ocular Diseases Employing Attention-Based Unilateral and Bilateral Feature Weighting and Fusion (2020) (6)
- LTD: Local Ternary Descriptor for image matching (2013) (5)
- Local Color Contrastive Descriptor for Image Classification (2015) (5)
- Dialect-based speaker classification using speaker-invariant dialect features (2010) (5)
- A semantic model for video based face recognition (2013) (5)
- Pronunciation Proficiency Estimation Based on Multilayer Regression Analysis Using Speaker-independent Structural Features (2010) (5)
- Exploring Multi-Scale Feature Propagation and Communication for Image Super Resolution (2020) (5)
- S-D Net: Joint Segmentation and Diagnosis Revealing the Diagnostic Significance of Using Entire RNFL Thickness in Glaucoma (2018) (5)
- A Study on Unsupervised Dictionary Learning and Feature Encoding for Action Classification (2013) (5)
- A Study on Bag of Gaussian Model with Application to Voice Conversion (2011) (5)
- Flower image retrieval with category attributes (2014) (4)
- Dynamic Sampling Network for Semantic Segmentation (2020) (4)
- Neighbourhood-guided Feature Reconstruction for Occluded Person Re-Identification (2021) (4)
- Improving scale invariant feature transform with local color contrastive descriptor for image classification (2017) (4)
- Visual Field Based Automatic Diagnosis of Glaucoma Using Deep Convolutional Neural Network (2018) (4)
- Automated Classification of Written Proficiency Levels on the CEFR-Scale through Complexity Contours and RNNs (2021) (4)
- Unsupervised optimal phoneme segmentation: theory and experimental evaluation (2013) (4)
- Structural representation with a general form of invariant divergence ∗ ◎ (2008) (3)
- F-divergence based local contrastive descriptor for image classification (2014) (3)
- Metric learning for unsupervised phoneme segmentation (2008) (3)
- RDS-Denoiser: a Detail-preserving Convolutional Neural Network for Image Denoising (2018) (3)
- Face recognition based on Gradient Gabor feature (2008) (3)
- Road segmentation via iterative deep analysis (2015) (3)
- Intelligent Glaucoma Diagnosis Via Active Learning And Adversarial Data Augmentation (2019) (3)
- Refined Gate: A Simple and Effective Gating Mechanism for Recurrent Units (2020) (3)
- HMM-based sequence-to-frame mapping for voice conversion (2010) (3)
- A Comprehensive Study on Temporal Modeling for Online Action Detection (2020) (2)
- Finding hard faces with better proposals and classifier (2020) (2)
- Good Practice on Deep Scene Classification: from Local Supervision to Knowledge Guided Disambiguation (2017) (2)
- Shenzhen Institutes of Advanced Technology, CAS, China at TRECVID INS 2016 (2016) (2)
- Understanding the Dynamics of Second Language Writing through Keystroke Logging and Complexity Contours (2020) (2)
- Bridging Music and Image : A Preliminary Study with Multiple Ranking CCA Learning (2012) (2)
- An analysis framework of two-level sampling subspace for speaker verification (2013) (2)
- Adaptive Part-Level Model Knowledge Transfer for Gender Classification (2016) (2)
- Unsupervised Phoneme Segmentation Using Transformed Cepstrum Features (2008) (2)
- Correction to: Automatic differentiation of Glaucoma visual field from non-glaucoma visual field using deep convolutional neural network (2019) (2)
- Regularized Maximum Likelihood Linear Regression Adaptation for Computer-Assisted Language Learning Systems (2011) (2)
- Gesture Design of Hand-to-Speech Converter Derived from Speech-to-Hand Converter Based on Probabilistic Integration Model (2011) (2)
- Phase singularities for image representation and matching (2008) (1)
- Robust Text Line Detection in Equipment Nameplate Images* (2019) (1)
- Learning multiple local binary descriptors for image matching (2017) (1)
- Human action recognition with DeepAction Kernel Gaussian Process (2016) (1)
- MIL: Music Exploration and Visualization via Lyric and Image (2015) (1)
- A Literature Review: Geometric Methods and Their Applications in Human-Related Analysis (2019) (1)
- Toward Optimal Unsupervised Phoneme Segmentation -A Theoretical and Experimental Investigation (2007) (1)
- Singularity Characteristics for a Lip-Shaped Crack Subjected to Remote Biaxial Loading (1999) (1)
- Bayesian Mixture of Probabilistic Linear Regressions for Voice Conversion (2012) (1)
- Improvement of Structure to Speech Conversion Using Iterative Optimization (2009) (1)
- Dimension Reduction and Discriminant Analysis for Japanese Connected Vowel Recognition ∗ ◎ (2008) (1)
- The Impact of ASR on the Automatic Analysis of Linguistic Complexity and Sophistication in Spontaneous L2 Speech (2021) (1)
- Voice conversion using Bayesian mixture of Probabilistic Linear Regressions and dynamic kernel features (2012) (0)
- Statistical sequence-to-frame mapping techniques for voice conversion (2010) (0)
- Orientation-Aware Text Proposals Network for Scene Text Detection (2017) (0)
- Understanding Vocabulary Growth Through An Adaptive Language Learning System (2019) (0)
- Unsupervised phoneme segmentation using Mahalanobis distance (Speech) -- (国際ワークショップ"Asian workshop on speech science and technology") (2008) (0)
- TianJin F 17 Middle Area LinQu F 18 Middle Area LinQu M 19 Middle Area LinQu F 20 Middle Area LinQu F 21 Middle Area LinQu M 22 Middle Area LinQu M 23 Middle Area LinQu (2009) (0)
- Trimmed Event Recognition : submission to ActivityNet Challenge 2018 (2018) (0)
- Features of Poems in Praise of Spring from Six Dynasties of Han to Wei (2010) (0)
- SAR Target Recognition Via 2DPCA and Weighted Sparse Representation (2019) (0)
- Holistic and Prosodic Representation of the Segmental Aspect of Speech (2008) (0)
- entation of Robust Speech Recognition by Si m ulating Infants ’ Speech Perception Based on the Invariant Sound Shape E m bedded in Utterances (2009) (0)
- Weakly Supervised PatchNets : Learning Aggregated Patch Descriptors for Scene Recognition (2017) (0)
- of Structure to Speech Conversion Using Iterative Opti m ization (2009) (0)
- Using Fishervoice to enhance the performance of I-vector based speaker verification system (2014) (0)
- Unsupervised Phoneme Segmentation Using Mahalanobis Distance (2008) (0)
- Isolated word recognition based on speech structures and discriminant analysis (2008) (0)
- An Investigation of Hidden Structure Model (2009) (0)
- griffith . edu . au Face Recognition based on Gradient Gabor feature (2017) (0)
- Development of a speech generator from hand motions based on space mapping (2008) (0)
- Structural Analysis of Chinese Dialect Speakers and Their Automatic Classification (2009) (0)
- Structure-constrained distribution matching using quadratic programming and its application to pronunciation evaluation (2011) (0)
- Collaborative Multi-View Convolutions With Gating For Accurate And Fast Volumetric Medical Image Segmentation (2021) (0)
- Knowledge-based Fully Convolutional Network and Its Application in Segmentation of Lung CT Images (2018) (0)
- Experimental Study of Using Spectrum-based Features for Structural Representation of Speech (2008) (0)
- The Equipment Nameplate Dataset for Scene Text Detection and Recognition∗ (2019) (0)
- Structure-Preserving Super Resolution with Gradient Guidance Supplementary Material (2020) (0)
- Proposal of a method for acoustic separation of the linguistic information and the extra-linguistic information conveyed in a single speech stream — An attempt toward realizing human-like speech processing on machines — (2010) (0)
- Orientation Robust Scene Text Recognition in Natural Scene* (2019) (0)
- Improvements in Pronunciation Evaluation Based on Speech Technology (2012) (0)
- Proposal of Hidden Structure Model ∗ ◎ (2009) (0)
- Supplementary Material for Fast Texture Synthesis via Pseudo Optimizer (2020) (0)
- Free hand sketch understanding using SVMs-chain modeling for spatial and temporal patterns (2009) (0)
- Proposal of a Method to Extract the Linguistic Information in Speech Based on Acoustic Separation of the Linguistic and Extra-Linguistic Aspects of Speech — — An Attempt toward Realizing Human-Like Speech Processing on Machines — — (2010) (0)
- Multi-feature subspace analysis for audio-vidoe based multi-modal person recognition (2014) (0)
This paper list is powered by the following services: