Taghi M. Khoshgoftaar

Taghi M. Khoshgoftaar's AcademicInfluence.com Rankings

Computer Science

#2629

World Rank

#2749

Historical Rank

Data Mining

#26

World Rank

#26

Historical Rank

Machine Learning

#175

World Rank

#176

Historical Rank

Database

#360

World Rank

#376

Historical Rank

computer-science Degrees

Download Badge

Computer Science

Why Is Taghi M. Khoshgoftaar Influential?

(Suggest an Edit or Addition)

(See a Problem?)

Taghi M. Khoshgoftaar's Published Works

Number of citations in a given year to any of this author's works

Total number of citations to an author for the works they published in a given year. This highlights publication of the most important work(s) by the author

Published Works

A survey on Image Data Augmentation for Deep Learning (2019) (4477)
A Survey of Collaborative Filtering Techniques (2009) (3535)
Deep learning applications and challenges in big data analytics (2015) (1681)
A survey of transfer learning (2016) (1483)
RUSBoost: A Hybrid Approach to Alleviating Class Imbalance (2010) (1349)
Survey on deep learning with class imbalance (2019) (1082)
Experimental perspectives on learning from imbalanced data (2007) (682)
The Detection of Fault-Prone Programs (1992) (495)
A survey of open source tools for machine learning with big data in the Hadoop ecosystem (2015) (383)
A survey on addressing high-class imbalance in big data (2018) (351)
An Empirical Study of Learning from Imbalanced Data Using Random Forest (2007) (329)
Survey of review spam detection using machine learning techniques (2015) (327)
A review of data mining using big data in health informatics (2014) (294)
Choosing software metrics for defect prediction: an investigation on feature selection techniques (2011) (265)
Intrusion detection and Big Heterogeneous Data: a Survey (2015) (262)
Predicting Software Development Errors Using Software Complexity Metrics (1990) (254)
Comparing Boosting and Bagging Techniques With Noisy and Imbalanced Data (2011) (254)
RUSBoost: Improving classification performance when training data is skewed (2008) (236)
Application of neural networks to software quality modeling of a very large telecommunications system (1997) (230)
Early Quality Prediction: A Case Study in Telecommunications (1996) (221)
Big Data: Deep Learning for financial sentiment analysis (2018) (204)
Analyzing software measurement data with clustering techniques (2004) (197)
Knowledge discovery from imbalanced and noisy data (2009) (193)
Comparative Assessment of Software Quality Classification Techniques: An Empirical Case Study (2004) (188)
A survey on heterogeneous transfer learning (2017) (185)
CatBoost for big data: an interdisciplinary review (2020) (172)
Can neural networks be easily interpreted in software cost estimation? (2002) (169)
Survey on categorical data for neural networks (2020) (163)
Tree-based software quality estimation models for fault prediction (2002) (155)
Fault Prediction Modeling for Software Quality Estimation: Comparing Commonly Used Techniques (2003) (153)
Attribute Selection and Imbalanced Data: Problems in Software Defect Prediction (2010) (151)
An Empirical Study of the Classification Performance of Learners on Imbalanced and Noisy Software Quality Data (2007) (147)
Deep Learning applications for COVID-19 (2021) (147)
Feature Selection with High-Dimensional Imbalanced Data (2009) (146)
Evolutionary Optimization of Software Quality Modeling with Multiple Repositories (2010) (144)
EMERALD: software metrics and models on the desktop (1996) (144)
A Study on the Relationships of Classifier Performance Metrics (2009) (142)
Unsupervised learning for expert-based software quality estimation (2004) (141)
A neural network approach for early detection of program modules having high risk in the maintenance phase (1995) (138)
CLUSTERING-BASED NETWORK INTRUSION DETECTION (2007) (135)
LOGISTIC REGRESSION MODELING OF SOFTWARE QUALITY (1999) (130)
Collaborative Filtering for Multi-class Data Using Belief Nets Algorithms (2006) (129)
Classification tree models of software quality over multiple releases (1999) (127)
An application of fuzzy clustering to software quality prediction (2000) (124)
Identification of fuzzy models of software cost estimation (2004) (123)
Using regression trees to classify fault-prone software modules (2002) (122)
Regression modelling of software quality: empirical investigation☆ (1990) (114)
Improving Software-Quality Predictions With Data Sampling and Boosting (2009) (114)
Estimating software project effort by analogy based on linguistic values (2002) (114)
A review of the stability of feature selection techniques for bioinformatics data (2012) (110)
Measuring coupling and cohesion of software modules: an information-theory approach (2001) (107)
Improving Software Quality Prediction by Noise Filtering Techniques (2007) (104)
An empirical study of predicting software faults with case-based reasoning (2006) (102)
Learning with limited minority class data (2007) (100)
Big Data fraud detection using multiple medicare data sources (2018) (98)
The Dimensionality Of Program Complexity (1989) (97)
A Comparative Study of Ensemble Feature Selection Techniques for Software Defect Prediction (2010) (97)
Detection of software modules with high debug code churn in a very large legacy system (1996) (96)
Text Data Augmentation for Deep Learning (2021) (94)
An application of zero-inflated Poisson regression for software fault prediction (2001) (93)
Analogy-Based Practical Classification Rules for Software Quality Estimation (2003) (93)
Incomplete-Case Nearest Neighbor Imputation in Software Measurement Data (2007) (90)
Software Quality Analysis of Unlabeled Program Modules With Semisupervised Clustering (2007) (88)
Case-Based Software Quality Prediction (2000) (88)
Using neural networks to predict software faults during testing (1996) (87)
Predictive Modeling Techniques of Software Quality from Software Measures (1992) (86)
Predicting software errors, during development, using nonlinear regression models: a comparative study (1992) (86)
Software quality estimation with limited fault data: a semi-supervised learning perspective (2007) (86)
Measuring coupling and cohesion: an information-theory approach (1999) (85)
Software quality classification modeling using the SPRINT decision tree algorithm (2002) (83)
A Comprehensive Empirical Study of Count Models for Software Fault Prediction (2007) (81)
Imputation-boosted collaborative filtering using machine learning classifiers (2008) (80)
The pairwise attribute noise detection algorithm (2007) (79)
A comparative study of iterative and non-iterative feature selection techniques for software defect prediction (2014) (79)
A neural network approach for predicting software development faults (1992) (78)
A review of statistical and machine learning methods for modeling cancer risk using structured clinical data (2018) (77)
A comparative study of pattern recognition techniques for quality evaluation of telecommunications software (1994) (75)
Predicting fault-prone modules with case-based reasoning (1997) (75)
Application of neural networks for predicting program faults (1995) (74)
Using Random Undersampling to Alleviate Class Imbalance on Tweet Sentiment Data (2015) (74)
Applications of a relative complexity metric for software project management (1990) (73)
Predicting testability of program modules using a neural network (2000) (72)
Machine prediction of personality from Facebook profiles (2012) (72)
Hybrid sampling for imbalanced data (2008) (70)
Enhancing software quality estimation using ensemble-classifier based noise filtering (2005) (70)
MODELING SOFTWARE QUALITY WITH CLASSIFICATION TREES (2001) (69)
Evolutionary Sampling and Software Quality Modeling of High-Assurance Systems (2009) (68)
Supervised Neural Network Modeling: An Empirical Investigation Into Learning From Imbalanced Data With Labeling Errors (2010) (67)
Using Twitter Content to Predict Psychopathy (2012) (66)
Using Process History to Predict Software Quality (1998) (66)
Unsupervised multiscale color image segmentation based on MDL principle (2006) (66)
How Many Software Metrics Should be Selected for Defect Prediction? (2011) (65)
Resampling or Reweighting: A Comparison of Boosting Implementations (2008) (65)
Balancing Misclassification Rates in Classification-Tree Models of Software Quality (2000) (63)
Identifying learners robust to low quality data (2008) (63)
A multiobjective module-order model for software quality enhancement (2004) (62)
A survey on the state of healthcare upcoding fraud analysis and detection (2017) (62)
Analyzing software quality with limited fault-proneness defect data (2005) (61)
A survey and analysis of intrusion detection models based on CSE-CIC-IDS2018 Big Data (2020) (60)
Predicting susceptibility to social bots on Twitter (2013) (60)
Ordering Fault-Prone Software Modules (2003) (60)
A practical classification-rule for software-quality models (2000) (59)
The improved grey model based on particle swarm optimization algorithm for time series prediction (2016) (59)
Data Mining for Predictors of Software Quality (1999) (59)
A clustering approach to wireless network intrusion detection (2005) (58)
The necessity of assuring quality in software measurement data (2004) (57)
Genetic programming-based decision trees for software quality classification (2003) (56)
Impact of Feature Selection Techniques for Tweet Sentiment Classification (2015) (56)
Application of fuzzy expert systems in assessing operational risk of software (2003) (56)
Feature Selection with Imbalanced Data for Software Defect Prediction (2009) (56)
Improving deep neural network design with new text data representations (2017) (55)
Measurement of data structure complexity (1993) (55)
Modeling the relationship between source code complexity and maintenance difficulty (1994) (54)
An Empirical Study of Feature Ranking Techniques for Software Quality Prediction (2012) (53)
Classification of Fault-Prone Software Modules: Prior Probabilities, Costs, and Model Evaluation (1998) (52)
Medicare fraud detection using neural networks (2019) (52)
A Comparative Study of Data Sampling and Cost Sensitive Learning (2008) (52)
Building Useful Models from Imbalanced Data with Sampling and Boosting (2008) (52)
A Multi-Objective Software Quality Classification Model Using Genetic Programming (2007) (52)
Software reliability model selection: a cast study (1991) (51)
Measuring dynamic program complexity (1992) (50)
Medicare Fraud Detection Using Machine Learning Methods (2017) (50)
A comprehensive empirical evaluation of missing value imputation in noisy software measurement data (2008) (50)
Comparative Analysis of DNA Microarray Data through the Use of Feature Selection Techniques (2010) (49)
Software quality analysis by combining multiple projects and learners (2009) (49)
Predicting Faults in High Assurance Software (2010) (48)
Threshold-based feature selection techniques for high-dimensional bioinformatics data (2012) (48)
Machine Learning for Detecting Brute Force Attacks at the Network Level (2014) (48)
Hybrid Collaborative Filtering Algorithms Using a Mixture of Experts (2007) (47)
Predicting Medical Provider Specialties to Detect Anomalous Insurance Claims (2016) (47)
Reducing overfitting in genetic programming models for software quality classification (2004) (46)
Software measurement data reduction using ensemble techniques (2012) (46)
A comparative evaluation of feature ranking methods for high dimensional bioinformatics data (2011) (46)
Random forest: A reliable tool for patient response prediction (2011) (46)
A Comparative Study of Threshold-Based Feature Selection Techniques (2010) (46)
Comparison of Data Sampling Approaches for Imbalanced Bioinformatics Data (2014) (45)
Exploring the behaviour of neural network software quality models (1995) (44)
Improving code churn predictions during the system test and maintenance phases (1994) (44)
An empirical investigation of filter attribute selection techniques for software quality classification (2009) (44)
Detection of fault-prone software modules during a spiral life cycle (1996) (44)
Mining Data with Rare Events: A Case Study (2007) (44)
Genetic programming model for software quality classification (2001) (43)
Investigating soft computing in case-based reasoning for software cost estimation (2002) (43)
Cost-sensitive boosting in software quality modeling (2002) (42)
Evolutionary neural networks: a robust approach to software reliability problems (1997) (42)
Predicting high-risk program modules by selecting the right software measurements (2012) (42)
The Effects of Random Undersampling with Simulated Class Imbalance for Big Data (2018) (41)
Medicare Fraud Detection Using Random Forest with Class Imbalanced Big Data (2018) (41)
Application of fuzzy expert system in test case selection for system regression test (2005) (40)
The effects of varying class distribution on learner behavior for medicare fraud detection with imbalanced big data (2018) (40)
Prediction of software faults using fuzzy nonlinear regression modeling (2000) (40)
Accuracy of software quality models over multiple releases (2000) (40)
The use of software complexity metrics in software reliability modeling (1991) (39)
Intrusion detection in wireless networks using clustering techniques with expert analysis (2005) (39)
First Order Statistics Based Feature Selection: A Diverse and Powerful Family of Feature Seleciton Techniques (2012) (38)
Detection of fault-prone program modules in a very large telecommunications system (1995) (38)
A comparative study of filter-based feature ranking techniques (2010) (37)
Using classification trees for software quality models: lessons learned (1998) (37)
An extensive comparison of feature ranking aggregation techniques in bioinformatics (2012) (37)
Count Models for Software Quality Estimation (2007) (37)
Severely imbalanced Big Data challenges: investigating data sampling approaches (2019) (37)
A Multi-dimensional Comparison of Toolkits for Machine Learning with Big Data (2015) (37)
Empirical Case Studies in Attribute Noise Detection (2005) (36)
An empirical comparison of repetitive undersampling techniques (2009) (36)
Using the genetic algorithm to build optimal neural networks for fault-prone module detection (1996) (35)
A Novel Method for Fraudulent Medicare Claims Detection from Expected Payment Deviations (Application Paper) (2016) (35)
Controlling Overfitting in Classification-Tree Models of Software Quality (2001) (35)
A comparative study of predictive models for program changes during system testing and maintenance (1993) (35)
Metric Selection for Software Defect Prediction (2011) (35)
Predicting the order of fault-prone modules in legacy software (1998) (35)
Generating multiple noise elimination filters with the ensemble-partitioning filter (2004) (34)
The Effect of Data Sampling When Using Random Forest on Imbalanced Bioinformatics Data (2015) (34)
Application of a usage profile in software quality models (1999) (34)
Controlling overfitting in software quality models: experiments with regression trees and classification (2001) (34)
Process measures for predicting software quality (1997) (34)
Cross-Domain Sentiment Analysis: An Empirical Investigation (2016) (34)
Noise identification with the k-means algorithm (2004) (33)
Impact of noise and data sampling on stability of feature ranking techniques for biological datasets (2012) (33)
Active learning with neural networks for intrusion detection (2010) (33)
Reducing Feature Set Explosion to Facilitate Real-World Review Spam Detection (2016) (33)
A Comparative Study of Ordering and Classification of Fault-Prone Software Modules (1999) (33)
Software quality modeling: The impact of class noise on the random forest classifier (2008) (33)
A survey of stability analysis of feature subset selection techniques (2013) (32)
The Effects of Data Sampling with Deep Learning and Highly Imbalanced Big Data (2020) (32)
System regression test planning with a fuzzy expert system (2014) (32)
The Effect of Dataset Size on Training Tweet Sentiment Classifiers (2015) (32)
A literature review on one-class classification and its potential applications in big data (2021) (32)
Predicting fault-prone software modules in embedded systems with classification trees (1999) (31)
PREDICTING SOFTWARE QUALITY, DURING TESTING, USING NEURAL NETWORK MODELS: A COMPARATIVE STUDY (1994) (31)
Stability of Filter- and Wrapper-Based Feature Subset Selection (2013) (31)
Uncertain Classification of Fault-Prone Software Modules (2002) (31)
An Empirical Study on Class Rarity in Big Data (2018) (30)
Designing a Better Data Representation for Deep Neural Networks and Text Classification (2016) (30)
A Probabilistic Programming Approach for Outlier Detection in Healthcare Claims (2016) (30)
Using Ensemble Learners to Improve Classifier Performance on Tweet Sentiment Data (2015) (30)
Class noise detection using frequent itemsets (2006) (30)
Assessment of a New Three-Group Software Quality Classification Technique: An Empirical Case Study (2005) (30)
The lines of code metric as a predictor of program faults: a critical analysis (1990) (30)
The use of decision trees for cost‐sensitive classification: an empirical study in software quality prediction (2011) (29)
Improving usefulness of software quality classification models based on Boolean discriminant functions (2002) (29)
A tree-based classification model for analysis of a military software system (1996) (29)
Efficient image segmentation by mean shift clustering and MDL-guided region merging (2004) (28)
Comparing software fault predictions of pure and zero-inflated Poisson regression models (2005) (28)
Using Imputation Techniques to Help Learn Accurate Classifiers (2008) (28)
Software quality assessment using a multi-strategy classifier (2014) (28)
Classification Performance of Rank Aggregation Techniques for Ensemble Gene Selection (2013) (27)
Detecting noisy instances with the rule-based classification model (2005) (27)
A New Intrusion Detection Benchmarking System (2015) (27)
The Detection of Medicare Fraud Using Machine Learning Methods with Excluded Provider Labels (2018) (27)
Deep Learning and Data Sampling with Imbalanced Big Data (2019) (27)
The effects of class rarity on the evaluation of supervised healthcare fraud detection models (2019) (26)
A neural network modeling methodology for the detection of high-risk programs (1993) (26)
ATTRIBUTE SELECTION USING ROUGH SETS IN SOFTWARE QUALITY CLASSIFICATION (2009) (26)
Ontology-Based Business Process Customization for Composite Web Services (2011) (26)
Software Defect Prediction for High-Dimensional and Class-Imbalanced Data (2011) (26)
The impact of software enhancement on software reliability (1995) (25)
A Mixture Imputation-Boosted Collaborative Filter (2008) (25)
Impact of class distribution on the detection of slow HTTP DoS attacks using Big Data (2019) (25)
Modeling software quality: the Software Measurement Analysis and Reliability Toolkit (2000) (25)
Mean Aggregation versus Robust Rank Aggregation for Ensemble Gene Selection (2012) (25)
Multivariate outlier detection in medicare claims payments applying probabilistic programming methods (2017) (25)
Fuzzy case-based reasoning models for software cost estimation (2004) (25)
Imputation techniques for multivariate missingness in software measurement data (2008) (25)
Detecting Outliers Using Rule-Based Modeling for Improving CBR-Based Software Quality Classification Models (2003) (25)
Medical Provider Specialty Predictions for the Detection of Anomalous Medicare Insurance Claims (2017) (25)
Skewed Class Distributions and Mislabeled Examples (2007) (24)
Ensemble Feature Ranking Methods for Data Intensive Computing Applications (2011) (24)
Collaborative Filtering for Multi-Class Data Using Bayesian Networks (2008) (24)
Social media for polling and predicting United States election outcome (2018) (24)
Random forest implementation and optimization for Big Data analytics on LexisNexis’s high performance computing cluster platform (2019) (24)
Imputed Neighborhood Based Collaborative Filtering (2008) (23)
Combining Feature Subset Selection and Data Sampling for Coping with Highly Imbalanced Software Data (2015) (23)
Data Sampling Approaches with Severely Imbalanced Big Data for Medicare Fraud Detection (2018) (23)
Detecting program modules with low testability (1995) (23)
Survey of Clinical Data Mining Applications on Big Data in Health Informatics (2013) (23)
Examining characteristics of predictive models with imbalanced big data (2019) (22)
A Session Based Approach for Aggregating Network Traffic Data -- The SANTA Dataset (2014) (22)
Detection of SSH Brute Force Attacks Using Aggregated Netflow Data (2015) (22)
Evaluation of maxout activations in deep learning across several big data domains (2019) (22)
Using evolutionary sampling to mine imbalanced data (2007) (22)
Integrating metrics and models for software risk assessment (1996) (22)
Ensemble Feature Selection Technique for Software Quality Classification (2010) (21)
Alternative approaches for the use of metrics to order programs by complexity (1994) (21)
Comparison of approaches to alleviate problems with high-dimensional and class-imbalanced data (2011) (21)
Rule-based noise detection for software measurement data (2004) (21)
Resource-sensitive intrusion detection models for network traffic (2004) (21)
Cost-Benefit Analysis of Software Quality Models (2004) (21)
RUDY Attack: Detection at the Network Level and Its Important Features (2016) (21)
An Empirical Study of the Noise Impact on Cost-Sensitive Learning (2007) (21)
Dynamic system complexity (1993) (21)
High-Dimensional Software Engineering Data and Feature Selection (2009) (21)
Large-scale distributed L-BFGS (2017) (20)
Identifying Medicare Provider Fraud with Unsupervised Machine Learning (2018) (20)
Classification of Ships in Surveillance Video (2006) (20)
Are the principal components of software complexity data stable across software products? (1994) (20)
Semi-supervised learning for software quality estimation (2004) (20)
Stability Analysis of Feature Ranking Techniques on Biological Datasets (2011) (20)
Gradient Boosted Decision Tree Algorithms for Medicare Fraud Detection (2021) (20)
User Behavior Anomaly Detection for Application Layer DDoS Attacks (2017) (20)
Efficient learning from big data for cancer risk modeling: A case study with melanoma (2019) (20)
Robustness of Filter-Based Feature Ranking: A Case Study (2011) (19)
Empirical case studies of combining software quality classification models (2003) (19)
The impact of costs of misclassification on software quality modeling (1997) (19)
Modeling and tracking Covid-19 cases using Big Data analytics on HPCC system platform (2021) (19)
Combining Feature Selection and Ensemble Learning for Software Quality Estimation (2014) (19)
Impact of Data Sampling on Stability of Feature Selection for Software Measurement Data (2011) (19)
Comparing Two New Gene Selection Ensemble Approaches with the Commonly-Used Approach (2012) (19)
Which software modules have faults which will be discovered by customers (1999) (19)
Software Quality Prediction for High-Assurance Network Telecommunications Systems (2001) (19)
Improving tree-based models of software quality with principal components analysis (2000) (18)
Survey on RNN and CRF models for de-identification of medical free text (2020) (18)
Medicare Fraud Detection using CatBoost (2020) (18)
Exploring the Effectiveness of Twitter at Polling the United States 2016 Presidential Election (2017) (18)
Mining Data from Multiple Software Development Projects (2009) (18)
Multivariate assessment of complex software systems: a comparative study (1995) (18)
The impact of software evolution and reuse on software quality (2004) (18)
A Novel Noise Filtering Algorithm for Imbalanced Data (2010) (17)
Investigating Random Undersampling and Feature Selection on Bioinformatics Big Data (2019) (17)
An assessment of software quality in a C++ environment (1995) (17)
Evaluating Feature Selection Methods for Network Intrusion Detection with Kyoto Data (2016) (17)
Using Classifier-Based Nominal Imputation to Improve Machine Learning (2011) (17)
Detecting web attacks using random undersampling and ensemble learners (2021) (17)
Editorial: Special issue on mining low-quality data (2007) (17)
Improving detection of untrustworthy online reviews using ensemble learners combined with feature selection (2017) (16)
Utilizing Netflow Data to Detect Slow Read Attacks (2018) (16)
A parallel and distributed stochastic gradient descent implementation using commodity clusters (2019) (16)
A Survey of Medicare Data Processing and Integration for Fraud Detection (2018) (16)
OCEAN TURBINES — A RELIABILITY ASSESSMENT (2009) (16)
Identifying noise in an attribute of interest (2005) (16)
Predicting Fault-Prone Modules in Embedded Systems Using Analogy-Based Classification Models (2002) (16)
Application of an attribute selection method to CBR-based software quality classification (2003) (16)
Software Engineering with Computational Intelligence (2003) (16)
Performance of CatBoost and XGBoost in Medicare Fraud Detection (2020) (15)
Resource-oriented software quality classification models (2005) (15)
Measuring robustness of Feature Selection techniques on software engineering datasets (2011) (15)
An empirical study of the impact of count models predictions on module-order models (2002) (15)
Robustness of Threshold-Based Feature Rankers with Data Sampling on Noisy and Imbalanced Data (2012) (15)
Improving neural network predictions of software quality using principal components analysis (1994) (15)
A Review of Ensemble Classification for DNA Microarrays Data (2013) (15)
Aggregating performance metrics for classifier evaluation (2009) (15)
Which Software Modules have Faults which will be Discovered by Customers? (1999) (15)
NEURAL NETWORKS FOR SOFTWARE QUALITY PREDICTION (1998) (15)
Simplifying the Utilization of Machine Learning Techniques for Bioinformatics (2013) (15)
An empirical study of program quality during testing and maintenance (1994) (14)
An Empirical Study of Software Metrics Selection Using Support Vector Machine (2011) (14)
A Hybrid Approach to Coping with High Dimensionality and Class Imbalance for Software Defect Prediction (2012) (14)
Comparison of Four Performance Metrics for Evaluating Sampling Techniques for Low Quality Class-Imbalanced Data (2008) (14)
Empirical Assessment of a Software Metric: The Information Content of Operators (2001) (14)
Detecting Noisy Instances with the Ensemble Filter: a Study in Software Quality Estimation (2006) (14)
The Impact of Malicious Accounts on Political Tweet Sentiment (2018) (14)
Assessing uncertain predictions of software quality (1999) (14)
Hidden dependencies between class imbalance and difficulty of learning for bioinformatics datasets (2013) (14)
Investigating the relationship between time and predictive model maintenance (2020) (13)
Selecting the Appropriate Data Sampling Approach for Imbalanced and High-Dimensional Bioinformatics Datasets (2014) (13)
From Web Service Artifact to a Readable and Verifiable Model (2009) (13)
Return on investment of software quality predictions (1998) (13)
Making an accurate classifier ensemble by voting on classifications from imputed learning sets (2009) (13)
A novel dataset-similarity-aware approach for evaluating stability of software metric selection techniques (2012) (13)
Approaches for identifying U.S. medicare fraud in provider claims data (2018) (13)
Noise elimination with partitioning filter for software quality estimation (2006) (13)
Applications of information theory to software engineering measurement (1994) (13)
Modeling fault-prone modules of subsystems (2000) (13)
Feature Selection Algorithms for Mining High Dimensional DNA Microarray Data (2011) (13)
Fuzzy logic techniques for software reliability engineering (2001) (13)
THE USE OF UNDER- AND OVERSAMPLING WITHIN ENSEMBLE FEATURE SELECTION AND CLASSIFICATION FOR SOFTWARE QUALITY PREDICTION (2014) (13)
Which Users Reply to and Interact with Twitter Social Bots? (2013) (13)
Enhancing Ensemble Learners with Data Sampling on High-Dimensional Imbalanced Tweet Sentiment Data (2016) (13)
A Study of Software Metric Selection Techniques: stability Analysis and Defect Prediction Model Performance (2013) (13)
An application of genetic programming to software quality prediction (1998) (13)
Transfer Learning Techniques (2016) (13)
Using product, process, and execution metrics to predict fault-prone software modules with classification trees (2000) (13)
Investigating class rarity in big data (2020) (12)
Stability of filter- and wrapper-based software metric selection techniques (2014) (12)
Boosted Noise Filters for Identifying Mislabeled Data (2005) (12)
Detecting Cybersecurity Attacks Using Different Network Features with LightGBM and XGBoost Learners (2020) (12)
Fault-tolerant software reliability modeling using Petri Nets (1991) (12)
Deep Learning Techniques in Big Data Analytics (2016) (12)
Preparing measurements of legacy software for predicting operational faults (1999) (12)
Using Feature Selection in Combination with Ensemble Learning Techniques to Improve Tweet Sentiment Classification Performance (2015) (12)
Deep Learning and Thresholding with Class-Imbalanced Big Data (2019) (12)
Ensemble vs. Data Sampling: Which Option Is Best Suited to Improve Classification Performance of Imbalanced Bioinformatics Data? (2015) (12)
Software reliability model selection (1992) (12)
Machine Learning in Modeling High School Sport Concussion Symptom Resolve. (2019) (12)
Evaluating indirect and direct classification techniques for network intrusion detection (2005) (12)
Evolutionary data analysis for the class imbalance problem (2010) (12)
Comparison of Stability for Different Families of Filter-Based and Wrapper-Based Feature Selection (2013) (12)
Improving Learner Performance with Data Sampling and Boosting (2008) (12)
Rule-Based Multiple Object Tracking for Traffic Surveillance Using Collaborative Background Extraction (2007) (12)
An Information Theory-Based Approach to Quantifying the Contribution of a Software Metric (1997) (12)
Detecting cybersecurity attacks across different network features and learners (2021) (11)
Data quality in data mining and machine learning (2007) (11)
An Empirical Study on the Stability of Feature Selection for Imbalanced Software Engineering Data (2012) (11)
The application of fuzzy enhanced case-based reasoning for identifying fault-prone modules (1998) (11)
Similarity analysis of feature ranking techniques on imbalanced DNA microarray datasets (2012) (11)
Selecting the Appropriate Ensemble Learning Approach for Balanced Bioinformatics Data (2015) (11)
Module-order modeling using an evolutionary multi-objective optimization approach (2004) (11)
Investigating ARIMA models of software system quality (1995) (11)
CREATING ENTREPRENEURIAL UNIVERSITY (2013) (11)
Exploring Software Quality Classification with a Wrapper-Based Feature Ranking Technique (2009) (11)
Sample size determination for biomedical big data with limited labels (2020) (11)
An Empirical Study on Estimating Motions in Video Stabilization (2007) (11)
Data Mining of Software Development Databases (2001) (11)
A Review and Analysis of the Bot-IoT Dataset (2021) (11)
Identifying modules which do not propagate errors (1999) (10)
Predictive modeling of software quality for very large telecommunications systems (1996) (10)
Evaluation of Wrapper-Based Feature Selection Using Hard, Moderate, and Easy Bioinformatics Data (2014) (10)
Investigating Transfer Learners for Robustness to Domain Class Imbalance (2016) (10)
Rotation invariant face recognition survey (2014) (10)
Software Engineering with Computational Intelligence and Machine Learning A Novel Software Metric Selection Technique Using the Area Under ROC Curves (2010) (10)
Modernizing Analytics for Melanoma with a Large-Scale Research Dataset (2017) (10)
Reliability Evaluation Model of Component‐Based Software Based on Complex Network Theory (2017) (10)
Monitoring Ocean Turbines : a Reliability Assessment (2009) (10)
An application of a rule-based model in software quality classification (2007) (10)
A Procedure for Collecting and Labeling Man-in-the-Middle Attack Traffic (2017) (10)
A COMPARATIVE STUDY OF FILTER-BASED AND WRAPPER-BASED FEATURE RANKING TECHNIQUES FOR SOFTWARE QUALITY MODELING (2011) (10)
Comparing Transfer Learning and Traditional Learning Under Domain Class Imbalance (2017) (10)
Detecting Slow HTTP POST DoS Attacks Using Netflow Features (2019) (10)
Thresholding Strategies for Deep Learning with Highly Imbalanced Big Data (2020) (9)
Classification performance of three approaches for combining data sampling and gene selection on bioinformatics data (2014) (9)
Software Quality Imputation in the Presence of Noisy Data (2006) (9)
Wrapper-Based Feature Ranking for Software Engineering Metrics (2009) (9)
Location-Based Twitter Sentiment Analysis for Predicting the U.S. 2016 Presidential Election (2018) (9)
Gene selection stability's dependence on dataset difficulty (2013) (9)
An Investigation of Transfer Learning and Traditional Machine Learning Algorithms (2016) (9)
The importance of performance metrics within wrapper feature selection (2013) (9)
Threshold Based Optimization of Performance Metrics with Severely Imbalanced Big Security Data (2019) (9)
Software and communications architecture for Prognosis and Health Monitoring of ocean-based power generator (2011) (9)
A Review of Prognostics and Health Monitoring Techniques for Autonomous Ocean Systems (2010) (9)
Random Forest with 200 Selected Features: An Optimal Model for Bioinformatics Research (2013) (9)
Episodic-Memory Performance in Machine Learning Modeling for Predicting Cognitive Health Status Classification (2019) (9)
Evaluation of the importance of data pre-processing order when combining feature selection and data sampling (2012) (8)
An Empirical Study on Wrapper-Based Feature Ranking (2009) (8)
An Empirical Investigation on Wrapper-Based Feature Selection for Predicting Software Quality (2015) (8)
Using Correlation-Based Feature Selection for a Diverse Collection of Bioinformatics Datasets (2014) (8)
Contrast Pattern Mining with Gap Constraints for Peptide Folding Prediction (2008) (8)
Computational Intelligence in Empirical Software Engineering (2004) (8)
A Novel Hybrid Search Algorithm for Feature Selection (2009) (8)
A Study on First Order Statistics-Based Feature Selection Techniques on Software Metric Data (2013) (8)
An Evaluation of Sampling on Filter-Based Feature Selection Methods (2010) (8)
Is Data Sampling Required When Using Random Forest for Classification on Imbalanced Bioinformatics Data? (2016) (8)
Identification of microRNA biomarkers for cancer by combining multiple feature selection techniques (2011) (8)
Fourier transforms for vibration analysis: A review and case study (2011) (8)
Assessment of a Multi-Strategy Classifier for an Embedded Software System (2006) (8)
Applying Feature Selection to Short Time Wavelet Transformed Vibration Data for Reliability Analysis of an Ocean Turbine (2012) (8)
Medical Provider Embeddings for Healthcare Fraud Detection (2021) (8)
Identifying noisy features with the Pairwise Attribute Noise Detection Algorithm (2005) (7)
Using Genetic Programming to Determine Software Quality (1999) (7)
A noise-based stability evaluation of threshold-based feature selection techniques (2011) (7)
Optimizing Wrapper-Based Feature Selection for Use on Bioinformatics Data (2014) (7)
Building Decision Tree Software Quality Classification Models Using Genetic Programming (2003) (7)
Applications of Data Fusion in Monitoring Inaccessible Ocean Machinery (2010) (7)
Feature Extraction for Class Imbalance Using a Convolutional Autoencoder and Data Sampling (2021) (7)
Utility of MemTrax and Machine Learning Modeling in Classification of Mild Cognitive Impairment (2020) (7)
Noise Correction using Bayesian Multiple Imputation (2006) (7)
Integrating Multiple Data Sources to Enhance Sentiment Prediction (2016) (7)
Designing a Testing Framework for Transfer Learning Algorithms (Application Paper) (2016) (7)
Improving software quality estimation by combining feature selection strategies with sampled ensemble learning (2014) (7)
Deep Neural Network Architecture for Character-Level Learning on Short Text (2017) (7)
Software metric-based neural network classification models of a very large telecommunications system (1996) (7)
On the Stability of Feature Selection Methods in Software Quality Prediction: An Empirical Investigation (2015) (7)
Differentiating between Educational Data Mining and Learning Analytics: A Bibliometric Approach (2019) (7)
Evaluating the impact of data quality on sampling (2010) (7)
A reconstruction error-based framework for label noise detection (2021) (7)
Melanoma Risk Prediction with Structured Electronic Health Records (2018) (7)
Utilizing Ensemble, Data Sampling and Feature Selection Techniques for Improving Classification Performance on Tweet Sentiment Data (2015) (7)
An Empirical Study of Predictive Modeling Techniques of Software Quality (2010) (6)
Noise Elimination with Ensemble-Classifier Filtering: A Case-Study in Software Quality Engineerin (2004) (6)
Multivariate Anomaly Detection in Medicare using Model Residuals and Probabilistic Programming (2017) (6)
Feature Level Sensor Fusion for Improved Fault Detection in MCM Systems for Ocean Turbines (2011) (6)
A Comparison of Software Fault Imputation Procedures (2006) (6)
A Comparison of Performance Metrics with Severely Imbalanced Network Security Big Data (2019) (6)
Dynamic Two-phase Truncated Rayleigh Model for Release Date Prediction of Software (2010) (6)
Fault severity in models of fault-correction activity (1995) (6)
The multiple imputation quantitative noise corrector (2007) (6)
A System-Level Modeling Methodology for Performance-Driven Component Selection in Multicore Architectures (2012) (6)
A Text Mining Approach for Anomaly Detection in Application Layer DDoS Attacks (2017) (6)
Investigating Two Approaches for Adding Feature Ranking to Sampled Ensemble Learning for Software Quality Estimation (2015) (6)
The use of generative adversarial networks to alleviate class imbalance in tabular data: a survey (2022) (6)
Filter- and wrapper-based feature selection for predicting user interaction with Twitter bots (2013) (6)
Studying the Effect of Class Imbalance in Ocean Turbine Fault Data on Reliable State Detection (2012) (6)
The effect of measurement approach and noise level on gene selection stability (2012) (6)
Effects of the Use of Boosting on Classification Performance of Imbalanced Bioinformatics Datasets (2014) (6)
Extracting Knowledge from Technical Reports for the Valuation of West Texas Intermediate Crude Oil Futures (2017) (6)
Is Gene Selection Enough for Imbalanced Bioinformatics Data? (2018) (6)
A Survey of 2D Face Databases (2015) (6)
A Dynamometer for an Ocean Turbine Prototype: Reliability through Automated Monitoring (2011) (6)
Exploring an iterative feature selection technique for highly imbalanced data sets (2012) (6)
A Comparative Study of Different Strategies for Predicting Software Quality (2011) (6)
An Investigation of Ensemble Techniques for Detection of Spam Reviews (2016) (6)
The partitioning- and rule-based filter for noise detection (2005) (6)
Impact of Hyperparameter Tuning in Classifying Highly Imbalanced Big Data (2021) (6)
An exploration of learning when data is noisy and imbalanced (2011) (6)
The Use of Ensemble-Based Data Preprocessing Techniques for Software Defect Prediction (2014) (6)
Multiple Imputation of Software Measurement Data: A Case Study (2006) (5)
Approximating general distributions by a uniform coxian distribution (1988) (5)
The Effect of Time on the Maintenance of a Predictive Model (2019) (5)
Stability and Classification Performance of Feature Selection Techniques (2011) (5)
Assuring Timeliness in an e-Science Service-Oriented Architecture (2008) (5)
VoB predictors: Voting on bagging classifications (2008) (5)
Aggregating Data Sampling with Feature Subset Selection to Address Skewed Software Defect Data (2015) (5)
Select-Bagging: Effectively Combining Gene Selection and Bagging for Balanced Bioinformatics Data (2014) (5)
A Hybrid Approach to Cleansing Software Measurement Data (2006) (5)
Using Feature Selection to Determine Optimal Depth for Wavelet Packet Decomposition of Vibration Signals for Ocean System Reliability (2011) (5)
An Empirical Evaluation of Repetitive Undersampling Techniques (2010) (5)
Hierarchical indexing of ocean survey video by mean shift clustering and MDL principle (2005) (5)
Analysis of Transfer Learning Performance Measures (2017) (5)
Using feature selection and classification to build effective and efficient firewalls (2014) (5)
Software quality estimation with case-based reasoning (2004) (5)
Feature Selection for Highly Imbalanced Software Measurement Data (2012) (5)
Impact of Noise and Data Sampling on Stability of Feature Selection (2011) (5)
How the Choice of Wrapper Learner and Performance Metric Affects Subset Evaluation (2013) (5)
Encoding Techniques for High-Cardinality Features and Ensemble Learners (2021) (5)
A novel feature selection technique for highly imbalanced data (2010) (5)
Resource oriented selection of rule-based classification models: An empirical case study (2006) (5)
Semantic Embeddings for Medical Providers and Fraud Detection (2020) (5)
Feature Selection for Optimization of Wavelet Packet Decomposition in Reliability Analysis of Systems (2013) (5)
The effect of noise level and distribution on classification of easy gene microarray data (2014) (4)
The Effect of Number of Iterations on Ensemble Gene Selection (2012) (4)
Feature Selection on Dynamometer Data for Reliability Analysis (2011) (4)
Evaluating Model Predictive Performance: A Medicare Fraud Detection Case Study (2019) (4)
THREE-GROUP SOFTWARE QUALITY CLASSIFICATION MODELING USING AN AUTOMATED REASONING APPROACH (2004) (4)
Maxout Neural Network for Big Data Medical Fraud Detection (2019) (4)
A Comparative Study on the Stability of Software Metric Selection Techniques (2012) (4)
CLASSIFYING SOFTWARE MODULES INTO THREE RISK GROUPS (2004) (4)
Leveraging LightGBM for Categorical Big Data (2021) (4)
A performance analysis of the IBM subsystem control block architecture in a video conferencing environment (1993) (4)
An empirical model of enhancement-induced defect activity in software (1995) (4)
Addressing Class Imbalance in Non-binary Classification Problems (2008) (4)
Maximizing Classification Performance for Patient Response Datasets (2013) (4)
Multi-Objective Optimization by CBR GA-Optimizer for Module-Order Modeling (2004) (4)
Developing an Effective Validation Strategy for Genetic Programming Models Based on Multiple Datasets (2006) (4)
Proceedings, 16th IEEE International Conference on Tools with Artificial Intelligence : ICTAI 2004 : 15-17 November 2004, Boca Raton, Florida (2004) (4)
Building an Effective Classification Model for Breast Cancer Patient Response Data (2015) (4)
WRAPPER-BASED FEATURE RANKING TECHNIQUES FOR DETERMINING RELEVANCE OF SOFTWARE ENGINEERING METRICS (2010) (4)
Predicting sentinel node status in melanoma from a real-world EHR dataset (2017) (4)
Should the Same Learners Be Used Both within Wrapper Feature Selection and for Building Classification Models? (2013) (4)
Comparing Feature Selection Techniques for Software Quality Estimation Using Data-Sampling-Based Boosting Algorithms (2015) (4)
Melanoma risk modeling from limited positive samples (2019) (4)
On the Rarity of Fault-prone Modules in Knowledge-based Software Quality Modeling (2008) (4)
Ensemble Gene Selection Versus Single Gene Selection: Which Is Better? (2013) (4)
Performance evaluation of the communications protocol processor (1990) (4)
A Survey on Classifying Big Data with Label Noise (2022) (4)
The Effects of Data Sampling with Deep Learning and Highly Imbalanced Big Data (2020) (4)
The Effects of Class Label Noise on Highly-Imbalanced Big Data (2021) (4)
Determining noisy instances relative to attributes of interest (2006) (4)
Building a Novel GP-Based Software Quality Classifier Using Multiple Validation Datasets (2007) (4)
Guest Editor's Introduction: Software Metrics (1994) (4)
Learning Curve Estimation with Large Imbalanced Datasets (2019) (4)
Patient response datasets: Challenges and opportunities (2013) (4)
Network Traffic Prediction Models for Near- and Long-Term Predictions (2014) (4)
Exploring filter-based feature selection techniques for software quality classification (2012) (4)
Detection of Phishing Webpages Using Heterogeneous Transfer Learning (2017) (4)
Comparing Approaches for Combining Data Sampling and Feature Selection to Address Key Data Quality Issues in Tweet Sentiment Analysis (2016) (3)
A survey on heterogeneous transfer learning (2017) (3)
KerasBERT: Modeling the Keras Language (2021) (3)
A Progressive Edge-Based Stereo Correspondence Method (2007) (3)
Measuring stability of feature ranking techniques: a noise-based approach (2012) (3)
Investigating the Variation of Ensemble Size on Bagging-Based Classifier Performance in Imbalanced Bioinformatics Datasets (2016) (3)
Comparison of rank-based vs. score-based aggregation for ensemble gene selection (2013) (3)
Detecting Slow Application-Layer DoS Attacks With PCA (2021) (3)
Investigating rarity in web attacks with ensemble learners (2021) (3)
Multiple Imputation of Missing Values in Software Measurement Data (2007) (3)
Overcoming Big Data Challenges (2013) (3)
Performance analysis of advanced I/O architectures for PC-based video servers (1994) (3)
A New Fixed-Overlap Partitioning Algorithm for Determining Stability of Bioinformatics Gene Rankers (2012) (3)
An Information Theoretic Approach to Predicting Software Faults (1998) (3)
Predicting Cancer Relapse with Clinical Data: A Survey of Current Techniques (2016) (3)
Measuring Stability of Feature Selection Techniques on Real-World Software Datasets (2013) (3)
Alterations to the Bootstrapping Process within Random Forest: A Case Study on Imbalanced Bioinformatics Data (2015) (3)
An Empirical Study on Wrapper-Based Feature Selection for Software Engineering Data (2013) (3)
Evaluation of Transfer Learning Algorithms Using Different Base Learners (2017) (3)
TESTING AND FORMAL VERIFICATION OF SERVICE ORIENTED ARCHITECTURES (2009) (3)
A Novel Noise-Resistant Boosting Algorithm for Class-Skewed Data (2012) (3)
Output Thresholding for Ensemble Learners and Imbalanced Big Data (2021) (3)
Canonical modeling of software complexity and fault correction activity (1994) (3)
Using Weather and Playing Surface to Predict the Occurrence of Injury in Major League Soccer Games: A Case Study (2017) (3)
Detection Methods of Slow Read DoS Using Full Packet Capture Data (2020) (3)
Measuring Stability of Threshold-Based Feature Selection Techniques (2011) (3)
Assessments of Feature Selection Techniques with Respect to Data Sampling for Highly Imbalanced Software Measurement Data (2015) (3)
A study on rare fraud predictions with big Medicare claims fraud data (2020) (3)
Labeling Network Event Records for Intrusion Detection in aWireless LAN (2006) (3)
Quality Problem in Software Measurement Data (2006) (3)
Hcpcs2Vec: Healthcare Procedure Embeddings for Medicare Fraud Prediction (2020) (3)
Indirect classification approaches: a comparative study in network intrusion detection (2006) (3)
Robust Thresholding Strategies for Highly Imbalanced and Noisy Data (2021) (3)
An Empirical Investigation of Combining Filter-Based Feature Subset Selection and Data Sampling for Software Defect Prediction (2015) (3)
Arbitrarily-Shaped Window Based Stereo Matching using the Go-Light Optimization Algorithm (2007) (3)
Learning from Software Quality Data with Class Imbalance and Noise (2007) (3)
IoT information theft prediction using ensemble feature selection (2022) (2)
Introduction to the Special Issue on Quality Engineering with Computational Intelligence (2003) (2)
Detecting SQL Injection Web Attacks Using Ensemble Learners and Data Sampling (2021) (2)
Mitigating Class Imbalance for IoT Network Intrusion Detection: A Survey (2021) (2)
Feature Selection for Vibration Sensor Data Transformed by a Streaming Wavelet Packet Decomposition (2011) (2)
Improved Fault-Prone Detection Analysis of Software Modules Using an Evolutionary Neural Network Approach (2003) (2)
Exploring Ensemble-Based Data Preprocessing Techniques for Software Quality Estimation (2013) (2)
A comparative study of iterative and non-iterative feature selection techniques for software defect prediction (2013) (2)
The Effects of Random Undersampling for Big Data Medicare Fraud Detection (2022) (2)
Evaluating noise elimination techniques for software quality estimation (2005) (2)
Using Neural Networks to Predict Software Faults During (1996) (2)
Software Fault Imputation in Noisy and Incomplete Measurement Data (2008) (2)
Comparison of Two Frameworks for Measuring the Stability of Gene-Selection Techniques on Noisy Class-Imbalanced Data (2013) (2)
Detecting Network Attacks Based on Behavioral Commonalities (2016) (2)
Contrasting Undersampled Boosting with Internal and External Feature Selection for Patient Response Datasets (2013) (2)
Toward Model Checking Web Services Over the Web (2008) (2)
Software Metrics: Charting the Course - Guest Editors' Introduction (1994) (2)
The use of balance-aware subsampling for bioinformatics datasets (2013) (2)
Building and Interpreting Risk Models from Imbalanced Clinical Data (2018) (2)
Feature Popularity Between Different Web Attacks with Supervised Feature Selection Rankers (2021) (2)
Detecting Information Theft Attacks in the Bot-IoT Dataset (2021) (2)
The effect of feature extraction and data sampling on credit card fraud detection (2023) (2)
Stability of Three Forms of Feature Selection Methods on Software Engineering Data (2015) (2)
Low-Effort Labeling of Network Events for Intrusion Detection in WLANs (2008) (2)
Determining the Number of Iterations Appropriate for Ensemble Gene Selection on Microarray Data (2012) (2)
Investigating the Generalization of Image Classifiers with Augmented Test Sets (2021) (2)
Experimental Studies on the Impact of Data Sampling with Severely Imbalanced Big Data (2020) (2)
Maxout Networks for Visual Recognition (2019) (2)
Inconsistent M-estimators: nonlinear regression with multiplicative error (1992) (2)
Estimating Outlier Score Probabilities (2017) (2)
Approximating Learning Curves for Imbalanced Big Data with Limited Labels (2019) (2)
Filter-Based Subset Selection for Easy, Moderate, and Hard Bioinformatics Data (2018) (2)
A Short Survey of LSTM Models for De-identification of Medical Free Text (2020) (1)
[32] Functional and Performance Requirements Specification for the Earth Observing System Data and Information Sys- Tem (eosdis) Core System. Revision a and Ch-01 (1996) (1)
Decision Level Fusion of Wavelet Features for Ocean Turbine State Detection (2012) (1)
An Extendible Translation of BPEL to a Machine-verifiable Model (2009) (1)
Deep Learning with Maxout Activations for Visual Recognition and Verification (2019) (1)
Fusing Wavelet Features for Ocean Turbine Fault Detection (2016) (1)
Stability of Filter-Based Feature Selection Methods for Imbalanced Software Measurement Data (2012) (1)
Predicting the Severity of COVID-19 Respiratory Illness with Deep Learning (2022) (1)
Hyperparameter Tuning for Medicare Fraud Detection in Big Data (2022) (1)
Training Convolutional Networks on Truncated Text (2017) (1)
Value-Based Software Quality Modeling (2009) (1)
Data Cleansing for Remote Battery System Monitoring (1)
How ranker and learner choice affects classification performance on noisy bioinformatics data (2014) (1)
Tree-Based Software Quality Classification Using Genetic Programming (2006) (1)
Choosing an Appropriate Ensemble Classifier for Balanced Bioinformatics Data (2015) (1)
Choosing the Best Classification Performance Metric for Wrapper-based Software Metric Selection for Defect Prediction (2014) (1)
Software Quality Modeling with Limited Apriori Defect Data (2009) (1)
Investigation of Maxout Activations on Convolutional Neural Networks for Big Data Text Sentiment Analysis (2019) (1)
A Case Studv in Telecommunications (1996) (1)
Netflow Feature Evaluation for the Detection of Slow Read HTTP Attacks (2020) (1)
Modelling software quality with GP (1999) (1)
Early operational risk assessment of software using fuzzy expert systems (2002) (1)
Fraud Detection with a Limited Number of Known Fraudulent Medicare Providers (2018) (1)
An Exploration of Consistency Learning with Data Augmentation (2022) (1)
A Class-Imbalanced Study with Feature Extraction via PCA and Convolutional Autoencoder (2022) (1)
Investigating New Bootstrapping Approaches of Bagging Classifiers to Account for Class Imbalance in Bioinformatics Datasets (2015) (1)
A Comparison of House Price Classification with Structured and Unstructured Text Data (2022) (1)
A high-level performance analysis of the IBM subsystem control block (SCB) architecture (1993) (1)
Optimizing Ensemble Trees for Big Data Healthcare Fraud Detection (2022) (1)
Analyzing the Impact of Attribute Noise on Software Quality Classification (2008) (1)
Ensemble Coordination for Discrete Event Control (2011) (1)
A RULE-BASED SOFTWARE QUALITY CLASSIFICATION MODEL (2008) (1)
Informative Evaluation Metrics for Highly Imbalanced Big Data Classification (2022) (1)
Analysis and differentiation of software system environments (1996) (1)
How to Optimally Combine Univariate and Multivariate Feature Selection with Data Sampling for Classifying Noisy, High Dimensional and Class Imbalanced DNA Microarray Data# (2020) (1)
Survey of review spam detection using machine learning techniques (2015) (1)
Using Inductive Transfer Learning to Improve Hotel Review Spam Detection (2021) (1)
Can a software quality model hit a moving target? (1998) (1)
Reliability of fault-tolerant software based on a system architecture with a recovery metaprogram (1989) (1)
DYNAMIC MODELS FOR TESTING BASED ON TIME SERIES ANALYSIS (2006) (1)
Data Intensive Computing: A Biomedical Case Study in Gene Selection and Filtering (2011) (1)
A performance analysis of advanced I/O architectures for PC-based network file servers (1994) (1)
Software quality modeling and analysis with limited or without defect data (2005) (1)
2010 Ninth International Conference on Machine Learning and Applications ICMLA 2010 Table of Contents (2010) (1)
Threshold-based feature selection techniques for high-dimensional bioinformatics data (2012) (1)
Survey of Data Cleansing and Monitoring for Large-Scale Battery Backup Installations (2013) (1)
IoT Reconnaissance Attack Classification with Random Undersampling and Ensemble Feature Selection (2021) (1)
Evaluating The Number of Trainable Parameters on Deep Maxout and LReLU Networks for Visual Recognition (2020) (1)
Necessity of Feature Selection when Augmenting Tweet Sentiment Feature Spaces with Emoticons (2016) (1)
Decision Trees for Software Quality Classification (2003) (1)
Encoding High-Dimensional Procedure Codes for Healthcare Fraud Detection (2022) (1)
A Study on Software Metric Selection for Software Fault Prediction (2019) (1)
Flexible hardware architecture for multi-media communications processing (1989) (1)
A Review of Performance Evaluation on 2D Face Databases (2017) (1)
On the impact of software product dissimilarity on software quality models (1994) (1)
Can metrics and models be applied across multiple releases or projects? (1999) (1)
Improving Software Quality Estimation by Combining Boosting and Feature Selection (2013) (1)
High Consequence Systems and Semantic Computing (2013) (1)
A Practical Software Quality Classification Model Using Genetic Programming (2007) (1)
Social media for polling and predicting United States election outcome (2018) (1)
Deep Learning applications for COVID-19 (2021) (0)
Extracting Knowledge from Technical Reports for the Valuation of West Texas Intermediate Crude Oil Futures (2018) (0)
Exploring Language-Interfaced Fine-Tuning for COVID-19 Patient Survival Classification (2022) (0)
Large-scale distributed L-BFGS (2017) (0)
Approaches for identifying U.S. medicare fraud in provider claims data (2018) (0)
Learning from Highly Imbalanced Big Data with Label Noise (2023) (0)
EFFICIENT IMPLEMENTATION AND COMPUTATIONAL ANALYSIS OF PRIVACY-PRESERVING PROTOCOLS FOR SECURING THE FINANCIAL MARKETS by (2018) (0)
A parallel and distributed stochastic gradient descent implementation using commodity clusters (2019) (0)
Severely imbalanced Big Data challenges: investigating data sampling approaches (2019) (0)
Institutional Knowledge at Singapore Management University Ontology-based business process customization for composite web services (2019) (0)
Comparative aspects of software complexity metrics and program modules — a multidimensional scaling approach (1992) (0)
An Examination of Neural Networks on Cluster Computers (2021) (0)
Prediction Error Average (1 Step) Maximum (1 Step) Average (2 Step) Average (3 Step) First Data Set 11% 27% 21% 30% Second Data Set 7% 22% 14% 22% (0)
Exploring Ensemble Filters for Software Defect Prediction (2013) (0)
An Easy-to-Classify Approach for the Bot-IoT Dataset (2021) (0)
An approach to application-layer DoS detection (2023) (0)
A Comparative Approach to Threshold Optimization for Classifying Imbalanced Data (2022) (0)
Efficient Modeling of User-Entity Preference in Big Social Networks (2015) (0)
Software Quality Modeling as a Reliability Tool (2008) (0)
Healthcare Provider Summary Data for Fraud Classification (2022) (0)
Investigating class rarity in big data (2020) (0)
A performance analysis of personal computers in a video conferencing environment (1994) (0)
BASELINE-DIFFERENCING: A NOVEL APPROACH FOR BUILDING GENERALIZABLE OCEAN TURBINE RELIABILITY MODELS (2012) (0)
Improving deep neural network design with new text data representations (2017) (0)
Survey on RNN and CRF models for de-identification of medical free text (2020) (0)
Examining characteristics of predictive models with imbalanced big data (2019) (0)
Cost-Sensitive Ensemble Learning for Highly Imbalanced Classification (2022) (0)
The use of generative adversarial networks to alleviate class imbalance in tabular data: a survey (2022) (0)
The effects of class rarity on the evaluation of supervised healthcare fraud detection models (2019) (0)
IoT information theft prediction using ensemble feature selection (2022) (0)
Feature evaluation for IoT botnet traffic classification (2022) (0)
Evaluating Performance Metrics for Credit Card Fraud Classification (2022) (0)
How to Develop Engineering Entrepreneurship: Case Study – Florida Atlantic University (2014) (0)
Polishing Noise in Continuous Software Measurement Data (2006) (0)
A Poisson Regression Model of Software Quality: A Comparative Study (2006) (0)
Improving neural network models of defect content in complex software systems (1996) (0)
Knowledge and Information Systems REGULAR PAPER (0)
Hitting the Moving Target: Trials and Tribulations of Modeling Quality in Evolving Software Systems (1998) (0)
A Stack Based Multimodal Machine Learning Model for Breast Cancer Diagnosis (2022) (0)
Software measurement for the space shuttle HAL/S maintenance environment (1992) (0)
Melanoma risk modeling from limited positive samples (2019) (0)
Predicting Cyberattacks with Destination Port Through Various Input Feature Scenario (2022) (0)
A Framework of Combining Data Pre-Processing Methods and Boosting for Software Quality Classification (2015) (0)
Cbr-based software quality models and quality of data (2005) (0)
Detecting cybersecurity attacks across different network features and learners (2021) (0)
Composition analysis of the Bot-IoT dataset (2022) (0)
Defining task-to-task dispatch and interrupt response times for real-time systems (1993) (0)
A new feature popularity framework for detecting cyberattacks using popular features (2022) (0)
Video and image analysis using statistical and machine learning techniques (2007) (0)
A workload model for frame-based real-time applications on distributed systems (1992) (0)
A review of data mining using big data in health informatics (2014) (0)
Software Module Risk Analysis (2007) (0)
The Impact of Feature Selection Techniques on a Hybrid Boosting and Data Sampling Approach for Software Quality Estimation (2014) (0)
Mining and Storing Data Streams for Reliability Analysis (2010) (0)
Detecting Web Attacks in Severely Imbalanced Network Traffic Data (2021) (0)
Investigating rarity in web attacks with ensemble learners (2021) (0)
Using Random Undersampling and Ensemble Feature Selection for IoT Attack Prediction (2023) (0)
Investigating the relationship between time and predictive model maintenance (2020) (0)
Editorial (2013) (0)
Proceedings of the 2004 IEEE International Conference on Information Reuse and Integration, IRI - 2004, November 8-10, 2004, Las Vegas Hilton, Las Vegas, NV, USA (2004) (0)
Predicting high-risk program modules by selecting the right software measurements (2011) (0)
Survey on deep learning with class imbalance (2019) (0)
Assessing the Risk of Faults in Software Modules (2001) (0)
Software metrics and the quality of telecommunication software (1992) (0)
Fast and Efficient Hashing for Sequence Similarity Search using Substring Extraction in DNA Sequence Databases (2020) (0)
A survey of transfer learning (2016) (0)
Threshold optimization and random undersampling for imbalanced credit card data (2023) (0)
Detecting SSH and FTP Brute Force Attacks in Big Data (2021) (0)
Rough set-based software quality models and quality of data (2008) (0)
Application of Fuzzy Rule Extraction to Minimize the Costs of Misclassification on Software Quality Modeling (2003) (0)
Empirical Bayes methods in time series analysis (1982) (0)
Software reliability engineering with genetic programming (2003) (0)
Big Data fraud detection using multiple medicare data sources (2018) (0)
Panel: Using information re-use and integration principles in big data (2012) (0)
Verifying the Security Characteristics of a Secure Physical Access Control Protocol (2016) (0)
Evaluation of maxout activations in deep learning across several big data domains (2019) (0)
Guest Editors ’ Introduction Software Metrics : Charting the Course (2001) (0)
VipBoost: A More Accurate Boosting Algorithm (2009) (0)
Big Data and Class Imbalance in Medicare Fraud Detection (2020) (0)
A Comparative Study of Sampled Feature Ranker Ensembles for Software Quality Classification (2012) (0)
Observing the Effect of the Choice of Classifier on Bioinformatics Data with Varying Levels of Data Quality and Class Balance (2015) (0)
Deep learning applications and challenges in big data analytics (2015) (0)
A survey of open source tools for machine learning with big data in the Hadoop ecosystem (2015) (0)
Improving detection of untrustworthy online reviews using ensemble learners combined with feature selection (2017) (0)
Feature list aggregation approaches for ensemble gene selection on patient response datasets (2013) (0)
Performance analysis of a peer-to-peer I/O architecture in video server environments (1995) (0)
Neural network application in support of software reliability engineering (1995) (0)
lVlo deling Fault -Yrone (2000) (0)
An Empirical Study of Software Metric Selection Techniques for Defect Prediction (2012) (0)
The effects of varying class distribution on learner behavior for medicare fraud detection with imbalanced big data (2018) (0)
A reconstruction error-based framework for label noise detection (2021) (0)
Impact of class distribution on the detection of slow HTTP DoS attacks using Big Data (2019) (0)
Comparing Two Approaches for Adding Feature Ranking to Sampled Ensemble Learning for Software Quality Estimation (2014) (0)
Random forest implementation and optimization for Big Data analytics on LexisNexis’s high performance computing cluster platform (2019) (0)
Does the Inclusion of Data Sampling Improve the Performance of Boosting Algorithms on Imbalanced Bioinformatics Data? (2015) (0)
ATTRIBUTE NOISE DETECTION USING MULTI-RESOLUTION ANALYSIS (2006) (0)
Modeling and tracking Covid-19 cases using Big Data analytics on HPCC system platform (2021) (0)
Intrusion detection and Big Heterogeneous Data: a Survey (2015) (0)
IoT attack prediction using big Bot-IoT data (2022) (0)
I/O performance analysis for network file servers (1994) (0)
Assessing the Adaptability of Large Software Systems (2007) (0)
GANs for Class-Imbalanced Data: A Meta-Analysis of GitHub Projects (2022) (0)
VCI predictors: Voting on classifications from imputed learning sets (2008) (0)
A simulation study of the performance of several estimation procedures for linear regression models (1994) (0)
A Study of the Impact of Base Traditional Learners on Transfer Learning Algorithms (2018) (0)
Detecting web attacks using random undersampling and ensemble learners (2021) (0)
A Novel Approach for Unsupervised Learning of Highly-Imbalanced Data (2022) (0)
Sample size determination for biomedical big data with limited labels (2020) (0)
STRATEGY AND APPLICATION OF DATA-DRIVEN TESTING OF AN OCEAN TURBINE DRIVETRAIN (2011) (0)
Comparative Analysis on the Stability of Feature Selection Techniques Using Three Frameworks on Biological Datasets (2013) (0)
Survey on categorical data for neural networks (2020) (0)
Predicting Traffic Incidents in Road Networks Using Vehicle Detector Data (2021) (0)
Evaluating classifier performance with highly imbalanced Big Data (2023) (0)
Proceedings of the 9th [i. e. 12th] IASTED International Conference on Software Engineering and Applications, November 16-18, 2008, Orland, Florida, USA (2008) (0)
A performance analysis of an object-based I/O architecture in a video server environment (1995) (0)
A survey on addressing high-class imbalance in big data (2018) (0)
Performance of Filter-based Feature Subset Selection for Software Quality Data Classification (2014) (0)
Accelerated Deep Learning on HPCC Systems (2020) (0)
Big Data: Deep Learning for financial sentiment analysis (2018) (0)

This paper list is powered by the following services:

What Schools Are Affiliated With Taghi M. Khoshgoftaar?

Taghi M. Khoshgoftaar is affiliated with the following schools:

Florida Atlantic University

Taghi M. Khoshgoftaar's Academic­Influence.com Rankings

Why Is Taghi M. Khoshgoftaar Influential?

Taghi M. Khoshgoftaar's Published Works

Published Works

What Schools Are Affiliated With Taghi M. Khoshgoftaar?

Taghi M. Khoshgoftaar's AcademicInfluence.com Rankings