Taghi M. Khoshgoftaar
#78,925
Most Influential Person Now
Taghi M. Khoshgoftaar's AcademicInfluence.com Rankings
Taghi M. Khoshgoftaarcomputer-science Degrees
Computer Science
#2629
World Rank
#2749
Historical Rank
Data Mining
#26
World Rank
#26
Historical Rank
Machine Learning
#175
World Rank
#176
Historical Rank
Database
#360
World Rank
#376
Historical Rank
Download Badge
Computer Science
Why Is Taghi M. Khoshgoftaar Influential?
(Suggest an Edit or Addition)Taghi M. Khoshgoftaar's Published Works
Published Works
- A survey on Image Data Augmentation for Deep Learning (2019) (4477)
- A Survey of Collaborative Filtering Techniques (2009) (3535)
- Deep learning applications and challenges in big data analytics (2015) (1681)
- A survey of transfer learning (2016) (1483)
- RUSBoost: A Hybrid Approach to Alleviating Class Imbalance (2010) (1349)
- Survey on deep learning with class imbalance (2019) (1082)
- Experimental perspectives on learning from imbalanced data (2007) (682)
- The Detection of Fault-Prone Programs (1992) (495)
- A survey of open source tools for machine learning with big data in the Hadoop ecosystem (2015) (383)
- A survey on addressing high-class imbalance in big data (2018) (351)
- An Empirical Study of Learning from Imbalanced Data Using Random Forest (2007) (329)
- Survey of review spam detection using machine learning techniques (2015) (327)
- A review of data mining using big data in health informatics (2014) (294)
- Choosing software metrics for defect prediction: an investigation on feature selection techniques (2011) (265)
- Intrusion detection and Big Heterogeneous Data: a Survey (2015) (262)
- Predicting Software Development Errors Using Software Complexity Metrics (1990) (254)
- Comparing Boosting and Bagging Techniques With Noisy and Imbalanced Data (2011) (254)
- RUSBoost: Improving classification performance when training data is skewed (2008) (236)
- Application of neural networks to software quality modeling of a very large telecommunications system (1997) (230)
- Early Quality Prediction: A Case Study in Telecommunications (1996) (221)
- Big Data: Deep Learning for financial sentiment analysis (2018) (204)
- Analyzing software measurement data with clustering techniques (2004) (197)
- Knowledge discovery from imbalanced and noisy data (2009) (193)
- Comparative Assessment of Software Quality Classification Techniques: An Empirical Case Study (2004) (188)
- A survey on heterogeneous transfer learning (2017) (185)
- CatBoost for big data: an interdisciplinary review (2020) (172)
- Can neural networks be easily interpreted in software cost estimation? (2002) (169)
- Survey on categorical data for neural networks (2020) (163)
- Tree-based software quality estimation models for fault prediction (2002) (155)
- Fault Prediction Modeling for Software Quality Estimation: Comparing Commonly Used Techniques (2003) (153)
- Attribute Selection and Imbalanced Data: Problems in Software Defect Prediction (2010) (151)
- An Empirical Study of the Classification Performance of Learners on Imbalanced and Noisy Software Quality Data (2007) (147)
- Deep Learning applications for COVID-19 (2021) (147)
- Feature Selection with High-Dimensional Imbalanced Data (2009) (146)
- Evolutionary Optimization of Software Quality Modeling with Multiple Repositories (2010) (144)
- EMERALD: software metrics and models on the desktop (1996) (144)
- A Study on the Relationships of Classifier Performance Metrics (2009) (142)
- Unsupervised learning for expert-based software quality estimation (2004) (141)
- A neural network approach for early detection of program modules having high risk in the maintenance phase (1995) (138)
- CLUSTERING-BASED NETWORK INTRUSION DETECTION (2007) (135)
- LOGISTIC REGRESSION MODELING OF SOFTWARE QUALITY (1999) (130)
- Collaborative Filtering for Multi-class Data Using Belief Nets Algorithms (2006) (129)
- Classification tree models of software quality over multiple releases (1999) (127)
- An application of fuzzy clustering to software quality prediction (2000) (124)
- Identification of fuzzy models of software cost estimation (2004) (123)
- Using regression trees to classify fault-prone software modules (2002) (122)
- Regression modelling of software quality: empirical investigation☆ (1990) (114)
- Improving Software-Quality Predictions With Data Sampling and Boosting (2009) (114)
- Estimating software project effort by analogy based on linguistic values (2002) (114)
- A review of the stability of feature selection techniques for bioinformatics data (2012) (110)
- Measuring coupling and cohesion of software modules: an information-theory approach (2001) (107)
- Improving Software Quality Prediction by Noise Filtering Techniques (2007) (104)
- An empirical study of predicting software faults with case-based reasoning (2006) (102)
- Learning with limited minority class data (2007) (100)
- Big Data fraud detection using multiple medicare data sources (2018) (98)
- The Dimensionality Of Program Complexity (1989) (97)
- A Comparative Study of Ensemble Feature Selection Techniques for Software Defect Prediction (2010) (97)
- Detection of software modules with high debug code churn in a very large legacy system (1996) (96)
- Text Data Augmentation for Deep Learning (2021) (94)
- An application of zero-inflated Poisson regression for software fault prediction (2001) (93)
- Analogy-Based Practical Classification Rules for Software Quality Estimation (2003) (93)
- Incomplete-Case Nearest Neighbor Imputation in Software Measurement Data (2007) (90)
- Software Quality Analysis of Unlabeled Program Modules With Semisupervised Clustering (2007) (88)
- Case-Based Software Quality Prediction (2000) (88)
- Using neural networks to predict software faults during testing (1996) (87)
- Predictive Modeling Techniques of Software Quality from Software Measures (1992) (86)
- Predicting software errors, during development, using nonlinear regression models: a comparative study (1992) (86)
- Software quality estimation with limited fault data: a semi-supervised learning perspective (2007) (86)
- Measuring coupling and cohesion: an information-theory approach (1999) (85)
- Software quality classification modeling using the SPRINT decision tree algorithm (2002) (83)
- A Comprehensive Empirical Study of Count Models for Software Fault Prediction (2007) (81)
- Imputation-boosted collaborative filtering using machine learning classifiers (2008) (80)
- The pairwise attribute noise detection algorithm (2007) (79)
- A comparative study of iterative and non-iterative feature selection techniques for software defect prediction (2014) (79)
- A neural network approach for predicting software development faults (1992) (78)
- A review of statistical and machine learning methods for modeling cancer risk using structured clinical data (2018) (77)
- A comparative study of pattern recognition techniques for quality evaluation of telecommunications software (1994) (75)
- Predicting fault-prone modules with case-based reasoning (1997) (75)
- Application of neural networks for predicting program faults (1995) (74)
- Using Random Undersampling to Alleviate Class Imbalance on Tweet Sentiment Data (2015) (74)
- Applications of a relative complexity metric for software project management (1990) (73)
- Predicting testability of program modules using a neural network (2000) (72)
- Machine prediction of personality from Facebook profiles (2012) (72)
- Hybrid sampling for imbalanced data (2008) (70)
- Enhancing software quality estimation using ensemble-classifier based noise filtering (2005) (70)
- MODELING SOFTWARE QUALITY WITH CLASSIFICATION TREES (2001) (69)
- Evolutionary Sampling and Software Quality Modeling of High-Assurance Systems (2009) (68)
- Supervised Neural Network Modeling: An Empirical Investigation Into Learning From Imbalanced Data With Labeling Errors (2010) (67)
- Using Twitter Content to Predict Psychopathy (2012) (66)
- Using Process History to Predict Software Quality (1998) (66)
- Unsupervised multiscale color image segmentation based on MDL principle (2006) (66)
- How Many Software Metrics Should be Selected for Defect Prediction? (2011) (65)
- Resampling or Reweighting: A Comparison of Boosting Implementations (2008) (65)
- Balancing Misclassification Rates in Classification-Tree Models of Software Quality (2000) (63)
- Identifying learners robust to low quality data (2008) (63)
- A multiobjective module-order model for software quality enhancement (2004) (62)
- A survey on the state of healthcare upcoding fraud analysis and detection (2017) (62)
- Analyzing software quality with limited fault-proneness defect data (2005) (61)
- A survey and analysis of intrusion detection models based on CSE-CIC-IDS2018 Big Data (2020) (60)
- Predicting susceptibility to social bots on Twitter (2013) (60)
- Ordering Fault-Prone Software Modules (2003) (60)
- A practical classification-rule for software-quality models (2000) (59)
- The improved grey model based on particle swarm optimization algorithm for time series prediction (2016) (59)
- Data Mining for Predictors of Software Quality (1999) (59)
- A clustering approach to wireless network intrusion detection (2005) (58)
- The necessity of assuring quality in software measurement data (2004) (57)
- Genetic programming-based decision trees for software quality classification (2003) (56)
- Impact of Feature Selection Techniques for Tweet Sentiment Classification (2015) (56)
- Application of fuzzy expert systems in assessing operational risk of software (2003) (56)
- Feature Selection with Imbalanced Data for Software Defect Prediction (2009) (56)
- Improving deep neural network design with new text data representations (2017) (55)
- Measurement of data structure complexity (1993) (55)
- Modeling the relationship between source code complexity and maintenance difficulty (1994) (54)
- An Empirical Study of Feature Ranking Techniques for Software Quality Prediction (2012) (53)
- Classification of Fault-Prone Software Modules: Prior Probabilities, Costs, and Model Evaluation (1998) (52)
- Medicare fraud detection using neural networks (2019) (52)
- A Comparative Study of Data Sampling and Cost Sensitive Learning (2008) (52)
- Building Useful Models from Imbalanced Data with Sampling and Boosting (2008) (52)
- A Multi-Objective Software Quality Classification Model Using Genetic Programming (2007) (52)
- Software reliability model selection: a cast study (1991) (51)
- Measuring dynamic program complexity (1992) (50)
- Medicare Fraud Detection Using Machine Learning Methods (2017) (50)
- A comprehensive empirical evaluation of missing value imputation in noisy software measurement data (2008) (50)
- Comparative Analysis of DNA Microarray Data through the Use of Feature Selection Techniques (2010) (49)
- Software quality analysis by combining multiple projects and learners (2009) (49)
- Predicting Faults in High Assurance Software (2010) (48)
- Threshold-based feature selection techniques for high-dimensional bioinformatics data (2012) (48)
- Machine Learning for Detecting Brute Force Attacks at the Network Level (2014) (48)
- Hybrid Collaborative Filtering Algorithms Using a Mixture of Experts (2007) (47)
- Predicting Medical Provider Specialties to Detect Anomalous Insurance Claims (2016) (47)
- Reducing overfitting in genetic programming models for software quality classification (2004) (46)
- Software measurement data reduction using ensemble techniques (2012) (46)
- A comparative evaluation of feature ranking methods for high dimensional bioinformatics data (2011) (46)
- Random forest: A reliable tool for patient response prediction (2011) (46)
- A Comparative Study of Threshold-Based Feature Selection Techniques (2010) (46)
- Comparison of Data Sampling Approaches for Imbalanced Bioinformatics Data (2014) (45)
- Exploring the behaviour of neural network software quality models (1995) (44)
- Improving code churn predictions during the system test and maintenance phases (1994) (44)
- An empirical investigation of filter attribute selection techniques for software quality classification (2009) (44)
- Detection of fault-prone software modules during a spiral life cycle (1996) (44)
- Mining Data with Rare Events: A Case Study (2007) (44)
- Genetic programming model for software quality classification (2001) (43)
- Investigating soft computing in case-based reasoning for software cost estimation (2002) (43)
- Cost-sensitive boosting in software quality modeling (2002) (42)
- Evolutionary neural networks: a robust approach to software reliability problems (1997) (42)
- Predicting high-risk program modules by selecting the right software measurements (2012) (42)
- The Effects of Random Undersampling with Simulated Class Imbalance for Big Data (2018) (41)
- Medicare Fraud Detection Using Random Forest with Class Imbalanced Big Data (2018) (41)
- Application of fuzzy expert system in test case selection for system regression test (2005) (40)
- The effects of varying class distribution on learner behavior for medicare fraud detection with imbalanced big data (2018) (40)
- Prediction of software faults using fuzzy nonlinear regression modeling (2000) (40)
- Accuracy of software quality models over multiple releases (2000) (40)
- The use of software complexity metrics in software reliability modeling (1991) (39)
- Intrusion detection in wireless networks using clustering techniques with expert analysis (2005) (39)
- First Order Statistics Based Feature Selection: A Diverse and Powerful Family of Feature Seleciton Techniques (2012) (38)
- Detection of fault-prone program modules in a very large telecommunications system (1995) (38)
- A comparative study of filter-based feature ranking techniques (2010) (37)
- Using classification trees for software quality models: lessons learned (1998) (37)
- An extensive comparison of feature ranking aggregation techniques in bioinformatics (2012) (37)
- Count Models for Software Quality Estimation (2007) (37)
- Severely imbalanced Big Data challenges: investigating data sampling approaches (2019) (37)
- A Multi-dimensional Comparison of Toolkits for Machine Learning with Big Data (2015) (37)
- Empirical Case Studies in Attribute Noise Detection (2005) (36)
- An empirical comparison of repetitive undersampling techniques (2009) (36)
- Using the genetic algorithm to build optimal neural networks for fault-prone module detection (1996) (35)
- A Novel Method for Fraudulent Medicare Claims Detection from Expected Payment Deviations (Application Paper) (2016) (35)
- Controlling Overfitting in Classification-Tree Models of Software Quality (2001) (35)
- A comparative study of predictive models for program changes during system testing and maintenance (1993) (35)
- Metric Selection for Software Defect Prediction (2011) (35)
- Predicting the order of fault-prone modules in legacy software (1998) (35)
- Generating multiple noise elimination filters with the ensemble-partitioning filter (2004) (34)
- The Effect of Data Sampling When Using Random Forest on Imbalanced Bioinformatics Data (2015) (34)
- Application of a usage profile in software quality models (1999) (34)
- Controlling overfitting in software quality models: experiments with regression trees and classification (2001) (34)
- Process measures for predicting software quality (1997) (34)
- Cross-Domain Sentiment Analysis: An Empirical Investigation (2016) (34)
- Noise identification with the k-means algorithm (2004) (33)
- Impact of noise and data sampling on stability of feature ranking techniques for biological datasets (2012) (33)
- Active learning with neural networks for intrusion detection (2010) (33)
- Reducing Feature Set Explosion to Facilitate Real-World Review Spam Detection (2016) (33)
- A Comparative Study of Ordering and Classification of Fault-Prone Software Modules (1999) (33)
- Software quality modeling: The impact of class noise on the random forest classifier (2008) (33)
- A survey of stability analysis of feature subset selection techniques (2013) (32)
- The Effects of Data Sampling with Deep Learning and Highly Imbalanced Big Data (2020) (32)
- System regression test planning with a fuzzy expert system (2014) (32)
- The Effect of Dataset Size on Training Tweet Sentiment Classifiers (2015) (32)
- A literature review on one-class classification and its potential applications in big data (2021) (32)
- Predicting fault-prone software modules in embedded systems with classification trees (1999) (31)
- PREDICTING SOFTWARE QUALITY, DURING TESTING, USING NEURAL NETWORK MODELS: A COMPARATIVE STUDY (1994) (31)
- Stability of Filter- and Wrapper-Based Feature Subset Selection (2013) (31)
- Uncertain Classification of Fault-Prone Software Modules (2002) (31)
- An Empirical Study on Class Rarity in Big Data (2018) (30)
- Designing a Better Data Representation for Deep Neural Networks and Text Classification (2016) (30)
- A Probabilistic Programming Approach for Outlier Detection in Healthcare Claims (2016) (30)
- Using Ensemble Learners to Improve Classifier Performance on Tweet Sentiment Data (2015) (30)
- Class noise detection using frequent itemsets (2006) (30)
- Assessment of a New Three-Group Software Quality Classification Technique: An Empirical Case Study (2005) (30)
- The lines of code metric as a predictor of program faults: a critical analysis (1990) (30)
- The use of decision trees for cost‐sensitive classification: an empirical study in software quality prediction (2011) (29)
- Improving usefulness of software quality classification models based on Boolean discriminant functions (2002) (29)
- A tree-based classification model for analysis of a military software system (1996) (29)
- Efficient image segmentation by mean shift clustering and MDL-guided region merging (2004) (28)
- Comparing software fault predictions of pure and zero-inflated Poisson regression models (2005) (28)
- Using Imputation Techniques to Help Learn Accurate Classifiers (2008) (28)
- Software quality assessment using a multi-strategy classifier (2014) (28)
- Classification Performance of Rank Aggregation Techniques for Ensemble Gene Selection (2013) (27)
- Detecting noisy instances with the rule-based classification model (2005) (27)
- A New Intrusion Detection Benchmarking System (2015) (27)
- The Detection of Medicare Fraud Using Machine Learning Methods with Excluded Provider Labels (2018) (27)
- Deep Learning and Data Sampling with Imbalanced Big Data (2019) (27)
- The effects of class rarity on the evaluation of supervised healthcare fraud detection models (2019) (26)
- A neural network modeling methodology for the detection of high-risk programs (1993) (26)
- ATTRIBUTE SELECTION USING ROUGH SETS IN SOFTWARE QUALITY CLASSIFICATION (2009) (26)
- Ontology-Based Business Process Customization for Composite Web Services (2011) (26)
- Software Defect Prediction for High-Dimensional and Class-Imbalanced Data (2011) (26)
- The impact of software enhancement on software reliability (1995) (25)
- A Mixture Imputation-Boosted Collaborative Filter (2008) (25)
- Impact of class distribution on the detection of slow HTTP DoS attacks using Big Data (2019) (25)
- Modeling software quality: the Software Measurement Analysis and Reliability Toolkit (2000) (25)
- Mean Aggregation versus Robust Rank Aggregation for Ensemble Gene Selection (2012) (25)
- Multivariate outlier detection in medicare claims payments applying probabilistic programming methods (2017) (25)
- Fuzzy case-based reasoning models for software cost estimation (2004) (25)
- Imputation techniques for multivariate missingness in software measurement data (2008) (25)
- Detecting Outliers Using Rule-Based Modeling for Improving CBR-Based Software Quality Classification Models (2003) (25)
- Medical Provider Specialty Predictions for the Detection of Anomalous Medicare Insurance Claims (2017) (25)
- Skewed Class Distributions and Mislabeled Examples (2007) (24)
- Ensemble Feature Ranking Methods for Data Intensive Computing Applications (2011) (24)
- Collaborative Filtering for Multi-Class Data Using Bayesian Networks (2008) (24)
- Social media for polling and predicting United States election outcome (2018) (24)
- Random forest implementation and optimization for Big Data analytics on LexisNexis’s high performance computing cluster platform (2019) (24)
- Imputed Neighborhood Based Collaborative Filtering (2008) (23)
- Combining Feature Subset Selection and Data Sampling for Coping with Highly Imbalanced Software Data (2015) (23)
- Data Sampling Approaches with Severely Imbalanced Big Data for Medicare Fraud Detection (2018) (23)
- Detecting program modules with low testability (1995) (23)
- Survey of Clinical Data Mining Applications on Big Data in Health Informatics (2013) (23)
- Examining characteristics of predictive models with imbalanced big data (2019) (22)
- A Session Based Approach for Aggregating Network Traffic Data -- The SANTA Dataset (2014) (22)
- Detection of SSH Brute Force Attacks Using Aggregated Netflow Data (2015) (22)
- Evaluation of maxout activations in deep learning across several big data domains (2019) (22)
- Using evolutionary sampling to mine imbalanced data (2007) (22)
- Integrating metrics and models for software risk assessment (1996) (22)
- Ensemble Feature Selection Technique for Software Quality Classification (2010) (21)
- Alternative approaches for the use of metrics to order programs by complexity (1994) (21)
- Comparison of approaches to alleviate problems with high-dimensional and class-imbalanced data (2011) (21)
- Rule-based noise detection for software measurement data (2004) (21)
- Resource-sensitive intrusion detection models for network traffic (2004) (21)
- Cost-Benefit Analysis of Software Quality Models (2004) (21)
- RUDY Attack: Detection at the Network Level and Its Important Features (2016) (21)
- An Empirical Study of the Noise Impact on Cost-Sensitive Learning (2007) (21)
- Dynamic system complexity (1993) (21)
- High-Dimensional Software Engineering Data and Feature Selection (2009) (21)
- Large-scale distributed L-BFGS (2017) (20)
- Identifying Medicare Provider Fraud with Unsupervised Machine Learning (2018) (20)
- Classification of Ships in Surveillance Video (2006) (20)
- Are the principal components of software complexity data stable across software products? (1994) (20)
- Semi-supervised learning for software quality estimation (2004) (20)
- Stability Analysis of Feature Ranking Techniques on Biological Datasets (2011) (20)
- Gradient Boosted Decision Tree Algorithms for Medicare Fraud Detection (2021) (20)
- User Behavior Anomaly Detection for Application Layer DDoS Attacks (2017) (20)
- Efficient learning from big data for cancer risk modeling: A case study with melanoma (2019) (20)
- Robustness of Filter-Based Feature Ranking: A Case Study (2011) (19)
- Empirical case studies of combining software quality classification models (2003) (19)
- The impact of costs of misclassification on software quality modeling (1997) (19)
- Modeling and tracking Covid-19 cases using Big Data analytics on HPCC system platform (2021) (19)
- Combining Feature Selection and Ensemble Learning for Software Quality Estimation (2014) (19)
- Impact of Data Sampling on Stability of Feature Selection for Software Measurement Data (2011) (19)
- Comparing Two New Gene Selection Ensemble Approaches with the Commonly-Used Approach (2012) (19)
- Which software modules have faults which will be discovered by customers (1999) (19)
- Software Quality Prediction for High-Assurance Network Telecommunications Systems (2001) (19)
- Improving tree-based models of software quality with principal components analysis (2000) (18)
- Survey on RNN and CRF models for de-identification of medical free text (2020) (18)
- Medicare Fraud Detection using CatBoost (2020) (18)
- Exploring the Effectiveness of Twitter at Polling the United States 2016 Presidential Election (2017) (18)
- Mining Data from Multiple Software Development Projects (2009) (18)
- Multivariate assessment of complex software systems: a comparative study (1995) (18)
- The impact of software evolution and reuse on software quality (2004) (18)
- A Novel Noise Filtering Algorithm for Imbalanced Data (2010) (17)
- Investigating Random Undersampling and Feature Selection on Bioinformatics Big Data (2019) (17)
- An assessment of software quality in a C++ environment (1995) (17)
- Evaluating Feature Selection Methods for Network Intrusion Detection with Kyoto Data (2016) (17)
- Using Classifier-Based Nominal Imputation to Improve Machine Learning (2011) (17)
- Detecting web attacks using random undersampling and ensemble learners (2021) (17)
- Editorial: Special issue on mining low-quality data (2007) (17)
- Improving detection of untrustworthy online reviews using ensemble learners combined with feature selection (2017) (16)
- Utilizing Netflow Data to Detect Slow Read Attacks (2018) (16)
- A parallel and distributed stochastic gradient descent implementation using commodity clusters (2019) (16)
- A Survey of Medicare Data Processing and Integration for Fraud Detection (2018) (16)
- OCEAN TURBINES — A RELIABILITY ASSESSMENT (2009) (16)
- Identifying noise in an attribute of interest (2005) (16)
- Predicting Fault-Prone Modules in Embedded Systems Using Analogy-Based Classification Models (2002) (16)
- Application of an attribute selection method to CBR-based software quality classification (2003) (16)
- Software Engineering with Computational Intelligence (2003) (16)
- Performance of CatBoost and XGBoost in Medicare Fraud Detection (2020) (15)
- Resource-oriented software quality classification models (2005) (15)
- Measuring robustness of Feature Selection techniques on software engineering datasets (2011) (15)
- An empirical study of the impact of count models predictions on module-order models (2002) (15)
- Robustness of Threshold-Based Feature Rankers with Data Sampling on Noisy and Imbalanced Data (2012) (15)
- Improving neural network predictions of software quality using principal components analysis (1994) (15)
- A Review of Ensemble Classification for DNA Microarrays Data (2013) (15)
- Aggregating performance metrics for classifier evaluation (2009) (15)
- Which Software Modules have Faults which will be Discovered by Customers? (1999) (15)
- NEURAL NETWORKS FOR SOFTWARE QUALITY PREDICTION (1998) (15)
- Simplifying the Utilization of Machine Learning Techniques for Bioinformatics (2013) (15)
- An empirical study of program quality during testing and maintenance (1994) (14)
- An Empirical Study of Software Metrics Selection Using Support Vector Machine (2011) (14)
- A Hybrid Approach to Coping with High Dimensionality and Class Imbalance for Software Defect Prediction (2012) (14)
- Comparison of Four Performance Metrics for Evaluating Sampling Techniques for Low Quality Class-Imbalanced Data (2008) (14)
- Empirical Assessment of a Software Metric: The Information Content of Operators (2001) (14)
- Detecting Noisy Instances with the Ensemble Filter: a Study in Software Quality Estimation (2006) (14)
- The Impact of Malicious Accounts on Political Tweet Sentiment (2018) (14)
- Assessing uncertain predictions of software quality (1999) (14)
- Hidden dependencies between class imbalance and difficulty of learning for bioinformatics datasets (2013) (14)
- Investigating the relationship between time and predictive model maintenance (2020) (13)
- Selecting the Appropriate Data Sampling Approach for Imbalanced and High-Dimensional Bioinformatics Datasets (2014) (13)
- From Web Service Artifact to a Readable and Verifiable Model (2009) (13)
- Return on investment of software quality predictions (1998) (13)
- Making an accurate classifier ensemble by voting on classifications from imputed learning sets (2009) (13)
- A novel dataset-similarity-aware approach for evaluating stability of software metric selection techniques (2012) (13)
- Approaches for identifying U.S. medicare fraud in provider claims data (2018) (13)
- Noise elimination with partitioning filter for software quality estimation (2006) (13)
- Applications of information theory to software engineering measurement (1994) (13)
- Modeling fault-prone modules of subsystems (2000) (13)
- Feature Selection Algorithms for Mining High Dimensional DNA Microarray Data (2011) (13)
- Fuzzy logic techniques for software reliability engineering (2001) (13)
- THE USE OF UNDER- AND OVERSAMPLING WITHIN ENSEMBLE FEATURE SELECTION AND CLASSIFICATION FOR SOFTWARE QUALITY PREDICTION (2014) (13)
- Which Users Reply to and Interact with Twitter Social Bots? (2013) (13)
- Enhancing Ensemble Learners with Data Sampling on High-Dimensional Imbalanced Tweet Sentiment Data (2016) (13)
- A Study of Software Metric Selection Techniques: stability Analysis and Defect Prediction Model Performance (2013) (13)
- An application of genetic programming to software quality prediction (1998) (13)
- Transfer Learning Techniques (2016) (13)
- Using product, process, and execution metrics to predict fault-prone software modules with classification trees (2000) (13)
- Investigating class rarity in big data (2020) (12)
- Stability of filter- and wrapper-based software metric selection techniques (2014) (12)
- Boosted Noise Filters for Identifying Mislabeled Data (2005) (12)
- Detecting Cybersecurity Attacks Using Different Network Features with LightGBM and XGBoost Learners (2020) (12)
- Fault-tolerant software reliability modeling using Petri Nets (1991) (12)
- Deep Learning Techniques in Big Data Analytics (2016) (12)
- Preparing measurements of legacy software for predicting operational faults (1999) (12)
- Using Feature Selection in Combination with Ensemble Learning Techniques to Improve Tweet Sentiment Classification Performance (2015) (12)
- Deep Learning and Thresholding with Class-Imbalanced Big Data (2019) (12)
- Ensemble vs. Data Sampling: Which Option Is Best Suited to Improve Classification Performance of Imbalanced Bioinformatics Data? (2015) (12)
- Software reliability model selection (1992) (12)
- Machine Learning in Modeling High School Sport Concussion Symptom Resolve. (2019) (12)
- Evaluating indirect and direct classification techniques for network intrusion detection (2005) (12)
- Evolutionary data analysis for the class imbalance problem (2010) (12)
- Comparison of Stability for Different Families of Filter-Based and Wrapper-Based Feature Selection (2013) (12)
- Improving Learner Performance with Data Sampling and Boosting (2008) (12)
- Rule-Based Multiple Object Tracking for Traffic Surveillance Using Collaborative Background Extraction (2007) (12)
- An Information Theory-Based Approach to Quantifying the Contribution of a Software Metric (1997) (12)
- Detecting cybersecurity attacks across different network features and learners (2021) (11)
- Data quality in data mining and machine learning (2007) (11)
- An Empirical Study on the Stability of Feature Selection for Imbalanced Software Engineering Data (2012) (11)
- The application of fuzzy enhanced case-based reasoning for identifying fault-prone modules (1998) (11)
- Similarity analysis of feature ranking techniques on imbalanced DNA microarray datasets (2012) (11)
- Selecting the Appropriate Ensemble Learning Approach for Balanced Bioinformatics Data (2015) (11)
- Module-order modeling using an evolutionary multi-objective optimization approach (2004) (11)
- Investigating ARIMA models of software system quality (1995) (11)
- CREATING ENTREPRENEURIAL UNIVERSITY (2013) (11)
- Exploring Software Quality Classification with a Wrapper-Based Feature Ranking Technique (2009) (11)
- Sample size determination for biomedical big data with limited labels (2020) (11)
- An Empirical Study on Estimating Motions in Video Stabilization (2007) (11)
- Data Mining of Software Development Databases (2001) (11)
- A Review and Analysis of the Bot-IoT Dataset (2021) (11)
- Identifying modules which do not propagate errors (1999) (10)
- Predictive modeling of software quality for very large telecommunications systems (1996) (10)
- Evaluation of Wrapper-Based Feature Selection Using Hard, Moderate, and Easy Bioinformatics Data (2014) (10)
- Investigating Transfer Learners for Robustness to Domain Class Imbalance (2016) (10)
- Rotation invariant face recognition survey (2014) (10)
- Software Engineering with Computational Intelligence and Machine Learning A Novel Software Metric Selection Technique Using the Area Under ROC Curves (2010) (10)
- Modernizing Analytics for Melanoma with a Large-Scale Research Dataset (2017) (10)
- Reliability Evaluation Model of Component‐Based Software Based on Complex Network Theory (2017) (10)
- Monitoring Ocean Turbines : a Reliability Assessment (2009) (10)
- An application of a rule-based model in software quality classification (2007) (10)
- A Procedure for Collecting and Labeling Man-in-the-Middle Attack Traffic (2017) (10)
- A COMPARATIVE STUDY OF FILTER-BASED AND WRAPPER-BASED FEATURE RANKING TECHNIQUES FOR SOFTWARE QUALITY MODELING (2011) (10)
- Comparing Transfer Learning and Traditional Learning Under Domain Class Imbalance (2017) (10)
- Detecting Slow HTTP POST DoS Attacks Using Netflow Features (2019) (10)
- Thresholding Strategies for Deep Learning with Highly Imbalanced Big Data (2020) (9)
- Classification performance of three approaches for combining data sampling and gene selection on bioinformatics data (2014) (9)
- Software Quality Imputation in the Presence of Noisy Data (2006) (9)
- Wrapper-Based Feature Ranking for Software Engineering Metrics (2009) (9)
- Location-Based Twitter Sentiment Analysis for Predicting the U.S. 2016 Presidential Election (2018) (9)
- Gene selection stability's dependence on dataset difficulty (2013) (9)
- An Investigation of Transfer Learning and Traditional Machine Learning Algorithms (2016) (9)
- The importance of performance metrics within wrapper feature selection (2013) (9)
- Threshold Based Optimization of Performance Metrics with Severely Imbalanced Big Security Data (2019) (9)
- Software and communications architecture for Prognosis and Health Monitoring of ocean-based power generator (2011) (9)
- A Review of Prognostics and Health Monitoring Techniques for Autonomous Ocean Systems (2010) (9)
- Random Forest with 200 Selected Features: An Optimal Model for Bioinformatics Research (2013) (9)
- Episodic-Memory Performance in Machine Learning Modeling for Predicting Cognitive Health Status Classification (2019) (9)
- Evaluation of the importance of data pre-processing order when combining feature selection and data sampling (2012) (8)
- An Empirical Study on Wrapper-Based Feature Ranking (2009) (8)
- An Empirical Investigation on Wrapper-Based Feature Selection for Predicting Software Quality (2015) (8)
- Using Correlation-Based Feature Selection for a Diverse Collection of Bioinformatics Datasets (2014) (8)
- Contrast Pattern Mining with Gap Constraints for Peptide Folding Prediction (2008) (8)
- Computational Intelligence in Empirical Software Engineering (2004) (8)
- A Novel Hybrid Search Algorithm for Feature Selection (2009) (8)
- A Study on First Order Statistics-Based Feature Selection Techniques on Software Metric Data (2013) (8)
- An Evaluation of Sampling on Filter-Based Feature Selection Methods (2010) (8)
- Is Data Sampling Required When Using Random Forest for Classification on Imbalanced Bioinformatics Data? (2016) (8)
- Identification of microRNA biomarkers for cancer by combining multiple feature selection techniques (2011) (8)
- Fourier transforms for vibration analysis: A review and case study (2011) (8)
- Assessment of a Multi-Strategy Classifier for an Embedded Software System (2006) (8)
- Applying Feature Selection to Short Time Wavelet Transformed Vibration Data for Reliability Analysis of an Ocean Turbine (2012) (8)
- Medical Provider Embeddings for Healthcare Fraud Detection (2021) (8)
- Identifying noisy features with the Pairwise Attribute Noise Detection Algorithm (2005) (7)
- Using Genetic Programming to Determine Software Quality (1999) (7)
- A noise-based stability evaluation of threshold-based feature selection techniques (2011) (7)
- Optimizing Wrapper-Based Feature Selection for Use on Bioinformatics Data (2014) (7)
- Building Decision Tree Software Quality Classification Models Using Genetic Programming (2003) (7)
- Applications of Data Fusion in Monitoring Inaccessible Ocean Machinery (2010) (7)
- Feature Extraction for Class Imbalance Using a Convolutional Autoencoder and Data Sampling (2021) (7)
- Utility of MemTrax and Machine Learning Modeling in Classification of Mild Cognitive Impairment (2020) (7)
- Noise Correction using Bayesian Multiple Imputation (2006) (7)
- Integrating Multiple Data Sources to Enhance Sentiment Prediction (2016) (7)
- Designing a Testing Framework for Transfer Learning Algorithms (Application Paper) (2016) (7)
- Improving software quality estimation by combining feature selection strategies with sampled ensemble learning (2014) (7)
- Deep Neural Network Architecture for Character-Level Learning on Short Text (2017) (7)
- Software metric-based neural network classification models of a very large telecommunications system (1996) (7)
- On the Stability of Feature Selection Methods in Software Quality Prediction: An Empirical Investigation (2015) (7)
- Differentiating between Educational Data Mining and Learning Analytics: A Bibliometric Approach (2019) (7)
- Evaluating the impact of data quality on sampling (2010) (7)
- A reconstruction error-based framework for label noise detection (2021) (7)
- Melanoma Risk Prediction with Structured Electronic Health Records (2018) (7)
- Utilizing Ensemble, Data Sampling and Feature Selection Techniques for Improving Classification Performance on Tweet Sentiment Data (2015) (7)
- An Empirical Study of Predictive Modeling Techniques of Software Quality (2010) (6)
- Noise Elimination with Ensemble-Classifier Filtering: A Case-Study in Software Quality Engineerin (2004) (6)
- Multivariate Anomaly Detection in Medicare using Model Residuals and Probabilistic Programming (2017) (6)
- Feature Level Sensor Fusion for Improved Fault Detection in MCM Systems for Ocean Turbines (2011) (6)
- A Comparison of Software Fault Imputation Procedures (2006) (6)
- A Comparison of Performance Metrics with Severely Imbalanced Network Security Big Data (2019) (6)
- Dynamic Two-phase Truncated Rayleigh Model for Release Date Prediction of Software (2010) (6)
- Fault severity in models of fault-correction activity (1995) (6)
- The multiple imputation quantitative noise corrector (2007) (6)
- A System-Level Modeling Methodology for Performance-Driven Component Selection in Multicore Architectures (2012) (6)
- A Text Mining Approach for Anomaly Detection in Application Layer DDoS Attacks (2017) (6)
- Investigating Two Approaches for Adding Feature Ranking to Sampled Ensemble Learning for Software Quality Estimation (2015) (6)
- The use of generative adversarial networks to alleviate class imbalance in tabular data: a survey (2022) (6)
- Filter- and wrapper-based feature selection for predicting user interaction with Twitter bots (2013) (6)
- Studying the Effect of Class Imbalance in Ocean Turbine Fault Data on Reliable State Detection (2012) (6)
- The effect of measurement approach and noise level on gene selection stability (2012) (6)
- Effects of the Use of Boosting on Classification Performance of Imbalanced Bioinformatics Datasets (2014) (6)
- Extracting Knowledge from Technical Reports for the Valuation of West Texas Intermediate Crude Oil Futures (2017) (6)
- Is Gene Selection Enough for Imbalanced Bioinformatics Data? (2018) (6)
- A Survey of 2D Face Databases (2015) (6)
- A Dynamometer for an Ocean Turbine Prototype: Reliability through Automated Monitoring (2011) (6)
- Exploring an iterative feature selection technique for highly imbalanced data sets (2012) (6)
- A Comparative Study of Different Strategies for Predicting Software Quality (2011) (6)
- An Investigation of Ensemble Techniques for Detection of Spam Reviews (2016) (6)
- The partitioning- and rule-based filter for noise detection (2005) (6)
- Impact of Hyperparameter Tuning in Classifying Highly Imbalanced Big Data (2021) (6)
- An exploration of learning when data is noisy and imbalanced (2011) (6)
- The Use of Ensemble-Based Data Preprocessing Techniques for Software Defect Prediction (2014) (6)
- Multiple Imputation of Software Measurement Data: A Case Study (2006) (5)
- Approximating general distributions by a uniform coxian distribution (1988) (5)
- The Effect of Time on the Maintenance of a Predictive Model (2019) (5)
- Stability and Classification Performance of Feature Selection Techniques (2011) (5)
- Assuring Timeliness in an e-Science Service-Oriented Architecture (2008) (5)
- VoB predictors: Voting on bagging classifications (2008) (5)
- Aggregating Data Sampling with Feature Subset Selection to Address Skewed Software Defect Data (2015) (5)
- Select-Bagging: Effectively Combining Gene Selection and Bagging for Balanced Bioinformatics Data (2014) (5)
- A Hybrid Approach to Cleansing Software Measurement Data (2006) (5)
- Using Feature Selection to Determine Optimal Depth for Wavelet Packet Decomposition of Vibration Signals for Ocean System Reliability (2011) (5)
- An Empirical Evaluation of Repetitive Undersampling Techniques (2010) (5)
- Hierarchical indexing of ocean survey video by mean shift clustering and MDL principle (2005) (5)
- Analysis of Transfer Learning Performance Measures (2017) (5)
- Using feature selection and classification to build effective and efficient firewalls (2014) (5)
- Software quality estimation with case-based reasoning (2004) (5)
- Feature Selection for Highly Imbalanced Software Measurement Data (2012) (5)
- Impact of Noise and Data Sampling on Stability of Feature Selection (2011) (5)
- How the Choice of Wrapper Learner and Performance Metric Affects Subset Evaluation (2013) (5)
- Encoding Techniques for High-Cardinality Features and Ensemble Learners (2021) (5)
- A novel feature selection technique for highly imbalanced data (2010) (5)
- Resource oriented selection of rule-based classification models: An empirical case study (2006) (5)
- Semantic Embeddings for Medical Providers and Fraud Detection (2020) (5)
- Feature Selection for Optimization of Wavelet Packet Decomposition in Reliability Analysis of Systems (2013) (5)
- The effect of noise level and distribution on classification of easy gene microarray data (2014) (4)
- The Effect of Number of Iterations on Ensemble Gene Selection (2012) (4)
- Feature Selection on Dynamometer Data for Reliability Analysis (2011) (4)
- Evaluating Model Predictive Performance: A Medicare Fraud Detection Case Study (2019) (4)
- THREE-GROUP SOFTWARE QUALITY CLASSIFICATION MODELING USING AN AUTOMATED REASONING APPROACH (2004) (4)
- Maxout Neural Network for Big Data Medical Fraud Detection (2019) (4)
- A Comparative Study on the Stability of Software Metric Selection Techniques (2012) (4)
- CLASSIFYING SOFTWARE MODULES INTO THREE RISK GROUPS (2004) (4)
- Leveraging LightGBM for Categorical Big Data (2021) (4)
- A performance analysis of the IBM subsystem control block architecture in a video conferencing environment (1993) (4)
- An empirical model of enhancement-induced defect activity in software (1995) (4)
- Addressing Class Imbalance in Non-binary Classification Problems (2008) (4)
- Maximizing Classification Performance for Patient Response Datasets (2013) (4)
- Multi-Objective Optimization by CBR GA-Optimizer for Module-Order Modeling (2004) (4)
- Developing an Effective Validation Strategy for Genetic Programming Models Based on Multiple Datasets (2006) (4)
- Proceedings, 16th IEEE International Conference on Tools with Artificial Intelligence : ICTAI 2004 : 15-17 November 2004, Boca Raton, Florida (2004) (4)
- Building an Effective Classification Model for Breast Cancer Patient Response Data (2015) (4)
- WRAPPER-BASED FEATURE RANKING TECHNIQUES FOR DETERMINING RELEVANCE OF SOFTWARE ENGINEERING METRICS (2010) (4)
- Predicting sentinel node status in melanoma from a real-world EHR dataset (2017) (4)
- Should the Same Learners Be Used Both within Wrapper Feature Selection and for Building Classification Models? (2013) (4)
- Comparing Feature Selection Techniques for Software Quality Estimation Using Data-Sampling-Based Boosting Algorithms (2015) (4)
- Melanoma risk modeling from limited positive samples (2019) (4)
- On the Rarity of Fault-prone Modules in Knowledge-based Software Quality Modeling (2008) (4)
- Ensemble Gene Selection Versus Single Gene Selection: Which Is Better? (2013) (4)
- Performance evaluation of the communications protocol processor (1990) (4)
- A Survey on Classifying Big Data with Label Noise (2022) (4)
- The Effects of Data Sampling with Deep Learning and Highly Imbalanced Big Data (2020) (4)
- The Effects of Class Label Noise on Highly-Imbalanced Big Data (2021) (4)
- Determining noisy instances relative to attributes of interest (2006) (4)
- Building a Novel GP-Based Software Quality Classifier Using Multiple Validation Datasets (2007) (4)
- Guest Editor's Introduction: Software Metrics (1994) (4)
- Learning Curve Estimation with Large Imbalanced Datasets (2019) (4)
- Patient response datasets: Challenges and opportunities (2013) (4)
- Network Traffic Prediction Models for Near- and Long-Term Predictions (2014) (4)
- Exploring filter-based feature selection techniques for software quality classification (2012) (4)
- Detection of Phishing Webpages Using Heterogeneous Transfer Learning (2017) (4)
- Comparing Approaches for Combining Data Sampling and Feature Selection to Address Key Data Quality Issues in Tweet Sentiment Analysis (2016) (3)
- A survey on heterogeneous transfer learning (2017) (3)
- KerasBERT: Modeling the Keras Language (2021) (3)
- A Progressive Edge-Based Stereo Correspondence Method (2007) (3)
- Measuring stability of feature ranking techniques: a noise-based approach (2012) (3)
- Investigating the Variation of Ensemble Size on Bagging-Based Classifier Performance in Imbalanced Bioinformatics Datasets (2016) (3)
- Comparison of rank-based vs. score-based aggregation for ensemble gene selection (2013) (3)
- Detecting Slow Application-Layer DoS Attacks With PCA (2021) (3)
- Investigating rarity in web attacks with ensemble learners (2021) (3)
- Multiple Imputation of Missing Values in Software Measurement Data (2007) (3)
- Overcoming Big Data Challenges (2013) (3)
- Performance analysis of advanced I/O architectures for PC-based video servers (1994) (3)
- A New Fixed-Overlap Partitioning Algorithm for Determining Stability of Bioinformatics Gene Rankers (2012) (3)
- An Information Theoretic Approach to Predicting Software Faults (1998) (3)
- Predicting Cancer Relapse with Clinical Data: A Survey of Current Techniques (2016) (3)
- Measuring Stability of Feature Selection Techniques on Real-World Software Datasets (2013) (3)
- Alterations to the Bootstrapping Process within Random Forest: A Case Study on Imbalanced Bioinformatics Data (2015) (3)
- An Empirical Study on Wrapper-Based Feature Selection for Software Engineering Data (2013) (3)
- Evaluation of Transfer Learning Algorithms Using Different Base Learners (2017) (3)
- TESTING AND FORMAL VERIFICATION OF SERVICE ORIENTED ARCHITECTURES (2009) (3)
- A Novel Noise-Resistant Boosting Algorithm for Class-Skewed Data (2012) (3)
- Output Thresholding for Ensemble Learners and Imbalanced Big Data (2021) (3)
- Canonical modeling of software complexity and fault correction activity (1994) (3)
- Using Weather and Playing Surface to Predict the Occurrence of Injury in Major League Soccer Games: A Case Study (2017) (3)
- Detection Methods of Slow Read DoS Using Full Packet Capture Data (2020) (3)
- Measuring Stability of Threshold-Based Feature Selection Techniques (2011) (3)
- Assessments of Feature Selection Techniques with Respect to Data Sampling for Highly Imbalanced Software Measurement Data (2015) (3)
- A study on rare fraud predictions with big Medicare claims fraud data (2020) (3)
- Labeling Network Event Records for Intrusion Detection in aWireless LAN (2006) (3)
- Quality Problem in Software Measurement Data (2006) (3)
- Hcpcs2Vec: Healthcare Procedure Embeddings for Medicare Fraud Prediction (2020) (3)
- Indirect classification approaches: a comparative study in network intrusion detection (2006) (3)
- Robust Thresholding Strategies for Highly Imbalanced and Noisy Data (2021) (3)
- An Empirical Investigation of Combining Filter-Based Feature Subset Selection and Data Sampling for Software Defect Prediction (2015) (3)
- Arbitrarily-Shaped Window Based Stereo Matching using the Go-Light Optimization Algorithm (2007) (3)
- Learning from Software Quality Data with Class Imbalance and Noise (2007) (3)
- IoT information theft prediction using ensemble feature selection (2022) (2)
- Introduction to the Special Issue on Quality Engineering with Computational Intelligence (2003) (2)
- Detecting SQL Injection Web Attacks Using Ensemble Learners and Data Sampling (2021) (2)
- Mitigating Class Imbalance for IoT Network Intrusion Detection: A Survey (2021) (2)
- Feature Selection for Vibration Sensor Data Transformed by a Streaming Wavelet Packet Decomposition (2011) (2)
- Improved Fault-Prone Detection Analysis of Software Modules Using an Evolutionary Neural Network Approach (2003) (2)
- Exploring Ensemble-Based Data Preprocessing Techniques for Software Quality Estimation (2013) (2)
- A comparative study of iterative and non-iterative feature selection techniques for software defect prediction (2013) (2)
- The Effects of Random Undersampling for Big Data Medicare Fraud Detection (2022) (2)
- Evaluating noise elimination techniques for software quality estimation (2005) (2)
- Using Neural Networks to Predict Software Faults During (1996) (2)
- Software Fault Imputation in Noisy and Incomplete Measurement Data (2008) (2)
- Comparison of Two Frameworks for Measuring the Stability of Gene-Selection Techniques on Noisy Class-Imbalanced Data (2013) (2)
- Detecting Network Attacks Based on Behavioral Commonalities (2016) (2)
- Contrasting Undersampled Boosting with Internal and External Feature Selection for Patient Response Datasets (2013) (2)
- Toward Model Checking Web Services Over the Web (2008) (2)
- Software Metrics: Charting the Course - Guest Editors' Introduction (1994) (2)
- The use of balance-aware subsampling for bioinformatics datasets (2013) (2)
- Building and Interpreting Risk Models from Imbalanced Clinical Data (2018) (2)
- Feature Popularity Between Different Web Attacks with Supervised Feature Selection Rankers (2021) (2)
- Detecting Information Theft Attacks in the Bot-IoT Dataset (2021) (2)
- The effect of feature extraction and data sampling on credit card fraud detection (2023) (2)
- Stability of Three Forms of Feature Selection Methods on Software Engineering Data (2015) (2)
- Low-Effort Labeling of Network Events for Intrusion Detection in WLANs (2008) (2)
- Determining the Number of Iterations Appropriate for Ensemble Gene Selection on Microarray Data (2012) (2)
- Investigating the Generalization of Image Classifiers with Augmented Test Sets (2021) (2)
- Experimental Studies on the Impact of Data Sampling with Severely Imbalanced Big Data (2020) (2)
- Maxout Networks for Visual Recognition (2019) (2)
- Inconsistent M-estimators: nonlinear regression with multiplicative error (1992) (2)
- Estimating Outlier Score Probabilities (2017) (2)
- Approximating Learning Curves for Imbalanced Big Data with Limited Labels (2019) (2)
- Filter-Based Subset Selection for Easy, Moderate, and Hard Bioinformatics Data (2018) (2)
- A Short Survey of LSTM Models for De-identification of Medical Free Text (2020) (1)
- [32] Functional and Performance Requirements Specification for the Earth Observing System Data and Information Sys- Tem (eosdis) Core System. Revision a and Ch-01 (1996) (1)
- Decision Level Fusion of Wavelet Features for Ocean Turbine State Detection (2012) (1)
- An Extendible Translation of BPEL to a Machine-verifiable Model (2009) (1)
- Deep Learning with Maxout Activations for Visual Recognition and Verification (2019) (1)
- Fusing Wavelet Features for Ocean Turbine Fault Detection (2016) (1)
- Stability of Filter-Based Feature Selection Methods for Imbalanced Software Measurement Data (2012) (1)
- Predicting the Severity of COVID-19 Respiratory Illness with Deep Learning (2022) (1)
- Hyperparameter Tuning for Medicare Fraud Detection in Big Data (2022) (1)
- Training Convolutional Networks on Truncated Text (2017) (1)
- Value-Based Software Quality Modeling (2009) (1)
- Data Cleansing for Remote Battery System Monitoring (1)
- How ranker and learner choice affects classification performance on noisy bioinformatics data (2014) (1)
- Tree-Based Software Quality Classification Using Genetic Programming (2006) (1)
- Choosing an Appropriate Ensemble Classifier for Balanced Bioinformatics Data (2015) (1)
- Choosing the Best Classification Performance Metric for Wrapper-based Software Metric Selection for Defect Prediction (2014) (1)
- Software Quality Modeling with Limited Apriori Defect Data (2009) (1)
- Investigation of Maxout Activations on Convolutional Neural Networks for Big Data Text Sentiment Analysis (2019) (1)
- A Case Studv in Telecommunications (1996) (1)
- Netflow Feature Evaluation for the Detection of Slow Read HTTP Attacks (2020) (1)
- Modelling software quality with GP (1999) (1)
- Early operational risk assessment of software using fuzzy expert systems (2002) (1)
- Fraud Detection with a Limited Number of Known Fraudulent Medicare Providers (2018) (1)
- An Exploration of Consistency Learning with Data Augmentation (2022) (1)
- A Class-Imbalanced Study with Feature Extraction via PCA and Convolutional Autoencoder (2022) (1)
- Investigating New Bootstrapping Approaches of Bagging Classifiers to Account for Class Imbalance in Bioinformatics Datasets (2015) (1)
- A Comparison of House Price Classification with Structured and Unstructured Text Data (2022) (1)
- A high-level performance analysis of the IBM subsystem control block (SCB) architecture (1993) (1)
- Optimizing Ensemble Trees for Big Data Healthcare Fraud Detection (2022) (1)
- Analyzing the Impact of Attribute Noise on Software Quality Classification (2008) (1)
- Ensemble Coordination for Discrete Event Control (2011) (1)
- A RULE-BASED SOFTWARE QUALITY CLASSIFICATION MODEL (2008) (1)
- Informative Evaluation Metrics for Highly Imbalanced Big Data Classification (2022) (1)
- Analysis and differentiation of software system environments (1996) (1)
- How to Optimally Combine Univariate and Multivariate Feature Selection with Data Sampling for Classifying Noisy, High Dimensional and Class Imbalanced DNA Microarray Data# (2020) (1)
- Survey of review spam detection using machine learning techniques (2015) (1)
- Using Inductive Transfer Learning to Improve Hotel Review Spam Detection (2021) (1)
- Can a software quality model hit a moving target? (1998) (1)
- Reliability of fault-tolerant software based on a system architecture with a recovery metaprogram (1989) (1)
- DYNAMIC MODELS FOR TESTING BASED ON TIME SERIES ANALYSIS (2006) (1)
- Data Intensive Computing: A Biomedical Case Study in Gene Selection and Filtering (2011) (1)
- A performance analysis of advanced I/O architectures for PC-based network file servers (1994) (1)
- Software quality modeling and analysis with limited or without defect data (2005) (1)
- 2010 Ninth International Conference on Machine Learning and Applications ICMLA 2010 Table of Contents (2010) (1)
- Threshold-based feature selection techniques for high-dimensional bioinformatics data (2012) (1)
- Survey of Data Cleansing and Monitoring for Large-Scale Battery Backup Installations (2013) (1)
- IoT Reconnaissance Attack Classification with Random Undersampling and Ensemble Feature Selection (2021) (1)
- Evaluating The Number of Trainable Parameters on Deep Maxout and LReLU Networks for Visual Recognition (2020) (1)
- Necessity of Feature Selection when Augmenting Tweet Sentiment Feature Spaces with Emoticons (2016) (1)
- Decision Trees for Software Quality Classification (2003) (1)
- Encoding High-Dimensional Procedure Codes for Healthcare Fraud Detection (2022) (1)
- A Study on Software Metric Selection for Software Fault Prediction (2019) (1)
- Flexible hardware architecture for multi-media communications processing (1989) (1)
- A Review of Performance Evaluation on 2D Face Databases (2017) (1)
- On the impact of software product dissimilarity on software quality models (1994) (1)
- Can metrics and models be applied across multiple releases or projects? (1999) (1)
- Improving Software Quality Estimation by Combining Boosting and Feature Selection (2013) (1)
- High Consequence Systems and Semantic Computing (2013) (1)
- A Practical Software Quality Classification Model Using Genetic Programming (2007) (1)
- Social media for polling and predicting United States election outcome (2018) (1)
- Deep Learning applications for COVID-19 (2021) (0)
- Extracting Knowledge from Technical Reports for the Valuation of West Texas Intermediate Crude Oil Futures (2018) (0)
- Exploring Language-Interfaced Fine-Tuning for COVID-19 Patient Survival Classification (2022) (0)
- Large-scale distributed L-BFGS (2017) (0)
- Approaches for identifying U.S. medicare fraud in provider claims data (2018) (0)
- Learning from Highly Imbalanced Big Data with Label Noise (2023) (0)
- EFFICIENT IMPLEMENTATION AND COMPUTATIONAL ANALYSIS OF PRIVACY-PRESERVING PROTOCOLS FOR SECURING THE FINANCIAL MARKETS by (2018) (0)
- A parallel and distributed stochastic gradient descent implementation using commodity clusters (2019) (0)
- Severely imbalanced Big Data challenges: investigating data sampling approaches (2019) (0)
- Institutional Knowledge at Singapore Management University Ontology-based business process customization for composite web services (2019) (0)
- Comparative aspects of software complexity metrics and program modules — a multidimensional scaling approach (1992) (0)
- An Examination of Neural Networks on Cluster Computers (2021) (0)
- Prediction Error Average (1 Step) Maximum (1 Step) Average (2 Step) Average (3 Step) First Data Set 11% 27% 21% 30% Second Data Set 7% 22% 14% 22% (0)
- Exploring Ensemble Filters for Software Defect Prediction (2013) (0)
- An Easy-to-Classify Approach for the Bot-IoT Dataset (2021) (0)
- An approach to application-layer DoS detection (2023) (0)
- A Comparative Approach to Threshold Optimization for Classifying Imbalanced Data (2022) (0)
- Efficient Modeling of User-Entity Preference in Big Social Networks (2015) (0)
- Software Quality Modeling as a Reliability Tool (2008) (0)
- Healthcare Provider Summary Data for Fraud Classification (2022) (0)
- Investigating class rarity in big data (2020) (0)
- A performance analysis of personal computers in a video conferencing environment (1994) (0)
- BASELINE-DIFFERENCING: A NOVEL APPROACH FOR BUILDING GENERALIZABLE OCEAN TURBINE RELIABILITY MODELS (2012) (0)
- Improving deep neural network design with new text data representations (2017) (0)
- Survey on RNN and CRF models for de-identification of medical free text (2020) (0)
- Examining characteristics of predictive models with imbalanced big data (2019) (0)
- Cost-Sensitive Ensemble Learning for Highly Imbalanced Classification (2022) (0)
- The use of generative adversarial networks to alleviate class imbalance in tabular data: a survey (2022) (0)
- The effects of class rarity on the evaluation of supervised healthcare fraud detection models (2019) (0)
- IoT information theft prediction using ensemble feature selection (2022) (0)
- Feature evaluation for IoT botnet traffic classification (2022) (0)
- Evaluating Performance Metrics for Credit Card Fraud Classification (2022) (0)
- How to Develop Engineering Entrepreneurship: Case Study – Florida Atlantic University (2014) (0)
- Polishing Noise in Continuous Software Measurement Data (2006) (0)
- A Poisson Regression Model of Software Quality: A Comparative Study (2006) (0)
- Improving neural network models of defect content in complex software systems (1996) (0)
- Knowledge and Information Systems REGULAR PAPER (0)
- Hitting the Moving Target: Trials and Tribulations of Modeling Quality in Evolving Software Systems (1998) (0)
- A Stack Based Multimodal Machine Learning Model for Breast Cancer Diagnosis (2022) (0)
- Software measurement for the space shuttle HAL/S maintenance environment (1992) (0)
- Melanoma risk modeling from limited positive samples (2019) (0)
- Predicting Cyberattacks with Destination Port Through Various Input Feature Scenario (2022) (0)
- A Framework of Combining Data Pre-Processing Methods and Boosting for Software Quality Classification (2015) (0)
- Cbr-based software quality models and quality of data (2005) (0)
- Detecting cybersecurity attacks across different network features and learners (2021) (0)
- Composition analysis of the Bot-IoT dataset (2022) (0)
- Defining task-to-task dispatch and interrupt response times for real-time systems (1993) (0)
- A new feature popularity framework for detecting cyberattacks using popular features (2022) (0)
- Video and image analysis using statistical and machine learning techniques (2007) (0)
- A workload model for frame-based real-time applications on distributed systems (1992) (0)
- A review of data mining using big data in health informatics (2014) (0)
- Software Module Risk Analysis (2007) (0)
- The Impact of Feature Selection Techniques on a Hybrid Boosting and Data Sampling Approach for Software Quality Estimation (2014) (0)
- Mining and Storing Data Streams for Reliability Analysis (2010) (0)
- Detecting Web Attacks in Severely Imbalanced Network Traffic Data (2021) (0)
- Investigating rarity in web attacks with ensemble learners (2021) (0)
- Using Random Undersampling and Ensemble Feature Selection for IoT Attack Prediction (2023) (0)
- Investigating the relationship between time and predictive model maintenance (2020) (0)
- Editorial (2013) (0)
- Proceedings of the 2004 IEEE International Conference on Information Reuse and Integration, IRI - 2004, November 8-10, 2004, Las Vegas Hilton, Las Vegas, NV, USA (2004) (0)
- Predicting high-risk program modules by selecting the right software measurements (2011) (0)
- Survey on deep learning with class imbalance (2019) (0)
- Assessing the Risk of Faults in Software Modules (2001) (0)
- Software metrics and the quality of telecommunication software (1992) (0)
- Fast and Efficient Hashing for Sequence Similarity Search using Substring Extraction in DNA Sequence Databases (2020) (0)
- A survey of transfer learning (2016) (0)
- Threshold optimization and random undersampling for imbalanced credit card data (2023) (0)
- Detecting SSH and FTP Brute Force Attacks in Big Data (2021) (0)
- Rough set-based software quality models and quality of data (2008) (0)
- Application of Fuzzy Rule Extraction to Minimize the Costs of Misclassification on Software Quality Modeling (2003) (0)
- Empirical Bayes methods in time series analysis (1982) (0)
- Software reliability engineering with genetic programming (2003) (0)
- Big Data fraud detection using multiple medicare data sources (2018) (0)
- Panel: Using information re-use and integration principles in big data (2012) (0)
- Verifying the Security Characteristics of a Secure Physical Access Control Protocol (2016) (0)
- Evaluation of maxout activations in deep learning across several big data domains (2019) (0)
- Guest Editors ’ Introduction Software Metrics : Charting the Course (2001) (0)
- VipBoost: A More Accurate Boosting Algorithm (2009) (0)
- Big Data and Class Imbalance in Medicare Fraud Detection (2020) (0)
- A Comparative Study of Sampled Feature Ranker Ensembles for Software Quality Classification (2012) (0)
- Observing the Effect of the Choice of Classifier on Bioinformatics Data with Varying Levels of Data Quality and Class Balance (2015) (0)
- Deep learning applications and challenges in big data analytics (2015) (0)
- A survey of open source tools for machine learning with big data in the Hadoop ecosystem (2015) (0)
- Improving detection of untrustworthy online reviews using ensemble learners combined with feature selection (2017) (0)
- Feature list aggregation approaches for ensemble gene selection on patient response datasets (2013) (0)
- Performance analysis of a peer-to-peer I/O architecture in video server environments (1995) (0)
- Neural network application in support of software reliability engineering (1995) (0)
- lVlo deling Fault -Yrone (2000) (0)
- An Empirical Study of Software Metric Selection Techniques for Defect Prediction (2012) (0)
- The effects of varying class distribution on learner behavior for medicare fraud detection with imbalanced big data (2018) (0)
- A reconstruction error-based framework for label noise detection (2021) (0)
- Impact of class distribution on the detection of slow HTTP DoS attacks using Big Data (2019) (0)
- Comparing Two Approaches for Adding Feature Ranking to Sampled Ensemble Learning for Software Quality Estimation (2014) (0)
- Random forest implementation and optimization for Big Data analytics on LexisNexis’s high performance computing cluster platform (2019) (0)
- Does the Inclusion of Data Sampling Improve the Performance of Boosting Algorithms on Imbalanced Bioinformatics Data? (2015) (0)
- ATTRIBUTE NOISE DETECTION USING MULTI-RESOLUTION ANALYSIS (2006) (0)
- Modeling and tracking Covid-19 cases using Big Data analytics on HPCC system platform (2021) (0)
- Intrusion detection and Big Heterogeneous Data: a Survey (2015) (0)
- IoT attack prediction using big Bot-IoT data (2022) (0)
- I/O performance analysis for network file servers (1994) (0)
- Assessing the Adaptability of Large Software Systems (2007) (0)
- GANs for Class-Imbalanced Data: A Meta-Analysis of GitHub Projects (2022) (0)
- VCI predictors: Voting on classifications from imputed learning sets (2008) (0)
- A simulation study of the performance of several estimation procedures for linear regression models (1994) (0)
- A Study of the Impact of Base Traditional Learners on Transfer Learning Algorithms (2018) (0)
- Detecting web attacks using random undersampling and ensemble learners (2021) (0)
- A Novel Approach for Unsupervised Learning of Highly-Imbalanced Data (2022) (0)
- Sample size determination for biomedical big data with limited labels (2020) (0)
- STRATEGY AND APPLICATION OF DATA-DRIVEN TESTING OF AN OCEAN TURBINE DRIVETRAIN (2011) (0)
- Comparative Analysis on the Stability of Feature Selection Techniques Using Three Frameworks on Biological Datasets (2013) (0)
- Survey on categorical data for neural networks (2020) (0)
- Predicting Traffic Incidents in Road Networks Using Vehicle Detector Data (2021) (0)
- Evaluating classifier performance with highly imbalanced Big Data (2023) (0)
- Proceedings of the 9th [i. e. 12th] IASTED International Conference on Software Engineering and Applications, November 16-18, 2008, Orland, Florida, USA (2008) (0)
- A performance analysis of an object-based I/O architecture in a video server environment (1995) (0)
- A survey on addressing high-class imbalance in big data (2018) (0)
- Performance of Filter-based Feature Subset Selection for Software Quality Data Classification (2014) (0)
- Accelerated Deep Learning on HPCC Systems (2020) (0)
- Big Data: Deep Learning for financial sentiment analysis (2018) (0)
This paper list is powered by the following services:
What Schools Are Affiliated With Taghi M. Khoshgoftaar?
Taghi M. Khoshgoftaar is affiliated with the following schools: