Results 1 - 10
of
10
Using an Ensemble of One-Class SVM Classifiers to Harden Payload-based Anomaly Detection Systems
- In Proceedings of the IEEE International Conference on Data Mining (ICDM’06
, 2006
"... Unsupervised or unlabeled learning approaches for network anomaly detection have been recently proposed. In particular, recent work on unlabeled anomaly detection focused on high speed classification based on simple payload statistics. For example, PAYL, an anomaly IDS, measures the occurrence frequ ..."
Abstract
-
Cited by 18 (6 self)
- Add to MetaCart
Unsupervised or unlabeled learning approaches for network anomaly detection have been recently proposed. In particular, recent work on unlabeled anomaly detection focused on high speed classification based on simple payload statistics. For example, PAYL, an anomaly IDS, measures the occurrence frequency in the payload of n-grams. A simple model of normal traffic is then constructed according to this description of the packets ’ content. It has been demonstrated that anomaly detectors based on payload statistics can be “evaded ” by mimicry attacks using byte substitution and padding techniques. In this paper we propose a new approach to construct high speed payload-based anomaly IDS intended to be accurate and hard to evade. We propose a new technique to extract the features from the payload. We use a feature clustering algorithm originally proposed for text classification problems to reduce the dimensionality of the feature space. Accuracy and hardness of evasion are obtained by constructing our anomaly-based IDS using an ensemble of one-class SVM classifiers that work on different feature spaces. 1
Intrusion detection in computer networks by a modular ensemble of one-class classifiers
- Information Fusion, Special Issue on Applications of Ensemble Methods
, 2008
"... Since the early days of research on Intrusion Detection, anomaly-based approaches have been proposed to detect intrusion attempts. Attacks are detected as anomalies when compared to a model of normal (legitimate) events. Anomaly-based approaches typically produce a relatively large number of false a ..."
Abstract
-
Cited by 5 (0 self)
- Add to MetaCart
Since the early days of research on Intrusion Detection, anomaly-based approaches have been proposed to detect intrusion attempts. Attacks are detected as anomalies when compared to a model of normal (legitimate) events. Anomaly-based approaches typically produce a relatively large number of false alarms compared to signature-based IDS. However, anomaly-based IDS are able to detect never-before-seen attacks. As new types of attacks are generated at an increasing pace and the process of signature generation is slow, it turns out that signature-based IDS can be easily evaded by new attacks. The ability of anomaly-based IDS to detect attacks never observed in the wild has stirred up a renewed interest in anomaly detection. In particular, recent work focused on unsupervised or unlabeled anomaly detection, due to the fact that it is very hard and expensive to obtain a labeled dataset containing only pure normal events. The unlabeled approaches proposed so far for network IDS focused on modeling the normal network traffic considered as a whole. As network traffic related to different protocols or services exhibits different characteristics, this paper proposes an unlabeled Network Anomaly IDS based on a modular Multiple Classifier System (MCS). Each module is designed to model a particular group of similar protocols or network services. The use of a modular MCS allows the designer to choose a different model and decision threshold for different (groups of) network services. This also allows the designer to tune the false alarm rate and detection rate produced by each module to optimize the overall performance of the ensemble. Experimental results on the KDD-Cup 1999 dataset show that the proposed anomaly IDS achieves high attack detection rate and low false alarm rate at the same time. 1
A Discussion on the Classifier Projection Space for Classifier Combining
, 2002
"... In classifier combining , one tries to fuse the information that isg iven by a set of base classifiers. In such a process, one of the di#culties is ho to deal ith the variability bet een classifiers. Althoug h various measures and many combining rules have been sugwxE ed in the past, the problem of ..."
Abstract
-
Cited by 4 (1 self)
- Add to MetaCart
In classifier combining , one tries to fuse the information that isg iven by a set of base classifiers. In such a process, one of the di#culties is ho to deal ith the variability bet een classifiers. Althoug h various measures and many combining rules have been sugwxE ed in the past, the problem of constructing optimal combiners is still heavily studied. In this paper, e discuss and illustrate the possibilities of classifier embedding in order to analyse the variability of base classifiers, as ell as their combining rules. Thereby, a space is constructed in hich classifiers can be represented as points. Such a space of a lo dimensionality is a Classifier Projection Space (CPS). In the first instance, it is used to desig n a visual tool thatg ives more insig ht into the di#erences of various combining techniques. This is illustrated by some examples. In the end, e discuss ho the CPS may also be used as a basis for constructing ne combining rules. 1
Employing Optimized Combinations of One-Class Classifiers for Automated Currency Validation
- Pattern Recognition
, 2003
"... Automated currency validation requires a decision to be made regarding the authenticity of a banknote presented to the validation system. This decision often has to be made with little or no information regarding the characteristics of possible counterfeits as is the case for issues of new currency. ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
Automated currency validation requires a decision to be made regarding the authenticity of a banknote presented to the validation system. This decision often has to be made with little or no information regarding the characteristics of possible counterfeits as is the case for issues of new currency. A method for automated currency validation is presented which segments the whole banknote into di#erent regions, builds individual classifiers on each region and then combines a small subset of the region specific classifiers to provide an overall decision. The segmentation and combination of region specific classifiers to provide optimized false positive and false negative rates is achieved by employing a genetic algorithm. Experiments based on high value notes of Sterling currency were carried out to assess the e#ectiveness of the proposed solution.
Behavior-based Email Analysis with Application to Spam Detection
, 2006
"... Email is the “killer network application”. Email is ubiquitous and pervasive. In a relatively short timeframe, the Internet has become irrevocably and deeply entrenched in our modern society primarily due to the power of its communication substrate linking people and organizations around the globe. ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
Email is the “killer network application”. Email is ubiquitous and pervasive. In a relatively short timeframe, the Internet has become irrevocably and deeply entrenched in our modern society primarily due to the power of its communication substrate linking people and organizations around the globe. Much work on email technology has focused on making email easy to use, permitting a wide variety of information and information types to be conveniently, reliably, and efficiently sent throughout the Internet. However, the analysis of the vast storehouse of email content accumulated or produced by individual users has received relatively little attention other than for specific tasks such as spam and virus filtering. As one paper in the literature puts it, ”the state of the art is still a messy desktop” (Denning,
One-class Classification by Combining Density and Class Probability Estimation
"... Abstract. One-class classification has important applications such as outlier and novelty detection. It is commonly tackled using density estimation techniques or by adapting a standard classification algorithm to the problem of carving out a decision boundary that describes the location of the targ ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
Abstract. One-class classification has important applications such as outlier and novelty detection. It is commonly tackled using density estimation techniques or by adapting a standard classification algorithm to the problem of carving out a decision boundary that describes the location of the target data. In this paper we investigate a simple method for one-class classification that combines the application of a density estimator, used to form a reference distribution, with the induction of a standard model for class probability estimation. In this method, the reference distribution is used to generate artificial data that is employed to form a second, artificial class. In conjunction with the target class, this artificial class is the basis for a standard two-class learning problem. We explain how the density function of the reference distribution can be combined with the class probability estimates obtained in this way to form an adjusted estimate of the density function of the target class. Using UCI datasets, and data from a typist recognition problem, we show that the combined model, consisting of both a density estimator and a class probability estimator, can improve on using either component technique alone when used for one-class classification. We also compare the method to one-class classification using support vector machines. 1
Discrimination between digits and outliers in handwritten documents applied to the extraction of numerical fields
, 2006
"... In this article, we propose a numerical field extraction system from unconstrained handwritten documents. The system is based on a segmentation driven by recognition stage followed by a syntactical analysis which detects the sequences that may compose a numerical field. We focus here on the design o ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
In this article, we propose a numerical field extraction system from unconstrained handwritten documents. The system is based on a segmentation driven by recognition stage followed by a syntactical analysis which detects the sequences that may compose a numerical field. We focus here on the design of a digit classifier embedded in the segmentation/recognition process able to discriminate digits from outliers such as words, fragment of words, noise, etc. For that, we have developed a light classifier used as prior to a standard digit classifier in order to reject “obvious outliers”. Several classifiers have been compared in terms of ROC curve and processing time. 1
A Comparison of One-Class Classifiers for Novelty Detection in Forensic Case Data ⋆
"... Abstract. This paper investigates the application of novelty detection techniques to the problem of drug profiling in forensic science. Numerous one-class classifiers are tried out, from the simple k-means to the more elaborate Support Vector Data Description algorithm. The target application is the ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
Abstract. This paper investigates the application of novelty detection techniques to the problem of drug profiling in forensic science. Numerous one-class classifiers are tried out, from the simple k-means to the more elaborate Support Vector Data Description algorithm. The target application is the classification of illicit drugs samples as part of an existing trafficking network or as a new cluster. A unique chemical database of heroin and cocaine seizures is available and allows assessing the methods. Evaluation is done using the area under the ROC curve of the classifiers. Gaussian mixture models and the SVDD method are trained both with and without outlier examples, and it is found that providing outliers during training improves in some cases the classification performance. Finally, combination schemes of classifiers are also tried out. Results highlight methods that may guide the profiling methodology used in forensic analysis. 1
McPAD: A Multiple Classifier System for Accurate Payload-based Anomaly Detection
"... Anomaly-based network Intrusion Detection Systems (IDS) are valuable tools for the defense-in-depth of computer networks. Unsupervised or unlabeled learning approaches for network anomaly detection have been recently proposed. Such anomaly-based network IDS are able to detect (unknown) zero-day atta ..."
Abstract
- Add to MetaCart
Anomaly-based network Intrusion Detection Systems (IDS) are valuable tools for the defense-in-depth of computer networks. Unsupervised or unlabeled learning approaches for network anomaly detection have been recently proposed. Such anomaly-based network IDS are able to detect (unknown) zero-day attacks, although much care has to be dedicated to controlling the amount of false positives generated by the detection system. As a matter of fact, it is has been shown [2] that the false positive rate is the true limiting factor for the performance of IDS, and that in order to substantially increase the Bayesian detection rate, P(Intrusion|Alarm), the IDS must have a very low false positive rate (e.g., as low as 10 −5 or even lower). In this paper we present McPAD (Multiple-Classifier Payload-based Anomaly Detector), a new accurate payload-based anomaly detection system that consists of an ensemble of one-class classifiers. We show that our anomaly detector is very accurate in detecting network attacks that bear some form of shell-code in the malicious payload. This holds true even in the case of polymorphic attacks and for very low false positive rates. Furthermore, we experiment with advanced polymorphic blending attacks and we show that in some cases even in the presence of such sophisticated attacks and for a low false positive rate our IDS still has a relatively high detection rate.
18 Fault Detection with Bayesian Network
"... Nowadays, process control (or process monitoring) is becoming an essential task especially when dealing with complex manufacturing processes (like automatized processes containing a lot of sensors and actuators). In (Chiang et al., 2001), authors give two principal approaches to perform the process ..."
Abstract
- Add to MetaCart
Nowadays, process control (or process monitoring) is becoming an essential task especially when dealing with complex manufacturing processes (like automatized processes containing a lot of sensors and actuators). In (Chiang et al., 2001), authors give two principal approaches to perform the process control, namely, data-driven techniques and analytical

