Full metadata
Title
Anomaly detection in categorical datasets with artificial contrasts
Description
Anomaly is a deviation from the normal behavior of the system and anomaly detection techniques try to identify unusual instances based on deviation from the normal data. In this work, I propose a machine-learning algorithm, referred to as Artificial Contrasts, for anomaly detection in categorical data in which neither the dimension, the specific attributes involved, nor the form of the pattern is known a priori. I use RandomForest (RF) technique as an effective learner for artificial contrast. RF is a powerful algorithm that can handle relations of attributes in high dimensional data and detect anomalies while providing probability estimates for risk decisions.
I apply the model to two simulated data sets and one real data set. The model was able to detect anomalies with a very high accuracy. Finally, by comparing the proposed model with other models in the literature, I demonstrate superior performance of the proposed model.
I apply the model to two simulated data sets and one real data set. The model was able to detect anomalies with a very high accuracy. Finally, by comparing the proposed model with other models in the literature, I demonstrate superior performance of the proposed model.
Date Created
2016
Contributors
- Mousavi, Seyyedehnasim (Author)
- Runger, George C. (Thesis advisor)
- Wu, Teresa (Committee member)
- Kim, Sunghoon (Committee member)
- Arizona State University (Publisher)
Topical Subject
Resource Type
Extent
viii, 53 pages : illustrations (some color)
Language
eng
Copyright Statement
In Copyright
Primary Member of
Peer-reviewed
No
Open Access
No
Handle
https://hdl.handle.net/2286/R.I.40782
Statement of Responsibility
by Seyyedehnasim Mousavi
Description Source
Retrieved on April 5, 2017
Level of coding
full
Note
thesis
Partial requirement for: M.S., Arizona State University, 2016
bibliography
Includes bibliographical references (pages 51-53)
Field of study: Industrial engineering
System Created
- 2016-12-01 07:04:44
System Modified
- 2021-08-30 01:20:29
- 3 years 2 months ago
Additional Formats