Posts

Anomaly Detection Using Isolation Forest in Python

Image
  From bank fraud to preventative machine maintenance, anomaly detection is an incredibly useful and common application of machine learning. The isolation forest algorithm is a simple yet powerful choice to accomplish this task. In this article we'll cover: An Introduction to Anomaly Detection Use Cases of Anomaly Detection What Is Isolation Forest? Using Isolation Forest for Anomaly Detection Implementation in Python Introduction to Anomaly Detection An  outlier  is nothing but a data point that differs significantly from other data points in the given dataset. Anomaly detection  is the process of finding the outliers in the data, i.e. points that are significantly different from the majority of the other data points. Large, real-world datasets may have very complicated patterns that are difficult to detect by just looking at the data. That's why the study of anomaly detection is an extremely important application of Machine Learning. In this article we are going to implement

Demystifying ‘Confusion Matrix’ Confusion

Image
  If you are Confused about Confusion Matrix, then I hope this post may help you understand it! Happy Reading. We will use the UCI Bank Note Authentication Dataset for demystifying the confusion behind Confusion Matrix. We will predict and evaluate our model, and along the way develop our conceptual understanding. Also will be providing the links to further reading wherever required. Understanding the Data The Dataset contains properties of the wavelet transformed image of 400x400 pixels of a BankNote, and can be found  here . It is recommended for reader to download the dataset and follow along. Further for reference, you can find the Kaggle Notebook  here . #Skipping the necessary Libraries import #Reading the Data File df = pd.read_csv('../input/BankNote_Authentication.csv') df.head(5) Sample Data (using Head) #To check if the data is equally balanced between the target classes df['class'].value_counts() Target Class is balanced enough Building the Model Splitting th