Clustering in Python

Watch the YouTube video and read the two articles provided to answer the questions listed below. Ensure your answers are drawn primarily from the video and articles.

Next, download and open the Assignment 11 Hands-On directions and data file.

Questions

  1. In the YouTube video “K-Means Clustering in Python,” what was the name of the data set used in the example? Why do you think this data set is not a good example for k-means?
  2. In the YouTube video “K-Means Clustering in Python,” what were the x and y coordinates for the scatterplot created by Mazen?
  3. In the YouTube video “K-Means Clustering in Python,” which method did Mazen recommend using when you do not know how many clusters are present in the data?
  4. According to the article, “Why real-time cost anomaly detection for your cloud is non-negotiable,” by what percentage did senior IT leaders go over budget on cloud costs? What did senior IT leaders say they worry about when it comes to running big data cloud technologies and applications?
  5. According to the article, “Why real-time cost anomaly detection for your cloud is non-negotiable,” how can enterprises protect themselves from overspending on cloud costs? Describe the platform that can help protect enterprises and list one AWS product.
  6. In the article, “Anomaly detection for insider threats using unsupervised ensembles,” what were the four unsupervised machine learning algorithms used by the authors?
  7. In the article, “Anomaly detection for insider threats using unsupervised ensembles,” how was the performance of the authors’ threat detection measured? Which algorithm performed the best under normal conditions? Poisoning conditions? Severe poisoning conditions?

Hands-On

Use Jupyter Notebook to complete the Python exercises.

  1. Download the directions for K-Means and DBSCAN. You will need the Mall_Customers.csv data file for these experiments. Complete the exercises and save your work. Ensure you include pseudocode for Steps 1-7. Upload your ipynb and pdf files.
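
If the k-means idea is unfamiliar before you start the exercises, the following is a minimal, dependency-free sketch of the basic loop (assign each point to its nearest centroid, then move each centroid to the mean of its points). It is only an illustration under stated assumptions: the six points are made up and are not taken from Mall_Customers.csv, the function name `kmeans` and its parameters are hypothetical, and the downloaded directions may call for a library implementation instead.

```python
import random


def kmeans(points, k, iterations=100, seed=0):
    """Illustrative k-means (Lloyd's algorithm) on a list of (x, y) tuples."""
    rng = random.Random(seed)
    # Start from k distinct data points chosen at random.
    centroids = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: label each point with the index of its nearest centroid.
        labels = [
            min(
                range(k),
                key=lambda c: (p[0] - centroids[c][0]) ** 2
                + (p[1] - centroids[c][1]) ** 2,
            )
            for p in points
        ]
        # Update step: move each centroid to the mean of its assigned points.
        new_centroids = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                mean_x = sum(x for x, _ in members) / len(members)
                mean_y = sum(y for _, y in members) / len(members)
                new_centroids.append((mean_x, mean_y))
            else:
                # Keep an empty cluster's centroid where it was.
                new_centroids.append(centroids[c])
        # Stop once the centroids no longer move.
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return centroids, labels


# Made-up points forming two obvious groups (not real assignment data).
points = [(18, 20), (20, 22), (22, 18), (78, 80), (80, 82), (82, 78)]
centroids, labels = kmeans(points, k=2)
print(centroids)
print(labels)
```

Writing out the assignment and update steps this way may also help when you draft the pseudocode for your own steps.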