ITD 140 · ML I
Module 1 · Assignment

Assignment 1.7: Unsupervised Learning (Clustering, k-means, Iris Dataset)

Points 50Due Jun 28, 2026, 3:59 AMSubmit File uploadFormats doc, docx, pdf
  1. Watch https://www.youtube.com/watch?v=JnnaDNNb380 for a very basic overview of supervised vs. unsupervised learning as well as an intro to k-means clustering. Disregard the last part about autoencoders--that's beyond the scope of this course.
  2. Open Orange with a blank canvas/project.
  3. Drop a 'datasets' widget onto the canvas.
  4. Configure it to pull the Iris dataset. (find it, then double-click on it)
  5. Connect a data table widget to ensure that you're getting the correct data.
  6. From the data table widget, connect a k-means widget (blue, unsupervised learning).
  7. Configure it for three (3) fixed clusters.
  8. Connect a scatter plot widget to visualize the output, selecting the "Color" option to use the Cluster.
  9. Leaving that window open, go back to the k-means widget config and change the number of clusters--2, 4, etc. to see what happens.
  10. Your submission will be:
    1. a minimum of 100 words explaining the basic functionality of k-means, including your observations from changing the 'k' parameter (the number of target clusters)
      1. For a good (and corny but entertaining) explanation of k-means, watch https://youtu.be/4b5d3muPQmA 
    2. a screenshot of your scatter plot with 3 clusters (k=3)
    3. a single Word document with your written explanation and screenshot

 

Other stuff...

Fascinating blog post from an Orange Data Mining developer on an encounter with a statistician stuck in old ways and how data can lead to interesting stories.