Data Mining

Group Project

This page will be updated as the project progresses throughout the semester. For now, start thinking about potential project ideas. Even though this is a group project, each student needs to submit their own brief initial proposal prior to group formation; this initial proposal is the first milestone. For ideas about projects, check out this page on good examples of projects from the Spring 2020 offering of this course. You might also look at some good examples of projects from a past offering of this course by Alona Fyshe.

Your initial proposal is due on Brightspace on Friday, May 22nd by 11:59pm, so think early and think well! The instructions for what to submit are posted on Brightspace as an assignment.

Goal

The purpose of the group project is to get hands-on experience in applying machine learning algorithms to interesting and important problems.

Timeline

Milestone           Due date
Initial proposal    Friday, May 22nd
Group formation     Friday, May 29th
Formal proposal     Sunday, June 14th
Progress report     Sunday, July 12th
Presentation        July 27th and 30th
Final report        Thursday, August 6th

How Groups Work

Each group will have at least 4 and at most 6 students. A group size of 4 is ideal: coordinating and dividing work evenly become more difficult as the group grows beyond 4 students.

Each student is expected to be involved with some machine learning component of the project. If, for instance, a project is to get movie ratings data and then apply some fancy machine learning method to it, it is not OK for one person to only collect the data. Collecting data is not machine learning (unless, of course, you're using machine learning itself to collect the data!).

Some general advice about the project
Here are some project ideas to help get you started thinking