Starting from:

$25

EECE5644 - FINAL PROJECT - Solved

Teams would perform Exploratory Data Analysis (EDA) and inference using machine learning methods on publicly available data.



You are responsible for forming project teams of 3-4 people for V30, 2-3 for Sec01, and 2-3 for V35. 

Note: teams must be formed from within the class section (Use piazza to find team members)

Project Selection:

Discuss project ideas  It's important that the dataset is  publicly available, please have a look at data science competition sites: Kaggle, DrivenData, CrowdANALYTIX, TunedIT, InnoCentive, Codalab. Its better to choose projects which needs some EDA and also how will you gain knowledge from the data? What do you hope to do, specifically, and why? What is the impact of an answer to that question? Present three procedures from class that you will apply to the data set.

Project Abstract 
The project abstract should not exceed one page in length with 11-point font and single spacing. The first half of the document will describe the publicly available dataset you intend to explore - describe the measurements carefully, so that the readers will see the connection between the goals for exploratory analysis and the available measurements. Present this section in your own words, without copying from archived manuscripts. Assume that the reader provides funding (I.e., with little technical background, but with funds that you hope to receive). Hence, the goal is to convince prospective funder that the project is feasible. For this, demonstrate that your team understands the available data carefully enough to pose interesting hypotheses and explore with at least some low-risk options and a few high-risk, high-reward teasers.

The final half of the document will describe specific goals for your exploratory data analysis (EDA). Write this from the perspective of a team leader, with limited resources and people to perform EDA. How will you gain knowledge from the data? What do you hope to do, specifically, and why? Like what is the application of an answer to that question, i.e., so what, who cares? What do you believe can be answered using methods developed in the class? For each question, present three procedures from class that you will apply to the data set.

You should expect two-thirds of your procedures to fail (excellently) and half of your questions to be not be answered accurately enough. Generate enough questions and procedures so that some procedures will answer some of your questions. Take risks but have a low-risk solution. You will have to document why procedures and questions failed at the Milestone report , so don’t propose more than your team can address in the allotted time – the importance of properly planning (I.e., time management) is especially important in data science. The ability and know-hows are important, but as too is the project planning. Do not overlook this and fall into a trap of putting too much on the plate, or, to the contrary, not planning for work to match the potential of your team (I.e., undershoot expectation).

Milestone report 
The report must be in 12-point font, with single-line spacing and 1-inch margins in 8.5 x 11-inch format.  The report will have a maximum of two pages of text.  Additionally, the report has no more than 1 page of visualizations at the end of the report.  All figures and illustrations will have captions and be referenced in the text.  No links to outside material will be followed.

Please use the report to address the following questions carefully.

●     Ideas in the abstract that were considered in this phase of work.  Describe the expected success or failure of each.

●     New ideas that were not mentioned in the abstract. How did the data suggest that you considering these ideas now?

●     Proposed scope of the work for the Final Report.

Project Report 
The project report will be your opportunity to tell the story from slides in a single, technical write-up. It should consist of the edited slide set, along with detailed notes to accompany each slide. Think of the project report as the script for the project presentation (next). The report format will allow one 8x11 page for each slide: the slide itself will occupy 1/3 of the page, and the notes will complete the page (and no more). When reading the notes aloud at a presentation pace, the report page should take 1.5 minutes on average. Each member of the group will contribute an equal portion to the report.

Project Presentation
Submission Mode: Upload link to recorded presentation on Piazza

The work of each team will be presented as a recorded talk - lasting up to 10 minutes and recorded by the team. Each team member should spend about the same amount of time speaking during the presentat.

More products