Data 624 Predictive Analytics
Project #2
Summer 2021
You are given a simple data set from a beverage manufacturing company. It consists of 2,571 rows (cases) and 33 columns (variables). Your goal is to use this data to predict PH, a column in the set. Potential for hydrogen (pH) is a measure of acidity/alkalinity. It must stay within a critical range, so it is important to understand its influence and predict its values. This is production data, and pH is a Key Performance Indicator (KPI).
You are also given a scoring set of 267 cases. It contains all variables other than the dependent (target) variable. Use this data to score your model with your best predictions.
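
The workflow described above (fit on the labeled data, check the error on held-out cases, then predict the unlabeled scoring cases) could be sketched roughly as follows. This is only an illustration under stated assumptions: the data is synthetic, the predictor name `carb_volume` and the pH values are hypothetical stand-ins, and a one-variable least-squares line takes the place of whatever model you actually choose. Only the row counts (2,571 and 267) and the target name `PH` come from the assignment.

```python
import random
from math import sqrt

random.seed(0)  # fixed seed so the sketch is reproducible


def make_rows(n, with_target=True):
    """Build synthetic rows; 'carb_volume' is a hypothetical predictor."""
    rows = []
    for _ in range(n):
        x = random.uniform(5.0, 6.0)
        row = {"carb_volume": x}
        if with_target:
            # Assumed relationship, invented purely for illustration.
            row["PH"] = 8.5 - 0.1 * (x - 5.5) + random.gauss(0, 0.05)
        rows.append(row)
    return rows


train_rows = make_rows(2571)                       # labeled data
scoring_rows = make_rows(267, with_target=False)   # target column absent

# Hold out 20% of the labeled rows to estimate out-of-sample error.
random.shuffle(train_rows)
cut = int(0.8 * len(train_rows))
fit_rows, holdout_rows = train_rows[:cut], train_rows[cut:]


def fit_line(rows, feature, target):
    """Ordinary least squares for a single predictor: (slope, intercept)."""
    xs = [r[feature] for r in rows]
    ys = [r[target] for r in rows]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def rmse(actual, predicted):
    """Root mean squared error between two equal-length sequences."""
    return sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))


slope, intercept = fit_line(fit_rows, "carb_volume", "PH")

# Evaluate on rows the model has not seen.
holdout_pred = [intercept + slope * r["carb_volume"] for r in holdout_rows]
holdout_rmse = rmse([r["PH"] for r in holdout_rows], holdout_pred)

# Score the unlabeled cases: one predicted PH value per scoring row.
scoring_pred = [intercept + slope * r["carb_volume"] for r in scoring_rows]

print(f"holdout RMSE: {holdout_rmse:.4f}")
print(f"scoring predictions: {len(scoring_pred)}")
```

Because the scoring set has no target values, the held-out error is the only estimate of predictive accuracy available before submission, which is why the sketch reserves part of the labeled data rather than fitting on all of it.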