$25
1.1 PROBABILITY
1.1.1 PROBABILITY DISTRIBUTION TABLE
You have been studying hard on your latest assignment and asked your fellow students for some tips:
• Tip 1 “Study hard and you will do well, fail to do so and you will not”
• Tip 2 “Get plenty of rest and you will do well, fail to do so and you will not”
• Tip 3 “Set an an alarm and you will get up in time, fail to do so and you will not”.
You ask your fellow classmates which tips they adhere to and how frequently. After recording their responses you derive the following table:
Create a probability distribution table using appropriate variable names.
1.1.2 BAYESIAN NETWORKS
You have been given the following Random Variables:
• Fossil Fuels
• AI
• GlobalWarming
• Renewable Energy
• Traffic
• Employed
Construct a model of the world using these variables (Note: this is subjective but there should be a rationale behind your world view and your thought process articulated). Implement a Bayesian Network.
• Provide a visual depiction of the network • Detail the associated Conditional Probability Tables.
• Demonstrate querying the network.
Write a clear and concise description regarding Bayesian Networks, the utility of such and the associated pro’s and cons. Support this with examples where possible.
1.2 LEARNING: DEVELOPING AND EVALUATING A MODEL
“Naive Bayes is a simple technique for constructing classifiers: models that assign class labels to problem instances, represented as vectors of feature values, where the class labels are drawn from some finite set. There is not a single algorithm for training such classifiers, but a family of algorithms based on a common principle: all naive Bayes classifiers assume that the value of a particular feature is independent of the value of any other feature, given the class variable.” You will be implementing a Naive Bayes classifier and discussing the theory behind its operation from the perspective of probability, conditional independence, and Bayes theorem.
1.2.1 DATA
Choose an appropriate multivariate datasets with a limited number of classes from: https:
//archive.ics.uci.edu/ml/index.php Take a subset of data from the dataset:
• Compute the “Prior” probabilities for each of the classes.
• Compute the probability of evidence.
• Compute the probability of likelihood of evidences (numerator).
Articulate your understanding of each as applied to Bayesian networks. Tip: Demonstrate the suitability of a dataset or highlight why it is not suitable. Use appropriate visualisations to aid your rationale.
Write a clear and concise description regarding the dataset and its appropriateness for a Naive Bayes Classifier and probabilistic inference in general.
1.2.2 NAIVE BAYES LEARNER
Based onyour studying and reviewing of “learning.ipynb” implement the Naive Bayes learner and evaluate its performance for the selected dataset. Choose appropriate methodologies and/or variants in its utilisation. Present appropriate visualisations demonstrating its performance. Write a clear and concise description of the Naive Bayes Classifier considering the headings outlined: probability, conditional independence, and Bayes theorem
2 TIPS
• Review “probability.ipynb”, “learning.ipynb” and ‘learning_apps.ipynb”.
• TipforallParts: Do not provide masses of text lifted from the text/online or wikipedia. Be your own editor, focus on what is important and articulate that in a rationale way that is technically accurate. Demonstrating a capacity to explain complex topics in a clear and concise manner is indicative of a higher level of understanding and will be rewarded with an appropriate grade. Use results derived from experimental runs to back your claims and make sure your narrative is coherent and consistent.