Starting from:

$25

COMP61332 - Coursework 2 -Social Media Analytics -Text Mining - Solved

Your second coursework for COM61332 will be based on a group project focussed on Social Media Analytics (SMA) enabled by Text Mining.  

Intended Learning Outcomes
●     to develop and employ a social media analytics (SMA) text mining pipeline on a student-chosen problem

●     to analyse the results obtained by employing the SMA pipeline on the chosen problem  

●     to explain how the SMA pipeline was applied to answer questions central to the problem, and summarise results  

●     to act as a responsible member of a team, communicate with team mates, and contribute to the team's self-organisation, planning and conflict resolution for the duration of the group work

Instructions
Given the chosen topic for your group, you will answer the research questions that you set out for that topic by developing a Social Media Analytics (SMA) pipeline.  

Before starting to write any program code, however, your first step should be to identify, together with your group mates, the text mining tasks that will help you answer your research questions. For example:

1)  What is the dominant sentiment towards Covid19 vaccines? ​[Sentiment analysis]

2)  Which Covid-19 vaccine brands/manufacturers have been most talked about recently? Do people prefer any brands? ​[Named entity recognition + Sentiment analysis]

3)  Have people been discussing side-effects of the Covid-19 vaccine? If yes, what are these? 

[Named entity recognition + named entity linking]

A Jupyter notebook with boilerplate code will be made available to the class. This includes code for the following tasks, that you can build upon:

A.    Data pre-processing

B.    Topic modelling

C.   Sentiment analysis

D.   Named entity recognition

E.    Named entity linking

As part of your group project, you are expected to adapt (i.e., develop your own enhancements to) any of the code provided to you, in order to ensure that you can address your research questions adequately. For example, your data set might need additional, custom steps for cleaning, or you might want to use different named entity recognition models.  

You also have the choice to implement any of the above NLP tasks from scratch. For example, if you found a data set relevant to your topic that is already labelled with sentiments, then you are free to train your own sentiment analysis classification model, as long as you can integrate it into the notebook.

Your final SMA pipeline should be in the form of a Jupyter notebook---similar to what you are provided with---which will allow you to document every step in your pipeline, as well as to more easily visualise results. While the notebook provided to you does not include any code for visualisation, you are encouraged to add this, as you might find the generated diagrams useful in preparing your report (see Deliverables section below). 



Template: IEEE Conference format[2]

Recommended length: 2-3 pages (in two-column format), excluding references.

SMA Pipeline 
Functionality 

The SMA pipeline is not functional. 

The SMA pipeline is functional but was not carefully designed to ensure that all of the research questions set out by the group during the proposal stage are answered. 

The SMA pipeline is functional and was designed to answer all of the research questions set out by the group during the proposal stage. 
Adaptations 

There was no attempt to incorporate any enhancements into the code provided to 
 

 
 
the class. 

Some adaptations and enhancements were added to the code provided to the class, although these seem trivial/limited (i.e., not much thought or effort was put into developing these). 

Sufficient adaptations and enhancements were added to the code provided to the class. 
Documentation 

The notebook does not contain any form of documentation (e.g., in-line comments, descriptions).  

The notebook contains some documentation but not enough to clearly explain any adaptations/enhancements incorporated into the pipeline (e.g., how these were implemented or how they improve the pipeline). 

The code contains documentation that clearly explains any 

adaptations/enhancements to the pipeline (e.g., how these were implemented or how they improve the pipeline). 
Short paper 
Academic 

writing 

The short paper looks like a technical report/user manual rather than a research paper. 

The short paper was written for an academic audience. However there are some ideas which were not clearly presented, or it seemed like the discussion lacks originality/argumentation. 

The short paper was written for an academic audience and can potentially be published in a research workshop or symposium. Ideas were presented in a clear and well-argued manner. 
Background and introduction 

The paper does not provide an introduction to the proposed topic (e.g., why it is interesting) and does not clearly present the analytical questions that the SMA project seeks to answer. 

The paper provides an introduction to the proposed topic and presents the analytical questions that the SMA project seeks to answer. However, the motivation for the choice of topic/questions is not argued well enough nor very convincing. 

The paper provides an introduction to the proposed topic and presents the analytical questions that the SMA project seeks to answer. The motivation for the choice of topic/questions is well-argued and convincing. 
Review of related work 

The paper does not provide any review of related work. 

The paper provides a review of related work although there are some shortcomings, e.g., it is not clear how these relate to the group's own work, or some of the mentioned work is outdated/more recent work could have been reviewed.  

The paper provides a good review of related work, showing awareness of recent relevant efforts. How these relate to the group's own work was clearly presented. 
Methodology  

The paper does not provide sufficient details on the group's methodology (including steps for data collection).  

The paper provides details on the group's methodology (including steps for data 
 
 
collection), although some parts need further elaboration. 

The paper provides sufficient and clear details on the group's methodology (including steps for data collection). 
References 

The references included in the paper are not sufficient to support the group's arguments. These were cited and added without following the recommended style/format. 

Sufficient references were included in the paper. However they were cited and provided in a slightly inconsistent style. 

Sufficient references were included in the paper, cited and provided in a consistent style. 
Interpretation of results (should be included as a Results or Discussion section in the paper) 
Analysis and interpretation 

Quantitative results were obtained by the group's SMA pipeline but were not analysed and interpreted in order to answer the research questions set out during the topic proposal stage. 

Quantitative results obtained by the group's SMA pipeline were analysed in order to answer the research questions set out during the proposal stage, but in some parts the interpretation seems exaggerated (or are not aligned with the results). 

Quantitative results obtained by the group's SMA pipeline were adequately interpreted, allowing the group to answer the research questions they set out during the proposal stage.  
Exemplification/ visualisation 

The group did not provide any examples nor make use of visualisation to evidence their analysis and interpretation of quantitative results. 

The group provided examples and made use of visualisation, however some of these do not support or are not aligned with their findings/interpretation. 

The group explained their findings and interpretation by providing supporting examples or making use of suitable visualisation. 
 

In the exceptional case where one or more group members have not put in an acceptable contribution, despite the team’s effort to bring them back into the team and suitable discussions of the issues arising, we implement a grievance procedure. A group can bring forward a "case of grievance" to the COMP61332 teaching staff by providing the following pieces of evidence:

●     minutes of team meetings

●     a written description of the events that led to the break-down of the team work, including a description of the actions that were taken to get the team back on track

The case should be brought forward to the teaching staff no later than one working week after the coursework deadline. The COMP61332 teaching staff will then decide whether the situation is indeed exceptional and warrants a mark re-distribution.


 
[1] https://bit.ly/2tuCUX​ y  
[2] https://www.ieee.org/conferences/publishing/templates.htm​ l  

More products