Starting from:

$30

COMP2550-Assignment 1 Tutorial Topic Survey Paper Solved

Part 1: General Questions
 

1.  What is the branch in the survey paper you find most interesting and why?  

 

2.  Write a summary of the branch that you pick in your own words  

 

3.  What are the three papers you would read next if you were to do a research project on that branch. Please explain why you would pick these papers and give their full references.

 

4.  Find and list at least 2 research groups who conduct state-of-the-art research in this topic. Please justify your answer.  

 

5.  Name two open research problems in the field of this survey paper and explain why they are hard and interesting.  

 

 

Part 2: Paper-specific Questions  
 

1.  What are the significant steps for an end-to-end VQA model? For each step, what are the possible techniques?

 

2.  Why is the attention mechanism positively effective for VQA models?  

 

3.  Is the following statement on the attention mechanism correct? Please justify your answer.

 

 

Outputs of attention layers are individual visual and linguistic feature vectors (i.e. Vi and Vq), which cannot be directly used to generate answers. And the fusion of two feature channels is always needed to get a joint representation no matter whether the two features are attended or not. Therefore, we should regard the attention mechanism acting more like feature extraction than feature fusion in a VQA model. 

 

4.  Among all the fusion methods, which one do you think outperforms others? Please justify your answer  

 

5.  Why is information fusion important to VQA? Is information fusion essential for other visuallinguistic problems? You may explain from either the survey’s perspective or your own opinion.

 

6.  After reading through the whole survey paper, can you design a combination of technique series that may result in the best performance? You may refer to Fig.3. in Section 4 for the end-to-end framework.

More products