$25
Measuring Personality
Carney et al. (2009) use several strategies to assess personality in their investigation of the link between personality and political ideology. This week’s assignment will help you understand aspects of these methodologies and associated challenges.
The survey you took included two different short versions of the Big Five Inventory. You will have the chance to compare these two inventories and to get some experience with the Personal Living Space Cue Inventory (PLSCI) (Gosling et al. 2005), which was also used in Carney et al. (2009).
Data Details:
• File Name: Oct7ClassData.csv
• Source: These data are from the survey you took in class. You took the BFI-10, a short version of the Big Five Inventory (Rammstedt and John 2007) and the Ten-Item Personality Inventory (TIPI), which is a different instrument designed to quickly meausre the Big Five (Gosling et al. 2003). You then answered the same questions used by Carney et al. (2009) to assess political attitudes in some of their studies.
Variable Name
Variable Description
Overall
Self-reported overall ideology on a 1-5 scale with 1 being extremely liberal and 5 being extremely conservative
Social
Self-reported social ideology on a 1-5 scale with 1 being extremely liberal and 5 being extremely conservative
Economic
Self-reported economic ideology on a 1-5 scale with 1 being extremely liberal and 5 being extremely conservative
Random ID
A randomly generated respondent identifier
BFI_extraversion
The average of the two items on the BFI-10 associated with extraversion
BFI_agreeableness
The average of the two items on the BFI-10 associated with agreeableness
BFI_conscientiousness
The average of the two items on the BFI-10 associated with conscientiousness
BFI_emot_stability
The average of the two items on the BFI-10 associated with emotional stability (also known as neuroticism, although low emotional stability is the same as high neuroticism)
BFI_openness
The average of the two items on the BFI-10 associated with openness
TIPI_extraversion
The average of the two items on the TIPI-10 associated with extraversion
Variable Name
Variable Description
TIPI_agreeableness
The average of the two items on the TIPI-10 associated with agreeableness
TIPI_conscientiousness
The average of the two items on the TIPI-10 associated with conscientiousness
TIPI_emot_stability
The average of the two items on the TIPI-10 associated with emotional stability(also known as neuroticism)
TIPI_openness
The average of the two items on the TIPI-10 associated with openness
The last 20 variables in the data are the actual items from the BFI-10 and the TIPI. If you want to refresh your memory about the questions and learn which questions were meant to go together, you can find more about the BFI-10 here (scroll down to the “Is there a shorter version of the BFI available?” question) and the TIPI here.
Don’t forget to load the data.
Question 1
Check the Google Drive for your photo assignments and download the assigned photos of living space (you should be assigned to two photos). Use the worksheet on the Google Drive to do an assessment based on the PLSCI. Based on the PLSCI and your own general impressions, fill out the new class survey with your guess about each person’s openness, conscientiousness, and overall political ideology without looking at the actual data. Make sure to write down your responses to the survey since you won’t have access to them after you submit. (Page 391 of Gosling et al. 2002 might help you figure out which items on the inventory will be most helpful here.) Use the random ID to check your guesses. How did you do? (If you are finding the PLSCI difficult and frustrating, peek ahead to the next few questions.)
personality_data %>% filter(random_id == "40631")
## # A tibble: 1 x 34
## overall social economic random_id bfi_extraversion bfi_agreeablene~
## <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
## 1 2 2 2 40631 2 3
## # ... with 28 more variables: bfi_conscientiousness <dbl>,
## # bfi_emot_stability <dbl>, bfi_openness <dbl>, tipi_extraversion <dbl>,
## # tipi_agreeableness <dbl>, tipi_conscientiousness <dbl>,
## # tipi_emot_stability <dbl>, tipi_openness <dbl>, bfi10_1 <dbl>,
## # bfi10_2 <dbl>, bfi10_3 <dbl>, bfi10_4 <dbl>, bfi10_5 <dbl>, bfi10_6 <dbl>,
## # bfi10_7 <dbl>, bfi10_8 <dbl>, bfi10_9 <dbl>, bfi10_10 <dbl>, tipi_1 <dbl>,
## # tipi_2 <dbl>, tipi_3 <dbl>, tipi_4 <dbl>, tipi_5 <dbl>, tipi_6 <dbl>,
## # tipi_7 <dbl>, tipi_8 <dbl>, tipi_9 <dbl>, tipi_10 <dbl>
ANSWER: I was assigned Random ID 40631, and I was somewhat inaccurate. For example, I gave them a 2 on bfi_openness when they were really a 4. On the other hand, I guessed their bfi_conscientiousness and overall ideology scores correctly. There wasn’t much to work with in their room photo - it was average in neatness and aesthetic and all-in-all sparsely decorated. But I was able to extrapolate a bit from what I saw, which explains why I guessed ideology correctly.
Question 2
Each photo should have at least two people assigned to code it. Confer with each of the people who were also assigned to your photo and see what their ratings were on the PLSCI, as well as their guesses about openness, conscientiousness, and ideology. What does this tell you about the PLSCI? Read a bit about intercoder (aka inter-rater) reliability and reflect on its importance in research like this.
ANSWER: I was unable to find my partner - sorry.
Question 3
What challenges did you encounter administering the PLSCI? What do you notice that might complicate applying this inventory today?
ANSWER: As I stated in Question 1, there was little to work with in the photo. Additionally, the questions were outdated - nobody keeps CDs or magazines anymore, and some people don’t even have books on their shelves (though most Harvard students seem to).
Question 4
How would you redesign the PLSCI to make it more useful/current?
ANSWER: It’d be interesting to see someone’s, say, music streaming history, or even their Instagram profile. Perhaps these are already the standard modern research methods. I think I’m usually able to learn a lot about someone based off of their cultural tastes or by perusing their social media.
Question 5
Let’s see how the class compares to a large dataset of people who have taken one of these personality inventories. Gosling et al. (2003) report findings from administering the TIPI to about 1800 undergraduates.
Compare the class results to the published norms (look at the table on page 526, all ethnicities, Whole sample; i.e. the upper left corner of the table). Make a table or a plot of the class results and then write your thoughts about how these are similar or different to international norms and why that might be the case.
personality_data %>%
filter(!is.na(bfi_conscientiousness)) %>% summarize(mean_E = mean(bfi_extraversion), mean_A = mean(bfi_agreeableness), mean_C = mean(bfi_conscientiousness), mean_ES = mean(bfi_emot_stability), mean_O = mean(bfi_openness))
## # A tibble: 1 x 5
## mean_E mean_A mean_C mean_ES mean_O
## <dbl> <dbl> <dbl> <dbl> <dbl>
## 1 3.48 3.43 3.8 3.01 3.56
Question 6
The BFI-10 and the TIPI are supposed to measure the same five personality traits. To what degree do they seem to be measuring the same constructs in our class sample? Compare each BFI-10 index to its counterpart TIPI index. You can do this numerically by calculating the correlation coefficient, graphically with a plot, or both. Comment on what you find.
Question 7: Data Science Question
We are interested in whether personality is associated with political ideology. Multiple regression is one approach to simultaneously testing associations between several indepdendent variables and a single dependent variable of interest. Pick one of the personality inventories and use all of the trait indices in a regression model with at least one of the political ideology questions as the dependent variable. Interpret the results. (Note that you can use OLS for this question, even though it might not be the most appropriate model. As a bonus, you can explain why OLS might not be the best model and suggest an alternative. As another bonus, take a look back at all of the regressions you have just run. Why might we be skeptical of any individual p-value associated with one of these regression coefficients?) Non-data science students should consider tackling part of this question but only using bivariate regression (one political ideology dependent variable and one personality independent variable.)
mod <- lm(social ~ bfi_openness + tipi_openness, data = personality_data) stargazer(mod, type = "text")
##
## ===============================================
## Dependent variable:
## ---------------------------
## social
## -----------------------------------------------
## bfi_openness
-0.118
##
##
(0.090)
## tipi_openness
-0.032
##
##
(0.087)
## Constant
2.330***
##
(0.482)
##
## -----------------------------------------------
## Observations 81
## R2 0.030
## Adjusted R2 0.005
## Residual Std. Error 0.736 (df = 78)
## F Statistic 1.188 (df = 2; 78)
## ===============================================
## Note: *p<0.1; **p<0.05; ***p<0.01
personality_data %>% ggplot(aes(x = bfi_agreeableness + tipi_openness, y = social)) + geom_jitter(width = 0.05, height = 0.05) + labs(title = "No Relationship Between Self-Reported Social Ideology\n y = "Self-Reported Social Ideology Score") +
geom_smooth(method = "lm") + theme(plot.title = element_text(hjust = 0.5, face = "bold"))
and Sum of TIPI and BFI-10 Ideolo
No Relationship Between Self−Reported Social Ideology and Sum of TIPI and BFI−10 Ideology Scores?
ANSWER: A regression model shows no significant relationship between one’s self-reported social ideology score and the BFI-10 and TIPI personality indexes for openness. The coefficients for these variables is not statistically significant. A scatter (jitter) plot also shows the rather arbitrary spread of the sum of these indexes in relation to the self-reported social ideology score. This would generally be a surprising result, but our sample size was both small (n = 81) and skewed liberal compared to more representative experiments.
Question 8: Data Science Question
Is the Big Five really best characterized as five factors? If we ask ten questions on a personality inventory, we might think that these questions actually reflect only five underlying (or latent) variables. In fact, this is the supposition of the BFI-10 and the TIPI. Factor analysis is one way to examine data and investigate if the dimension of the data can be reduced from many variables to fewer underlying factors. Conduct
a factor analysis on either the BFI-10 or the TIPI questions (note, these are the numbered variables, not the named variables) You may want to use the fa.parallel function from the psych package, which you can read more about here. How many factors does your analysis suggest best explain the class data? Optional bonus: run the code several times. Does your answer change? If so, why?
personality_data_fa <- personality_data %>%
select(overall, bfi10_1:bfi10_10) fa.parallel(personality_data_fa, main = "Personality Data Factor Analysis")
## Parallel analysis suggests that the number of factors = 3 and the number of components = 1
ANSWERL I ran factor analysis on the BFI-10 questions, and parellel factor analysis suggests that there are four factors. This makes sense - researchers extrapolated the Big Five variables from similar analysis, though a small sample size may be the reason I found one fewer factor.
Question 9
If you still have your random ID, take a look at your own BFI-10 and TIPI index scores for each factor.
How do your scores compare to one another across instruments? How do your ratings on each of the Big Five factors compare to your own self image? (Remember that the BFI-10 uses a five point scale while the TIPI uses a seven point scale.)