$30
This assignment uses the data from the previous homework.
You have imaginary data on the monthly yields for Pacific fish trawling companies (fishbycatch.csv). An environmental nonprofit targeted these firms and implemented a program designed to reduce bycatch. As part of the program, the nonprofit contacted firm managers and provided information about best practices to reduce bycatch. The program was implemented in two phases. In January 2018, the nonprofit contacted half of the firms. The next year in January 2019, the nonprofit contacted the remaining firms.
You are interested in whether the program worked or not and decide to use this panel data to empirically estimate the effect of the program. You realize that you have a treatment and control group in pre- and post-treatment periods due to the program’s rollout, so you think a difference-in-differences design is a good approach. You have the following data:
Variable
Description
firm
Firm identification number
shrimp*
Pounds of shrimp in month *
salmon*
Pounds of salmon in month *
bycatch*
Pounds of bycatch in month *
firmsize
Size of fishing fleet
treated
=1 if firm received information treatment in January 2018
Table 1: Variable descriptions for homework 5.
1. You now would like to allow and control for firm-specific fixed-effects. In particular, you would like to allow for an unobserved effect ci that varies at the firm level but not over time:
bycatchi,t = ci + λt + δtreati,t + βXi,t + ui,t. (1)
(a) Generate indicator variables for each firm. Include these indicator variables in your OLS regressionto control for fixed effects directly.
(b) Perform the “within-transformation” on all of the dependent and independent variables by demeaning each variable (i.e. instead of estimating yi,t = βxi,t + ξi,t, estimate yi,t − y¯i = β(xi,t − x¯i) + ei,t).
(c) Display the results of your estimates from (a) and (b) in the same table, using the same standarderrors or confidence intervals from Homework assignments 3 and 4. Omit the estimates of the coefficients on the month and firm indicators in your table. How do the results from (b) compare to (a)? How do these estimates compare to the previous estimates of the treatment effect and how does the interpretation change? (Note for the future that standard errors from (a) are typically “wrong”, but do not worry about that for this homework. In addition, (a) is computationally costly when the panel size is large—in general you should use the within transformation to control for fixed effects.)