The file StudyArea.csv contains wildfire data from 1980 to 2016 for the states California, Oregon, Washington, Idaho, Montana, Wyoming, Colorado, Utah, Nevada, Arizona, and New Mexico. There are about 400k fires and for each one the dates, location, and other relevant data is included.
In this example we analyze the data from this file to know what states have had most large fires and if the number and size of the fires has been increasing over the years. Our focus is on wildfires that burned more than 1000 acres. Remove NAs and use library ggplot2.
1. Has the overall number of wildfires increased or decreased in the past few decades?
2. Plot the number of fires by State and Year to find out in which states the number of wildfires has increased the most. Use facet_wrap(∼STATE) to show for different States, the number of fires by Year.
3. Has the overall acreage burned increased over time?
4. Plot the log of the total acres burned by State and Year to find out in which states the total acres burned of wildfires has increased the most. Use aes(x=YEAR, y=log(ACRES) to plot the log of acres burned and facet_wrap(∼STATE) to see a plot for each state.
5. Is there a difference in the size of wildfires that were caused naturally as opposed to human induced? (Select Natural and Human from column Cause and group by CAUSE and year. Plot the log of total acres burned by Year with two lines, one for Human and other for Natural cause.)
6. Make a violin plot to see the distribution of the log of acres burned by state. Show a boxplot inside each violin. Use layers geom_violin()+geom_boxplot(width=0.1) to create the graph. One vertical violin+boxplot per state.
1
7. Does the average wilfire size differ by federal organization?
8. Has the length of the fire season increased over time? Make a plot with two layers of geom_line() to create lines displaying the earliest and latest start dates of fires by year. You may need to convert the start date of each fire into the day of the year (i.e. February 1st would be 32nd day in the year). Group by Year. Add a smoothed regression line to both line graphs. The length of the fire seasons increased if the gap of these lines increase.