# List of packages required for this analysis
# Check if packages are not installed and assign the
# names of the packages not installed to the variable new.pkg
# If there are any packages in the list that aren't installed,
# Install Chester's pnwflights14 package (if not already)
devtools::install_github("ismayc/pnwflights14")
data("flights", package = "pnwflights14")

The dataset lends itself to a lot of interesting questions. Here I will delve further into some of the questions I addressed in two recent workshops I led in the Fall 2015 Data Reed Research Skills Workshop Series.

The questions I will analyze by creating tables are:
1. Which destinations had the worst arrival delays (on average) from the two PNW airports?
2. How does the maximum departure delay vary by month for each of the two airports?
3. How many flights departed for each airline from each of the airports?

To address the first question, we will use the dplyr package written by Hadley Wickham as below. We'll use the top_n function to isolate the 5 worst mean arrival delays.

summarize(mean_arr_delay = mean(arr_delay, na.rm = TRUE)) %>%

This information is helpful, but you may not know which airport each of these FAA airport codes refers to. One of the other data sets included in the pnwflights14 package is airports, which lists the names. Here we will match the codes to the names of these airports using the inner_join function in dplyr.
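The steps described above (average the arrival delay per destination, keep the worst few with top_n, then attach airport names with inner_join) can be sketched roughly as follows. This is only an illustration, not the original analysis code: flights_toy and airports_toy are hypothetical stand-ins with made-up values, and the column names faa and name are assumed to mirror the airports data set in pnwflights14. It keeps the top 2 rather than the top 5 because the toy data has only three destinations.

```r
library(dplyr)

# Hypothetical stand-in for the flights data (values are made up)
flights_toy <- data.frame(
  origin    = c("PDX", "PDX", "SEA", "SEA", "PDX", "SEA"),
  dest      = c("ORD", "ORD", "ORD", "DEN", "DEN", "LAX"),
  arr_delay = c(30, NA, 10, 5, 15, -4),
  stringsAsFactors = FALSE
)

# Hypothetical stand-in for the airports data (assumed columns: faa, name)
airports_toy <- data.frame(
  faa  = c("ORD", "DEN", "LAX"),
  name = c("Chicago Ohare Intl", "Denver Intl", "Los Angeles Intl"),
  stringsAsFactors = FALSE
)

# Mean arrival delay per destination, ignoring missing delays,
# then keep the 2 destinations with the largest means
worst_delays <- flights_toy %>%
  group_by(dest) %>%
  summarize(mean_arr_delay = mean(arr_delay, na.rm = TRUE)) %>%
  top_n(n = 2, wt = mean_arr_delay) %>%
  arrange(desc(mean_arr_delay))

# Match each FAA code to its airport name
named_delays <- worst_delays %>%
  inner_join(airports_toy, by = c("dest" = "faa")) %>%
  select(name, dest, mean_arr_delay)

print(named_delays)
```

With the real data you would replace the toy data frames with flights and airports from pnwflights14 and set n = 5. Note that recent dplyr releases mark top_n as superseded by slice_max, though top_n still works.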