In this part, we fix existing columns and add new ones that will be useful later on.

We also convert flight_number from integers to character values with the astype() method, noting that these are IDs and have no ordered meaning.

Looking at the mean and median of departure_delay, we see that the values are heavily right-skewed, with a maximum delay of 1988 minutes (~ 33 hours). We choose to filter out flights that are delayed by more than 1 day.

We keep only flights departing from the airports that we want to look at.

We remove rows with missing values with the dropna() method.

# Compute statistics of columns
flights_df_raw.
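The cleaning steps described above can be sketched in pandas roughly as follows. This is only an illustration under stated assumptions, not the original code of this post: the sample rows are made up, the column names origin_airport and arrival_delay and the helper clean_flights are hypothetical, and describe() and isin() are stand-ins chosen here because the original calls are not shown. Only flight_number, departure_delay, flights_df_raw, astype() and dropna() come from the text.

```python
import pandas as pd

# Tiny made-up sample standing in for the real flights_df_raw.
# "origin_airport" and "arrival_delay" are assumed column names.
flights_df_raw = pd.DataFrame({
    "flight_number": [98, 2336, 840, 258, 135, 7],
    "origin_airport": ["ANC", "LAX", "SFO", "LAX", "SEA", "LAX"],
    "departure_delay": [-11.0, 1988.0, 25.0, None, 3.0, 120.0],  # minutes
    "arrival_delay": [-22.0, 1971.0, 18.0, 5.0, None, 110.0],    # minutes
})

# Summary statistics (describe() is an assumption; it reports mean and
# median (50%) side by side, which makes skew easy to spot).
print(flights_df_raw["departure_delay"].describe())


def clean_flights(df, airports, max_delay_minutes=24 * 60):
    """Hypothetical helper applying the steps from the text in order."""
    out = df.copy()
    # Flight numbers are IDs, not quantities, so store them as strings.
    out["flight_number"] = out["flight_number"].astype(str)
    # Drop flights delayed by more than one day (1440 minutes).
    # Rows with a missing delay compare as False and are dropped too.
    out = out[out["departure_delay"] <= max_delay_minutes]
    # Keep only flights departing from the airports of interest.
    out = out[out["origin_airport"].isin(airports)]
    # Remove rows that still have missing values in any column.
    out = out.dropna()
    return out.reset_index(drop=True)


flights_df = clean_flights(flights_df_raw, airports=["LAX", "SFO", "SEA"])
print(flights_df)
```

Working on a copy keeps flights_df_raw untouched, so the raw statistics can be recomputed and compared with the cleaned data at any point.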