Jul 11, 2024
Airport Codes Dataset: Contains airport IDs, city, state, and airport name
Flight On-Time Performance Dataset: Includes details such as year, month, and day of flights, carrier, origin, and destination airport IDs, scheduled departure/arrival times, and actual departure/arrival delays
departure_delay, arrival_delay, departure_d15, arrival_d15, canceled, divertedarrival_d15: Binary target variable indicating if a flight is delayed by more than 15 minutesEdit Metadata for Column Naming:
city, state, and name to origin_city, origin_state, and origin_airport for the origin airport IDsJoin Datasets:
Flight On-Time Performance with Airport Codes on origin_airport_iddestination_airport_idorigin_airport_id, destination_airport_id, canceled, and divertedClean Missing Data component to impute missing values
departure_delay and arrival_delay)arrival_d15Two-Class Boosted Decision Treemax leaves, min samples per leaf, learning rate, number of trees)Tune Model Hyperparameters (select metric: F-score for imbalance data)Random Grid search for efficiencyTrain Model with the best hyperparameters obtained