After reviewing the flaws of the previous visualization of the DOT Airline performance data in part 1, I created an improved version with the same recordsets. It is a separate viz because the first version have some mistakes due to the number conversion during the csv import. I cleaned up, checked the data and used calculated fields to derive the sum of delays.
Airline Performance in the US 2015
The basic concept is still the same, the matrix on the top left controls the dashboard, initially you see all data for 2015 combined, clicking into cells drills down.
I changed the barchart to stacked bars comparing total to delayed flight in one bar for each month.
I moved the split delay reasons into a separate bar chart and added a pie chart which reveals the main reason for delays (surprisingly weather and security have the smalles share!) The 2 lists are a Top 10 style lists highlighting the airports and airlines with the most delays.
How does the visualization transport information ? Let’s look at the strong and weak points of the second iteration.
+ The key information presentation is improved. We can see the viz is about delays.
– The dashboard starts to look a bit disorganized and the viewer eyes are moving around without a centre of attention.
+ The barchart now makes sense, you can compare total flights and delays.
– The detail delay reason over time does not create too much value as the distribution of reason is quite similar.
Conclusion: Spending more time on both data and visualizations improved the overall impact, though a bit cluttered.
Going beyond sample datasets and basic visualizations I was looking for open data in my professional domain, the aviation and airport industry. Potential candidates for visualizations are connections, routes, flight plans, airport and airline performance. Performance is usually the comparison of scheduled operations vs. actual milestones. The delay of arriving or departure flights is not only affecting passengers and many parties inside and outside the airport community, but it is driving sentiments, perception and reputation and eventually costs money. This kind of data is not something operators like to release but thanks to the Freedom of Information Act (FOIA), a US Federal law, public gets access to all kind of statistics. From the US DOT (Department of Transportation) you can access and download a variety of datasets, one of them is the On-Time Arrival Performance of US airlines in the US and their delay causes since the year 2003 (link). You can filter by airline, airport and timeframe, review the summary on the DOT website or download the set as CSV for your own analysis. I downloaded the complete dataset for 2015, a 2,25 MB file with roughly 13.500 records.
Arrival Delays in Tableau
Airline Delays in the US in 2015 by DOT
It provides total arriving flights, cancelled and diverted flights, the delay count and total time by reason (weather, carrier, NAS, security, late aircraft) for each month-airport-airline combination for 14 carriers at 322 airports.
I was offline for quite a while because shifting from one continent to another. But now regular posts should be rolling in again.
I am running a couple of instances in pre-production requirement mode and changed from a standard EBS volume to a IOPS volume for the DB instance or the volume with the DB files. I could not identify a reasonable increase of performance, maybe a misconception that IOPS volumes will boost performance, rather provide a defined and consistent random access I/O throughput. I must admit I did not use a value higher than 1000.
I decided to return to a standard ESB volume for my database as its performance did not benefit from the IOPS type (the DB is not overly busy too).
You cant change type and size of an EBS volume on the fly.
After Oracle took over SUN and they dropped OpenSolaris I stopped using or even looking at it. Now a customer is keen on going the Solaris way and I have a chance to get my hands on it (again). I dont have a SUN SPARC sitting here but Oracle continues to support i386 architecture and they have both the iso file with the text installer and even a ready to run virtual appliance ready to download (link). Oracle released version 11 of Solaris on 2011 November 9. Did you know SUN’s first OS goes back to 1982 as Sun UNIX 0.7 ?
I am not sure what happen to Solaris Express 11 which disappeared completely from Oracle’s website. Will there be another Express version for “free”/development/evaluation use ?