Plot Your Data

Originally posted on Data Column: The Collaborative Student Blog for the Institute for Advanced Analytics

Visual data exploration, or how filters will change your life

 

Plotting your data is a necessary first step in any project driven by a large data set, whether that project is forecasting, predictive modeling or just providing summary statistics.

There are many ways to plot in as many software packages as you can imagine.  I’ve enjoyed Tableau for a few reasons.

 

1- For larger data sets, being able to summarize millions of rows in an interactive picture is a plus.

 

2- The ability to connect directly to SAS data sets is especially useful.

 

3- Filters.  I love using filters to subcategorize data.  If you are used to exploring your data in SAS, just think of Tableau filters as dynamic versions of the SAS “where” statement in a data step.

I use Tableau to connect to my data and then employ the filters to dynamically pinpoint missing and miskeyed values.  The filters allow me to exclude these values from the visualization without altering the data set itself.
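For readers who think in code, here is a minimal sketch of the same idea outside Tableau: a filter is just a predicate applied to a copy of the view, so the source rows are never changed. The field names, the sample rows and the `KNOWN_DEPARTMENTS` set are all invented for illustration and are not taken from the actual data set.

```python
# Hypothetical budget rows; field names and values are invented for illustration.
rows = [
    {"department": "Administration", "fiscal_year": 2010, "amount": 1200.0},
    {"department": "Administration", "fiscal_year": 2011, "amount": None},  # missing value
    {"department": "Revenue", "fiscal_year": 2010, "amount": 950.0},
    {"department": "Revnue", "fiscal_year": 2011, "amount": 400.0},  # miskeyed name
]

# Assumed list of valid department names (hypothetical).
KNOWN_DEPARTMENTS = {"Administration", "Revenue"}


def is_clean(row):
    """Filter predicate: keep rows with a known department and a non-missing amount."""
    return row["department"] in KNOWN_DEPARTMENTS and row["amount"] is not None


# The filtered view is a new list; the original rows are left untouched.
view = [row for row in rows if is_clean(row)]

# The rows the filter excluded are the ones worth inspecting by hand.
flagged = [row for row in rows if not is_clean(row)]

print(len(rows), len(view), len(flagged))
```

Changing the predicate changes the view, much like toggling a filter, while `rows` stays exactly as it was loaded.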

 

Once I’ve found some interesting relationships, I can use the most useful filter variables and their values as a guide for traditional SAS programming and SQL queries.

Lastly, I appreciate the ability to output the data used to create any visualization, as well as to see and export the full underlying data.

Showing is better than telling, right?  Up next, an example of a visualization built primarily for exploration...

 

Visualize Whirled Peas

 

Let’s say for the sake of argument you don’t have any finance or budgeting background.

 

Let’s also pretend you’re given a data set of all the General Government state budget line items for the past 13 years for North Carolina: about 3 million rows of transactional data.

 

Finally, let’s pretend your team needs to present to representatives from the Office of State Budget and Management (OSBM).

 

With very little domain knowledge, how are you going to understand the data well enough to present it to subject matter experts?

 

My answer: plot it and explore it with filters.

Reversion Exploratory Dashboard

 

The visual above was created in Tableau but you could visualize using other programs.

 

Initially I experimented with different fields on both the X and Y axes, referring often to the data dictionary.  Instead of writing a new query in code for each view, I could change the view with a simple drag and drop.

 

In the view above, I needed actual spending in relation to authorized spending to see when departments went over or under budget.  I used the filters to get to the correct actual and authorized fields, as well as the correct fund and account category, but I only knew what to look for after inspecting many options and seeing all aspects of the data set.  The hierarchy of filters let me drill down to the individual account code.
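As a rough illustration of the comparison behind that view, the sketch below filters line items to a single fund and then subtracts authorized from actual spending for each department. It is only a sketch under stated assumptions: the field names (`department`, `fund`, `kind`, `amount`), the sample figures and the `budget_variance` helper are hypothetical and do not reflect the real OSBM data layout.

```python
from collections import defaultdict

# Hypothetical line items; field names and figures are invented for illustration.
line_items = [
    {"department": "Administration", "fund": "General", "kind": "authorized", "amount": 1000.0},
    {"department": "Administration", "fund": "General", "kind": "actual", "amount": 1100.0},
    {"department": "Revenue", "fund": "General", "kind": "authorized", "amount": 800.0},
    {"department": "Revenue", "fund": "General", "kind": "actual", "amount": 650.0},
    {"department": "Revenue", "fund": "Special", "kind": "actual", "amount": 75.0},
]


def budget_variance(items, fund):
    """Return {department: actual - authorized} for the chosen fund."""
    totals = defaultdict(lambda: {"authorized": 0.0, "actual": 0.0})
    for item in items:
        if item["fund"] == fund:  # the "filter" step
            totals[item["department"]][item["kind"]] += item["amount"]
    return {dept: t["actual"] - t["authorized"] for dept, t in totals.items()}


variance = budget_variance(line_items, fund="General")
for dept, diff in sorted(variance.items()):
    status = "over" if diff > 0 else "under" if diff < 0 else "on"
    print(f"{dept}: {status} budget by {abs(diff):.2f}")
```

A positive difference means the department spent more than was authorized. Each extra condition in the `if` plays the role of one more level in the filter hierarchy, down to an individual account code if the data carries one.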

 

The exploratory sheet I used is here

 

Feel free to play with it and create some views that just don’t make any sense.

Why was the point-and-click visual approach better than coding a number of visuals to see data relationships?  To me it felt more like exploring an unfamiliar physical object.  I found it easier to pull in variables one at a time and see them here than to code one variable or set of variables at a time into a static output.  Bottom line: the dynamic approach got me to insights faster.

The exploratory sheet was the basis for a suite of dashboards created using the same filter-based data exploration.  The process was:

    • explore a set of variables creating a view

 

 

    • combine those views to answer questions and provide insight into the data

 

Ultimately, we wanted to allow the state budget office to dynamically explore their data in ways they might not have thought of before.   See Dashboard here

 

I learned a number of tips and techniques while creating this suite of dashboards, which I’ll summarize in a future post.

 

Important note:  The OSBM data set report and presentation was a team effort, and while much of the data exploration I discuss here was my own work, it is due in no small part to the hard work of the entire team.  The above is posted here with their kind permission.  Go Team Blue 3!

 

 

Elevator pitches that just won’t work for IAA Employer Information Sessions

 

I sometimes feel awkward mingling at networking events, and the thought of pitching myself in 20 seconds with something memorable gives me the heebie-jeebies. Here are a few pitches or memorable phrases that I know won’t work, but that may help exorcise my mental block demons. Hopefully, with these out of the way, I can come up with something that does work.

 

Hello, (pause) Evan Miracle (Shake hand, maintain eye contact),

 

  1. I’m an avid kickboxer and four-time Octagon of Doom champion.
  2. I’m a childhood survivor of a concerted campaign of wedgies.
  3. My body is made of nearly 50% aftermarket computer parts from Radio Shack.
  4. I have three children so these dark circles are all natural with no zombie makeup required.
  5. I once ate 11 hamburgers and 2 large fries as well as a large vanilla shake in one sitting for a bet.
  6. I am a new convert and Tableau zealot (best used during mingling opportunities at SAS).
  7. Scream “Constant Vigilance!” Best used when their back is turned.
  8. I wandered in here from ABB on the third floor.  I’m just here for the free food.
  9. What’s your sign? You’ve got a very interesting and dynamic aura.
  10. I’d like to recite for you all of the prologue to Chaucer’s Canterbury Tales, “Whan that Aprille with his shoures soote …”
  11. Do you ever feel like squirrels are watching you from the trees and oh so silently judging you? Me too! (Note: do not wait for an answer before saying “Me too”.)

 

It was great to meet you and thank you for coming to talk with us today.

 

Ask question then …

 

I would love to follow up with you by email as I have a few more questions. Can I give you my card?