U.S. Job Market Skid Chart

September 2nd, 2008

Below is a chart that I would not recommend trying to copy for a few reasons.  First, using the little person icon instead of a bar chart distorts the data and adds to the confusion of the chart. 

Next, the sort also adds to the confusion.  The chart is sorted by change in the number of jobs from the previous year and in ascending order.  I would probably have created the chart so the sort was the percentage of people in each sector and in descending order.  Visually, this would have made the chart more organized.  Optimally, in an interactive version, there would be a sort button to toggle between the change in number and percent of jobs.

Finally, I would have dropped the ,000 and just noted that the changes are in thousands.  Better yet, I may have stuck with percentages all around by going with the percentage change in jobs from the previous year.

where_we_work

Source: BusinessWeek, September 8th Issue

Share/Save/Bookmark

Excel Blackbelt: A Review of Chapters 1-5

August 27th, 2008

Recently, I received a copy of Excel Blackbelt and blogged about my excitement surrounding the book in a previous post.  As I write this, I am roughly half way through the book and am looking forward to finishing it and writing part 2 of the review.  Below are my comments on the first five chapters of the book.

black belt image

Chapter 1 - Chapter one is basically a foundation chapter that sets the stage for the book.  The chapter does contain five excellent principals from Edward Tufte’s Beautiful Evidence book.  I found it interesting that that author makes a point at the end of the chapter to state that the book is not:

  • An advanced programming guide
  • Statistics text
  • Single source dictionary of all things Excel

He goes on to state that the book “is intended to be a guide for the sorcerer’s apprentice - for those professionals who want to demonstrate their own genius, and need only the right coaching”…etc.  Someone could read this as, it covers a little bit of everything and not too much of anything.  I guess it can be extremely difficult to write a book for an audience with a wide range of skills.

The last sentence is very fitting and something I believe exists.  “It’s time to shatter the wall between the untouchable programmer and the professional in need”.  So very true! 

Chapter 2 - Contains very basic principals and a few more advanced concepts towards the end.  Basic concepts like, formatting in Excel, how cells work, ranges, worksheets, comments, links and functions.  More advanced concepts are: conditional formatting, VLOOKUP, OFFSET and MATCH.  At this point, the author introduces practice problems at the end of the chapters.  To me, this addition to the book seems to make it into a supplemental text for academia.  I can relate to this chapter because I use the functions previously stated on a daily basis.

Chapter 3 - This chapter starts with an overview of importing a text file to Excel, which I think is extremely important because I also use this often.  The next part that covers integrating Excel with MS MapPoint somewhat annoyed me.  I have used MapPoint and am familiar with its functionality and value.  What annoyed me was that Excel doesn’t come with MapPoint as it is a stand alone product, which isn’t in the MS Office suite.  Also, there are better technologies out there today to integrate data and maps.  I have found MapPoint to have visualization flaws, which is a topic for another day. 

The section of the chapter that discusses online data sources and integrating or importing web data into Excel is valuable.  What caught me off guard was that the text goes from 0-80 instantly when discussing Uniformly Distributed Randoms.  I have many years of Statistics and could follow the heavy statistics principals, but think a lot of people may be lost.  Don’t get me wrong, I think the section is extremely valuable, but could have been transitioned better.

Speedometer

Chapter 4 - Chapter 4 starts with the basics of Excel charting and especially bar charts.  I like that the author points out some tips, like the fact that Excel can sometimes pick the wrong way to transform data into a chart.  An illustration that I think is not effective is Figure 4.10.  This Figure shows two firms’ inventory/sales by quarter for four years.  I would never show this data in a dual series column chart.  I would have chosen a line graph.  Another idea that I would disagree with is using a stacked bar chart due to the baseline being difference for the second series.  I have commented in previous posts about the ineffectiveness of stacked bars, which can be found using the search feature. 

The next section shows a stacked bar chart with a background image of money on page 73.  Unless very (very) subtle and for a good reason, I would never use a background image.  Something that I think is worthwhile to point out is that it would greatly enhance the text if the pages or at least charts/images were in color. 

Pages 76 and 77 discuss 3-D plots, which can be very tricky (ineffective) for data visualization.  Also, the author gives the reader tips on how to upload images to use in a scatter plot instead of circles, which I would question.  The section on pivot tables is pretty good and I definitely can relate to the author regarding his comments on pivot charts being limited. 

 excel_logo

Chapter 5 - Chapter 5 introduces the term Heuristics, which refers to “codified approaches to developing ideas/decisions/solutions”.  In simple terms, this chapter discusses ways to improve decision-making through coding.  This was an important chapter for me because this concept is something I practice regularly.  Some of the concepts in this chapter become very advanced for the average user.  Concepts and tools like, PCA (Principle Components Analysis), XLStat (an Excel statistical analysis add-on that runs about $500), P-Level, Z-Score, and Multidimensional Bins and Splits may not be common with many readers. 

Other concepts that are discussed are MIN/MAX/STDEV/HLOOKUP (also VLOOKUP), which are extremely useful and I use all the time.  I see this chapter as only useful for those that need to do heavy statistical analysis and not really for everyday Excel or spreadsheet users.  Good concepts in theory or in a text book, but not that practical in the corporate world.  I also understand the difficulty in being able to please everyone.

More to come! 

Click on the image below to get your copy from Amazon!

 

Note: there is no payment or quid pro quo arrangement for these comments or review.

Share/Save/Bookmark

Statistical Computing and Graphics

August 11th, 2008

In a post last week, I presented a flight patterns video from Aaron Koblin.  Hadley Wickam was kind enough to point me to the original flight pattern video, which is really impressive given that it was created 14 years ago.  Click on the image below, or here, to see the original flight pattern video.

Original Flight Paths

There are a bunch of great videos spanning the last 30 years or so in the library section of the American Statistical Association website.  Here is a link to the video section. 

If you are interested in statistical computing and a little history, including PRIM-9, check out this page/video.  The video features: John Tukey and Jerome Freedman.

A special thank you goes to Hadley for his contribution on this post.  Hadley is an Assistant Professor at Rice University with a PhD in statistics from Iowa State University.  To learn more about Hadley and his contributions to statistics and data visualization, click here.

Related:

http://supportanalytics.com/blog/2008/08/visualizing-human-systems/

Exploratory Data Analysis (Addison-Wesley Series in Behavioral Science) by John Tukey at Amazon.com

Data Analysis and Regression: A Second Course in Statistics (Addison-Wesley Series in Behavioral Science) by John Tukey at Amazon.com

Share/Save/Bookmark

Support Analytics - ProSourcing

August 6th, 2008

There is a new page on the Support Analytics website called ProSourcing.  Also, the Solutions page has now been changed to be called Consulting.  In addition, there are a few new quick links under the image on all of the pages to easily navigate to the blog or blog subscription page.

The main reason for this change is because Support Analytics now has two distinct capabilities: Analytics Consulting and Analytics ProSourcing.

ProSourcing is simply the outsourcing of data analyses to Support Analytics.  One of the recurring themes that we have experienced in the past is companies not having the resources or knowledge to perform analytics at the level it is needed.  Another situation that warrants ProSourcing is when a key knowledge expert leaves the company without training anyone or leaving adequate documentation on how analytic functions were completed.

There are some other reasons that we have decided to provide this service to our clients, which can be found on the ProSourcing page.

SupportAnalytics_Site

Check it out and feel free to provide feedback!

Share/Save/Bookmark

Visualizing Human Systems

August 4th, 2008

Earth

Source

Recently, CNET featured a video on Aaron Koblin whose name I had never heard before a few viral videos hit.  Aaron is an artist/designer/researcher focused on creating and visualizing human systems.  Many of Aaron’s visualizations are just stunning as you can see in the image above.  One of his most well-known works is the flight pattern simulation shown in the video below.  I have also included some other popular videos regarding Aaron.  Enjoy!

Flight Pattern Video

Sheep Video

Ten Thousand Cents

Radio Head - House of Cards

Check out the video on CNET by clicking on the image below.

Cnet Video

To see Aaron’s portfolio, click here.

Share/Save/Bookmark

Excel Blackbelt

July 28th, 2008

ChristmasStoryPoster

Just last week, I was reminded of the scene from The Christmas Story where the father (Darren McGavin) finally gets his "contest prize" delivered to the house.  There was so much excitement and mystery about what he had won.  Well, I shared that same type of excitement when the big brown truck delivered my copy of Excel Basics to Blackbelt: An Accelerated Guide to Decision Support Designs directly from the publisher.  This book, which reminds of a college text book, just hit the book stores on July 7th.  There are a few Excel books that I wouldn’t do without, but this book is different from the ones in my library.  This book explores the opportunities in Excel for decision support instead of Excel functions.  Throughout my career, I have created many decision support spreadsheets that helped management make more informed decisions.  I think this book will help build upon that résumé and explore new ideas or design methods.

As I get into the book, I will be sure to share my thoughts here on DSA Insights.  Some sections that caught my attention upon a quick flip through the pages were:

4.1 Value of Data Visualization

4.3 Visualizing Constraints

6.0 The Analytics of Optimization

7.0 Complex Optimization

10.0 Visualizing Complex Analytical Dynamics

Excel

 

Note: there is no payment or quid pro quo arrangement for these comments or review.

Share/Save/Bookmark

Donut Charts!

July 21st, 2008

Doughnut

Instead of calling them pie charts, we’ll just call them donut (doughnut) charts and make them magically delicious.  Wrong.  These two charts, which appeared in the July 14th issue of BusinessWeek, are some of the worst charts I’ve seen in a while.  Note: in the print version, the legend doesn’t cover up the actual chart.  I’m not sure why this wasn’t fixed in the online graphic. 

There have been many articles and blog posts written on the ineffectiveness of pie charts, so I won’t belabor the point more.  For more information about why pie charts (donut charts with the center filled in) are not a good data visualization option, see the links below.  There’s no point in saying the same thing in a different way.  The timing of this post is good being that Seth Godin recently created a lot of stir about bar charts, which led to more pie chart discussions.

Related books:

Creating More Effective Graphs by Naomi B. Robbins

Show Me the Numbers: Designing Tables and Graphs to Enlighten by Stephen Few

Related links on discussion of Bar and Pie charts:

DSA Insights

Jon Peltier

Peltier Technical Services

Jorge Camoes - Charts

Juice Analytics Original

Juice Analytics

Stephen Few

Edward Tufte

Junk Charts

Note: the first pie chart is credited to William Playfair roughly 200 years ago.

Share/Save/Bookmark

DSA Insights Adopts WordPress 2.6

July 19th, 2008

In the world of technology and software, since when does a release happen ahead of schedule?  Not often.  Well, WordPress has version 2.6 available for download about a month ahead of schedule.  The upgrade was pretty painless except for one minor issue.  For some reason, the maintenance mode plug-in needed to be deleted to run the upgrade from my browser.

This video describes the new features for WordPress 2.6.  One of the best new features is the image caption feature to help with citing sources or describing the image.

As with the last upgrade back in May, please let me know (using this form) if you find any bugs with this blog.  Who knows, there may just be a prize involved!

Related:

Share/Save/Bookmark

Possibly $100,000 left on the table

July 16th, 2008

Occasionally, here at Support Analytics, we like to share an experience with you in an effort to pass on some valuable [hopefully] insights.  This is a real story that actually happened a few weeks ago. 

207532495_c0df6242f1

I answered the phone to find a sales person on the other end asking to speak with the head of household, which was my first clue there was going to be a sales pitch coming.  I’m not sure what kept me on the phone because the only thing the caller usually hears at that point is the click of me hanging up.  Reluctantly, I said this was the head of the household (my wife may disagree, but that’s for another post).

After that response, it all became a blur.  The woman talked so fast and word-for-word from a script about how now was the best time to get replacement windows and their company was the best … blah, blah, blah.  I tried to get a quick, “not interested” reply in, but she was so fast in talking, I couldn’t get a word in.  I really wasn’t interested because I just purchased new windows six months ago.  Finally, I tried to interrupt with a, “Miss”, but she wasn’t even fazed.  She kept up at a blinding speed apparently in the “zone”.  Getting annoyed, I said again and louder, “MISS”, which again went unanswered.  Finally, I was really annoyed and just said “STOP” in the phone.  She seemed to be taken back a bit and stopped her tirade.  I informed her that I just bought new windows and was not interested.  Click!

Here is where the lesson comes into play.

If this woman had been nice, spoke in conversation or just a polite manner, I may have shared with her that everyone on our street is probably in the market for new windows because of all the houses are 20 years old with poor initial windows installed.  Also, I may have given her the names of probably five neighbors that I knew were interested in windows.  At almost $20k each, I guess that one call was a missed $100k opportunity for that sales person. 

If she merely asked nicely if I was interested and spoke in a civil manner, I may have shared this information with her.  Those sales probably would have made her year’s quota or at least a nice bonus. 

My guess is that she gets paid by the number of calls she makes or the amount of time she keeps someone on the line, which is a BIG mistake.  Compensation should be based on value and not an arbitrary number that may promote bad behaviors such as this one.  If all else fails, just be nice because it can go a long way.

Share/Save/Bookmark

Hans Rosling Video with Statistical Data Visualizations!

July 11th, 2008

This video is a little long, but Hans does an excellent job presenting statistical data.  It’s worth the time to watch, trust me.

Share/Save/Bookmark