Monday, June 23, 2014

SAS University Edition

About time SAS did this....

SAS University Edition isn't a watered-down version of our software. You get the most recent releases of the same world-class analytics software used by more than 70,000 business, government and university sites around the world, including 91 of the top 100 companies on the Fortune Global 500® list. That means you'll be using the most up-to-date statistical and quantitative methods whenever and wherever – right on your own PC, Mac or Linux workstation.

Follow the link and download yourself a copy.

From Quants to Data Scientists

A nice blog post by Yves de Montcheuil on the journey from Quant to Scientist.

Steve Miller on Data Distributions

A nice blog post on distributions using R. 
 
....the first priority with a new data set revolves on determining the distribution of values for each of the attributes. Initially, we wish to see frequencies for the responses of each variable. Those give us a general sense of the data, its distribution and its quality. For categorical attributes, we prefer to visualize frequencies sorted from most to least in an unadorned graphic; for numeric attributes that assume many different values, we like histograms – and perhaps even the more sophisticated kernel density plots – to detail the shape of the data.
 
Ought to be some kind of rule that nothing can start until histograms are done. Count the number of times you see an average referenced during the day with no sense of its dispersion. Seems like data analyst malpractice, 'cept we're never the ones speaking, and only find ourselves quoted.
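
For anyone who wants to try the "histograms first" habit without firing up R or SAS, here is a rough sketch in plain Python. It is not from Steve Miller's post. The function names (sorted_frequencies, histogram_counts, describe) and the sample numbers are made up for illustration, and the equal-width binning is just one simple choice among many.

```python
from collections import Counter
from statistics import mean, stdev


def sorted_frequencies(values):
    """Return (value, count) pairs sorted from most to least frequent."""
    return Counter(values).most_common()


def histogram_counts(values, bins=5):
    """Count numeric values in equal-width bins spanning min..max."""
    lo, hi = min(values), max(values)
    # Fall back to a width of 1.0 when every value is identical.
    width = (hi - lo) / bins or 1.0
    counts = [0] * bins
    for v in values:
        # The maximum value would land one past the last bin, so clamp it.
        idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    return counts


def describe(values):
    """Report an average together with its dispersion, never alone."""
    return {"mean": mean(values), "stdev": stdev(values)}


if __name__ == "__main__":
    # Hypothetical sample data, for illustration only.
    colors = ["red", "blue", "red", "green", "red", "blue"]
    for value, count in sorted_frequencies(colors):
        print(f"{value:<6} {'#' * count}")

    tight = [48, 49, 51, 52]
    wide = [10, 30, 70, 90]
    print(histogram_counts(tight + wide, bins=4))
    print(describe(tight))
    print(describe(wide))
```

The two made-up lists, tight and wide, share the same average but have very different spreads, which is exactly what a lone average hides.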