If The World Were 100 People?

Not only is the topic interesting, but the concept of breaking the global population down into 100 people is brilliant. This infographic is easily understandable, and it conveys a whole lot of information in a clean and concise manner. For more about where the data came from, see the 100 People page.

Top 5 Data Science Blogs

  1. p-value.info - This blog is only about 1 month old, but it is filled with great stuff.  I just hope Carl , a data scientist at One Kings Lane, can keep up the good posts.
  2. Metamarkets Blog - Metamarkets is a startup focusing on data analytics for business users.  The blog contains lots of data science information.  During the summer, the blog ran an excellent series with data scientist interviews.
  3. Kaggle – A great startup with a great blog.  The blog has tips about data science competitions, explanations from winners, and various other data science related posts.
  4. iCrunchData – This is a job site for data-related positions.  That said, the blog is relevant and informative.  They even do data science on job postings for data science.
  5. What’s the Big Data – A frequently updated blog with great links to big data and data science resources. I especially like the “Big Data Quotes of the Week” posts.
Bonus Blogs
  1. Flowing Data – Nathan Dau, the blog’s author, is a PhD student at UCLA.  The blog focuses on visualizations.
  2. Columbia Data Science Course Blog – This was a blog to go along with the Data Science course at Columbia University.  Unfortunately, the blog will no longer be updated since the course is over.  However, it is still worth browsing though, since it covers many of the topics in data science.  It also has some great visualizations.

Interactive Data Visualization for the Web – Free Online Textbook

Interactive Data Visualization for the Web – OFPS – O'Reilly Media is an open textbook for using D3, a javascript library, to merge the following practices.

  1. Data Visualization
  2. Interactive Design
  3. Web Development

The book is in early release and all the sections are available. You are also welcome to comment on any part of the book to help make it better.

Visually becomes a Social Network

Visual.ly recently announced the transition to a social network for data visualizations. Visual.ly aims to be the site for infographic designers. The site now has better profiles, an activity timeline, and more metrics about each infographic plus the ability to follow users. The new features are in preparation for a marketplace to be launched later this year.


Small Note: Most of the infographics I post come from Visual.ly.

The Beauty of Data Visualization – Ted Talk

In this 2010 Ted Talk, David McCandless provides a great example of transforming data into a story. Here are some of the data topics covered.

  • Billion dollar-o-gram What is money spent on?
  • Global media panic (SARS, Swine flu, asteroids, killer wasps, …), pay attention to the gap
  • What country has the most soldiers (per population)?
  • Nutritional Supplement data
  • And More…

This video is worth the 18 minutes. Enjoy!

New Data Science eBook – Free and Open-Source

Jeffrey M. Stanton, member of Syracuse University’s iSchool, just released an open-source ebook about data science. Obviously this book is intended to be used in the curriculum for the new Data Science Certificate Program. In particular, it will be used for two courses on analytics and visualization.

The book is available in the iTunes store or as a PDF. See the book website to get your copy.