Data Analysis with R and Google BigQuery

This blog post describes the process of staging data in Google Cloud Storage and then mapping this to Google BigQuery to provide a low-cost SQL interface for Big Data analysis.  There is an R package for connecting to Google Big Query, called bigrquerythat can be used to connect to Google BigQuery and interface with it … Continue reading Data Analysis with R and Google BigQuery

R Package Install from GitHub with DevTools

The background to this is I wanted to install an R package from GitHub as it wasn't available in CRAN.  DevTools is a great utility for doing this but when I tried to follow the instructions supplied it didn't work for me.  After some googling, looking at SO forums etc I ended up with a … Continue reading R Package Install from GitHub with DevTools

Interactive Web Pages for R Analytics

As part of the Coursera Data Science Specialization that I am currently working my way through, I got to explore the R-Studio Shiny web-application framework. This is a great way to publish interactive web pages that provide an interface to simple applications written in R.  The Shiny environment works really well for visualising the impact of … Continue reading Interactive Web Pages for R Analytics

Simple Machine Learning Prediction with the UN World Happiness Data-Set

Introduction This blog-post is the output from a Hackathon session with the Central London Data-Science Meetup Group. The task was to explore the World Happiness Data-Set published by the UN. The data provided includes a set of country metrics that are expected to influence general happiness and a separate summary survey “Happiness Score” for each country. … Continue reading Simple Machine Learning Prediction with the UN World Happiness Data-Set