This blog post describes the process of staging data in Google Cloud Storage and then mapping this to Google BigQuery to provide a low-cost SQL interface for Big Data analysis. There is an R package for connecting to Google Big Query, called bigrquerythat can be used to connect to Google BigQuery and interface with it … Continue reading Data Analysis with R and Google BigQuery
R
R Package Install from GitHub with DevTools
The background to this is I wanted to install an R package from GitHub as it wasn't available in CRAN. DevTools is a great utility for doing this but when I tried to follow the instructions supplied it didn't work for me. After some googling, looking at SO forums etc I ended up with a … Continue reading R Package Install from GitHub with DevTools
Interactive Web Pages for R Analytics
As part of the Coursera Data Science Specialization that I am currently working my way through, I got to explore the R-Studio Shiny web-application framework. This is a great way to publish interactive web pages that provide an interface to simple applications written in R. The Shiny environment works really well for visualising the impact of … Continue reading Interactive Web Pages for R Analytics
Connecting R Studio to Hadoop via Hive
This is a step-by-step guide to connecting an R Studio client session running on a PC to a remote Hive server running on Hadoop. Although Hive is getting a bit long in the tooth and is falling out of fashion, this is a very easy way to publish data from a Hadoop cluster to end-user … Continue reading Connecting R Studio to Hadoop via Hive
Simple Machine Learning Prediction with the UN World Happiness Data-Set
Introduction This blog-post is the output from a Hackathon session with the Central London Data-Science Meetup Group. The task was to explore the World Happiness Data-Set published by the UN. The data provided includes a set of country metrics that are expected to influence general happiness and a separate summary survey “Happiness Score” for each country. … Continue reading Simple Machine Learning Prediction with the UN World Happiness Data-Set