Practical remote working: part 1 – hardware and environment

In this first post of the series we will look at the hardware you will need and the physical environment that you should aim to create. Where applicable I will distinguish between advice for individual remote workers and advice for office workers who need to work with remote team members. Hardware So, let us start […]

A practical guide to effective remote working

Over the last few months I have seen a couple of remote working guides appear in my inbox, sharing tips and best practice from across the industry on getting the best out of remote work. While I found a lot of these articles interesting and useful, I could not help but notice that a lot […]

SSIS frameworks and unnecessary complexity

The notification email arrives in my inbox carrying the unwelcome message that the SSIS job failed. I instantly get a sinking feeling in my stomach: this is going to be an unpleasant morning. I open Management Studio and get to work looking at the SSIS Catalog execution reports to see if I can track down […]

Creating a horizontal comparison bar chart using Tableau

I have put together a screencast with step-by-step instructions on how to create a horizontal comparison bar chart in Tableau. This can be problematic when done incorrectly, leading to lots of time wasted trying to align two separate bar charts on a dashboard. With this method you will create a single chart making […]

Why Source to Target Mapping documents matter

We as developers really do not like writing documentation. I am quite sure that, given a choice, some developers would rather walk over broken glass than sit down and create documentation for their solutions. Sure, there are some exceptions to this, but in general writing documentation is not something that we enjoy nearly as […]

Quick and dirty test of Google BigQuery’s ability to scale

As a quick weekend experiment I thought it might be a good idea to look at how BigQuery scales. To test this I used the dataset that I had already created in BigQuery for my previous blog post comparing HDInsight + Hive against BigQuery. One of the first challenges with such […]

HDInsight + Hive vs BigQuery – A Detailed Comparison

A big thank you goes to Daniel Haviv for his suggestion to use ORC with Snappy compression over Tez (with Vectorised reads) as well as the advice he provided to easily set this up. I have updated the post with the figures from this configuration. A while ago I wrote about using Google BigQuery […]

Missed opportunity: Power Query as data source in Azure Machine Learning

If you are doing anything around Data Science and Machine Learning in the Microsoft space then I am sure that you have come across Azure Machine Learning. Azure ML is Microsoft’s push to bring machine learning to the masses, kind of the same way that Microsoft has done everything in its power to bring Business Intelligence […]

A fresh approach to delivering projects using Power BI

With the recent changes to Power BI, Microsoft has unleashed the most exciting capabilities in the Microsoft BI space in a very long time. Power BI not only has the ability to promote self-service BI, it also has far greater implications for how we deliver BI projects in an agile manner. Power BI allows us to […]