KevCaz's Website

One of my resolutions for 2019 is to use docker to delve deeper into reproducible science. As explained in Wikipedia's article about docker:

Docker is a computer program that performs operating-system-level virtualization, also known as “containerization”.

Basically, instead of installing software in a way that very likely depends on the Operating System (OS) you are using, you install docker (OS specific) and then you run command lines (not OS specific) to install the set of containers (software) you need. Hence, docker provides you with a reproducible software installation! Carl Boettiger gives a very nice example of a fully reproducible data analysis pipeline that uses docker.

The steps to install docker on debian are thoroughly described here. A few months ago, I thought sudo apt-get install docker was enough, but I realized that it installs an old version of docker:

$ apt policy docker
docker:
 Installed: (none)
 Candidate: 1.5-2
 Version table:
    1.5-2 500
       500 http://debian.mirror.rafal.ca/debian testing/main amd64 Packages
    1.5-1+b1 500
       500 http://ftp.debian.org/debian stable/main amd64 Packages
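
To check this before installing anything, the candidate version can be pulled out of the policy output with a small helper. This is only a sketch: the function name candidate_version is made up for illustration, and I assume the output has the same layout as the listing above.

```shell
# Print the version apt would install for a package.
# Reads the output of `apt-cache policy <package>` (same layout as
# `apt policy <package>`) on standard input.
# NOTE: `candidate_version` is a hypothetical helper name, not part of apt.
candidate_version() {
    # The line of interest looks like " Candidate: 1.5-2";
    # awk splits on whitespace, so the version is the second field.
    awk '$1 == "Candidate:" { print $2; exit }'
}

# Usage on a Debian machine:
#   apt-cache policy docker | candidate_version
```

Using apt-cache rather than apt here avoids the warning apt prints when its output is piped into a script.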

After I completed all the steps to install the most recent stable version of docker, version 18.09 was installed.

$ docker -v
Docker version 18.09.0, build 4d60db4
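
A setup script can guard against ending up with an outdated docker again by comparing the reported version with a minimum one. The sketch below is one way to do it. The function names docker_version_number and version_at_least are hypothetical, and the comparison relies on sort -V, a GNU coreutils extension that is available on debian.

```shell
# Extract the bare version number from a line such as
# "Docker version 18.09.0, build 4d60db4" read on standard input.
# NOTE: both function names below are hypothetical helpers.
docker_version_number() {
    sed -n 's/^Docker version \([0-9][0-9.]*\).*/\1/p'
}

# Succeed when version $1 is greater than or equal to version $2.
# `sort -V` orders version strings; if the smaller of the two is $2,
# then $1 is at least $2.
version_at_least() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Usage on a machine where docker is installed:
#   if version_at_least "$(docker -v | docker_version_number)" 18.09.0; then
#       echo "docker is recent enough"
#   fi
```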

Then I updated my gist to set up my debian machine accordingly.

As I use R very frequently, my next step is to read An Introduction to Rocker: Docker Containers for R by Carl Boettiger and Dirk Eddelbuettel, and also the nice tutorial by Colin Fay he mentions in one of his recent tweets.