Monthly Archives: June 2014

Three statistics courses at the University of Southampton, UK

Highland Statistics Ltd. will provide three statistics courses at the University of Southampton, UK:

  1. Data exploration, regression, GLM & GAM with introduction to R. 23 – 27 March 2015.
  2. Introduction to Bayesian statistics and MCMC. 8 – 10 April 2015.
  3. Introduction to Linear Mixed Effects Models and GLMM with R. 13 – 17 April 2015.

Dr Martin Solan

The courses are organized by Dr. Martin Solan, who writes: "Recent statistical advances provide opportunity to interrogate established understanding and accepted theory, whilst also allowing researchers to generate novel research questions that were not previously answerable using less sophisticated statistical routines. At Ocean and Earth Science, University of Southampton, we believe that developing competency in applying a portfolio of statistical tools is integral to achieving high profile and high impact science, and it is with this focus that I am delighted to host a series of statistical courses run by Highland Statistics through 2014, 2015 and beyond."

The first course is a repeat of the March 2014 course that Highland Statistics ran at the University of Southampton (there was a waiting list, so don’t wait too long to register). The course starts with data exploration following Zuur et al. (2010). This is actually one of the most downloaded papers in MEE! Quite often people think that multiple linear regression is fitting a straight line through a cloud of observations. Wrong! You can easily model non-linear patterns using linear regression techniques! Once we have explained multiple linear regression (i.e. interactions, model validation, model interpretation, the philosophies of model selection), the rest of statistics is a piece of cake! GLMs and GAMs are all extensions of regression.
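To make the point about non-linear patterns concrete, here is a minimal sketch in base R. The data are simulated and the variable names (`x`, `y`, `fit_line`, `fit_quad`) are made up for illustration; this is not course material. Both models are "linear" regression because they are linear in the coefficients, even though the second one fits a curve.

```r
# Simulated data: a curved (quadratic) relationship plus noise.
set.seed(1)
x <- seq(-3, 3, length.out = 100)
y <- 1 + 0.5 * x - 2 * x^2 + rnorm(100, sd = 0.5)

# A straight line versus a model with an added quadratic term.
# Both are fitted with lm(); both are linear in the coefficients.
fit_line <- lm(y ~ x)
fit_quad <- lm(y ~ x + I(x^2))

# Compare how much variation each model explains.
summary(fit_line)$r.squared
summary(fit_quad)$r.squared

# Estimated coefficients of the curved model.
coef(fit_quad)
```

The quadratic model should explain far more of the variation than the straight line, and its estimated coefficients should be close to the values used in the simulation.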

You are competing with a large number of scientists for a small amount of space in scientific journals. Who likes statistics? Not your readers, not the referees of your manuscript, nor your line manager/PhD supervisor. We will show you how to increase your chances of getting your work published!

So you thought that one week of statistics is enough? The third course is about linear mixed effects modelling and generalized linear mixed effects models (GLMM). You need these techniques if you have multiple observations from the same animal, location, site, plant, tree, country, person, vessel, observer, you name it. Realistically speaking, this means that all you guys need mixed modelling! Before you enthusiastically sign up for this course, please read the rest of this blog!

Mixed effects models are essentially linear regression models (or GLMs) that contain a dependency structure. So, before signing up for this course, ensure you are familiar with R, data exploration, regression and GLM. The problem with mixed effects models is that the software to estimate these models can only cope with standard distributions (Normal, Poisson, binomial). But for some reason ecologists always manage to end up with highly complicated data sets and models: GLMs and GLMMs with temporal correlation, multiple nested random effects, crossed random effects, zero inflation, spatial correlation, etc. Unfortunately, standard packages in R can no longer be used for such models.
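For the simple case, a regression with a dependency structure can look like the sketch below. It assumes the `nlme` package (shipped with standard R installations) and uses simulated data with hypothetical names (`site`, `dat`, `fit`); the course itself may use different tools. Observations from the same site share a random intercept, which is what makes them dependent.

```r
library(nlme)

# Simulated data: 10 sites, 8 observations per site.
set.seed(2)
n_sites <- 10
n_per   <- 8
site     <- factor(rep(seq_len(n_sites), each = n_per))
site_eff <- rnorm(n_sites, sd = 2)          # one random shift per site
x <- runif(n_sites * n_per)
y <- 3 + 1.5 * x + site_eff[as.integer(site)] +
  rnorm(n_sites * n_per, sd = 0.5)
dat <- data.frame(y = y, x = x, site = site)

# Ordinary regression part (y ~ x) plus a random intercept per site.
fit <- lme(y ~ x, random = ~ 1 | site, data = dat)

fixef(fit)    # estimated intercept and slope
VarCorr(fit)  # between-site and residual variation
```

This is the kind of model that standard software handles well; the complications listed above (zero inflation, spatial correlation and so on) are where it stops being enough.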

So what do you do? The answer is MCMC. For years we thought that Bayesian statistics and MCMC were difficult things. Priors, posterior distribution, MCMC; they sound scary. However, the concept of Bayesian statistics is actually much easier than frequentist statistics (that is the stuff we do in the first course). We therefore decided to do the mixed modelling course with MCMC. With MCMC, the sky is the limit. You can fit almost any model!
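To take some of the scariness out of it, here is a toy sketch of the idea behind MCMC, written in base R. It is a hand-rolled random-walk Metropolis sampler for the mean of simulated Normal data, with an assumed Normal(0, 10^2) prior and a known standard deviation of 1; all names (`log_post`, `chain`, `post`) are invented for this illustration. It is not JAGS and not how the course fits models, but it shows the prior, the likelihood and the sampling in a few lines.

```r
# Simulated data with known standard deviation 1.
set.seed(3)
y <- rnorm(50, mean = 2, sd = 1)

# Log posterior of the mean mu (up to a constant):
# log prior Normal(0, 10^2) + log likelihood Normal(mu, 1).
log_post <- function(mu) {
  dnorm(mu, mean = 0, sd = 10, log = TRUE) +
    sum(dnorm(y, mean = mu, sd = 1, log = TRUE))
}

# Random-walk Metropolis: propose a small step, accept it with
# probability min(1, posterior ratio), otherwise stay put.
n_iter <- 5000
chain  <- numeric(n_iter)
chain[1] <- 0
for (i in 2:n_iter) {
  proposal  <- chain[i - 1] + rnorm(1, sd = 0.3)
  log_ratio <- log_post(proposal) - log_post(chain[i - 1])
  if (log(runif(1)) < log_ratio) {
    chain[i] <- proposal
  } else {
    chain[i] <- chain[i - 1]
  }
}

# Drop the first 1000 iterations as burn-in, then summarise.
post <- chain[-(1:1000)]
mean(post)                        # posterior mean of mu
quantile(post, c(0.025, 0.975))   # 95% credible interval
```

With such a vague prior the posterior mean should sit very close to the sample mean of `y`. Tools like JAGS do the sampling for you, so in practice you only write down the model.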

There are two problems with MCMC: (i) you need a fast computer and (ii) it is not taught in most undergraduate courses. As to the latter problem, the second course at Southampton University provides a 3-day introduction to Bayesian statistics and MCMC. It is more or less expected that participants who do the mixed modelling course also join the Bayesian statistics and MCMC course. Otherwise you will need to obtain the knowledge through self-study (Introduction to WinBUGS for Ecologists by Marc Kery is an excellent source, though he uses WinBUGS and we will be using JAGS, but the syntax is nearly the same). As to the first problem with MCMC: you need a decent computer, something that is less than 5 years old. Why are we using JAGS and not WinBUGS? Because my old MacBook didn’t like to run WinBUGS under Parallels (see picture below). JAGS is cross-platform, is free, and can be run from R using R2jags.


Bad idea: running WinBUGS via Parallels on a MacBook.