This course offers an in-depth overview of statistical methods for biological data analysis. The course consists of a series of lectures, demonstrations and computer laboratories that cover good practice in statistics and biological data analysis. Topics include general and generalized linear models, categorical data analysis, parametric and non-parametric statistics, and multivariate statistics.
The course consists of two major parts:
Part I (non-compulsory)
This part is organised together with master course BI300F during the last week of Augustas an intense training week in basic biological data analysis. Attending this part is strongly recommended if you haven’t had a similar course before, or if you want to refresh your basic data analysis skills.
Part II (compulsory)
This part covers advanced biological data analysis, and follows after part I, with 1 afternoon session per week in September-October.
Advanced Biological Data Analysis - BIO9000 (part I)
1. Introduction to R 2. Pearson correlation 3. T-test 4. Simple linear regression 5. Model diagnosis and influential observations 6. One-way between group ANOVA 7. Multiple linear regression and interaction 8. Multiway between group ANOVA 9. ANCOVA 10. Nonparametric statistics 11. Analysis of contingency tables 12. Data visualisation
Advanced Biological Data Analysis - BIO9000 (part II)
1 - Complex ANOVA designs: Repeated measures ANOVA, Complex ANOVA designs, nested ANOVA and linear mixed models, Model selection
2 - Generalized linear models: Logistic regression, Generalized linear models, Generalized linear mixed models
3 - Advanced regression techniques: Polynomial regression, Nonlinear regresssion and nonlinear mixed models
4 - Multivariate statistics: PCA and biplot, Non-metric multidimensional scaling and cluster analysis
5 - Special topics I
6 - Special topics II
7 - Special topics III
Practical Information
Part I includes a series of videos which you have to watch before each session (flipped classroom style). The sessions themselves will focus a lot on data analysis in practise using the R/Rstudio software.
Part II consists of classical lectures. On the first four Mondays of part II, we cover complex ANOVA designs, generalized linear models, advanced regression techniques and multivariate statistics. On one of the last three Mondays, you will have to teach yourself. For this you choose a topic from the list of "special topics" (see below), and you prepare and deliver a 45 min lecture on this topic, combining theory + exercises. For this you don't have to start from scratch, as you will receive a powerpoint presentation and R script on the topic. So, you have to make sure that you understand the topic and the R-script, practise your presentation, and make sure that the R script is free of bugs.
The 10 special topics to choose from are:
1. Spline based regression techniques
2. Generalized additive models
3. Survival analysis
4. Nonparametric statistics, bootstrapping and permutation tests
5. Bayesian inference
6. Discriminant analysis
7. MANOVA
8. Canonical ordination
9. Experimental design
10. Power analysis