Introduction to R with applications to bioinformatics (2KV)

Course no.: 365.054
Lecturer: Sepp Hochreiter
Times/locations: Mon 13:45-15:15, room HS 14
Start: Mon 1.3.2010
Mode: KV, 2h, weekly
Registration: KUSSS

Lecture notes (Version Apr 30, 2010):

Files for final project:


Motivation:

This course shows how to approach and solve problems in bioinformatics and computational biology with tools supplied by R. The focus is on data analysis with machine learning methods and on visualizing the results of this analysis.

R is a free implementation of the S language, which statisticians and data analysts have used for two decades. R is probably the most widely used software tool for bioinformatics. It became popular due to its data handling (e.g. importing microarray data), statistical algorithms, machine learning / data modelling implementations, and integrated data visualization. One of the largest sources of R tools for bioinformatics is the Bioconductor Project (www.bioconductor.org), which will be utilized in this course. These days R is increasingly popular in machine learning even outside bioinformatics, e.g. for modelling the financial market or for forecasting.

R has the advantages:

Topics of the course:

R is installed with the following manuals:

These manuals are sufficient for this course.