At the end of this course you can:
- implement an integrative computational analysis to test a biological hypothesis using high-throughput (epi)genomic and proteomic data;
- write a simple Python program to read and process big, text-based data files;
- design and implement simple algorithms to gain biological insights from data;
- use Python in combination with command line tools to perform critical exploratory analysis of high-dimensional data;
- visualize complex biological data and analysis results at the genomic and network level to understand biological systems.
|
|
Technological advances in the fields of genomics and proteomics have accelerated the ease and speed of data collection. High-throughput instruments, such as DNA sequencers and mass spectrometers, generate large amounts of biological measurements. This has brought the goal of understanding gene regulation within a living cell at the systems levels much closer. However, to integrate and analyze these various big data sets, a quantitative approach to biology is needed.
In this course you will learn to apply the Python programming language in combination with the Pandas data analysis framework to analyze (epi)genomic and proteomic data. The course will deal with a complete view of data analysis, from reading and processing raw data files to interpretation and visualization within the biological context.
|
|