Gene Expression Data Analysis
Analýza genové exprese
This course presents the basic types of functional genetics data, such as RT-qPCR data, DNA chip profiling data, and high-throughput sequencing data. Students will learn how to preprocess, clean, and standardize data, and will become acquainted with the statistical and exploratory data analysis methods used to process multidimensional genomic data. Students will also learn how to interpret data using gene ontologies, and how to archive and visualize data. During exercises, students will apply what they have learned to real-life data and will master commonly used online resources and analytical tools.
Students will be able to:

  • Process RT-qPCR, DNA chip, and high-throughput sequencing functional genetics data.
  • Validate experimental results using exploratory data analysis.
  • Use statistical methods to analyse and interpret functional genetics data.
  • Design experiments effectively, taking into account the caveats of individual experimental approaches.
R: Zvárová J, Mazura I (eds.), Metody molekulární biologie a bioinformatiky, Karolinum, Praha 2013, ISBN: 978-8024621500

A: Dorak MT (ed.), Real-time PCR (Advanced Methods), Taylor & Francis 2006, ISBN: 978-0415377348

A: Gentleman R, Carey V, Huber W, Irizarry R, Dudoit S (eds.), Bioinformatics and Computational Biology Solutions Using R and Bioconductor, Springer 2005, ISBN: 978-0387251462

1. Introduction. Types of functional genetics data. Aims of the analyses.

2. RT-qPCR data preprocessing: Primer and probe design. Standard curve.

3. RT-qPCR data preprocessing: Amplification curve. Threshold cycle. Background correction. Data normalization.

4. Transcription chip data preprocessing: Noise removal. Data normalization. Relative and absolute quantification.

5. Transcription chip data preprocessing: Variance stabilization. Summarization of intensity values.

6. High-throughput sequencing: Reading and mapping.

7. Further applications: Analysis of single nucleotide polymorphisms and chromosomal aberrations. DNA methylation.

8. Exploratory data analysis: Dimensionality reduction. Clustering. Control points.

9. Linear models. Problem of test multiplicity.

10. Classification methods.

11. Design of experiments and randomization. Replication.

12. Annotation and archiving of results: Genome browsers and expression databases.

13. Biological interpretation: Gene Set Enrichment Analysis (GSEA). Databases of signalling pathways. Gene ontologies.

14. Integration with interaction data: Network analysis. Databases of interaction data.
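
As an illustration of the test multiplicity problem listed under topic 9, the following is a minimal sketch of the Benjamini-Hochberg adjustment, one common way to control the false discovery rate when many genes are tested at once. The syllabus does not state which correction the course teaches, so the choice of method, the function name, and the p-values are assumptions made for illustration only.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, keeping adjusted values monotone
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        candidate = p_values[i] * m / rank
        running_min = min(running_min, candidate)
        adjusted[i] = running_min
    return adjusted


# Hypothetical per-gene p-values (illustrative, not real data)
p_values = [0.001, 0.008, 0.039, 0.041, 0.60]
q_values = benjamini_hochberg(p_values)

# Genes still significant at a 5 % false discovery rate
significant = [q <= 0.05 for q in q_values]
print(list(zip(p_values, q_values, significant)))
```

In this toy example four of the five raw p-values fall below 0.05, but only two genes remain significant after adjustment, which is the effect the multiplicity problem refers to.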

Biochemistry, Molecular genetics

