SubjectsSubjects(version: 982)
Course, academic year 2026/2027
  
   
Fundamentals of Data Analysis - B323018
Title: Základy datové analýzy
Guaranteed by: Department of Food Analysis and Nutrition (323)
Faculty: Faculty of Food and Biochemical Technology
Actual: from 2026
Semester: both
Points: 4
E-Credits: 4
Examination process:
Hours per week, examination: 1/2, MC [HT]
Capacity: winter:unknown / unknown (unknown)Schedule is not published yet, this information might be misleading.
summer:unknown / unknown (unknown)Schedule is not published yet, this information might be misleading.
Min. number of students: unlimited
State of the course: taught
Language: Czech
Teaching methods: full-time
Level:  
Note: course can be enrolled in outside the study plan
enabled for web enrollment
you can enroll for the course in winter and in summer semester
Guarantor: Drábová Lucie Ing. Ph.D.
Classification: Mathematics > Probability and Statistics
Pre-requisite : B413003
Examination dates   Schedule   
Annotation -
In lectures and exercises, students will be introduced to currently used methods used to process data sets in the biological and food science disciplines. For this reason, the freely available software Rstudio, MetaboAnalyst, Python and MySQL are used for teaching. All methods will be practiced on real examples so that students will be able to use them in practice. The available selected software is suitable for both basic statistical tasks and database creation, as well as for subsequent more complex tasks such as multivariate statistical methods. Basic data processing methods are also practised in MS Excel. Emphasis is placed on the practical use of databases as a source of data for subsequent statistical processing and correct interpretation of the results obtained.
Last update: Drábová Lucie (14.04.2025)
Course completion requirements -

Attendance at lectures is encouraged but not monitored.

Individual statistical project.

Last update: Drábová Lucie (14.04.2025)
Literature -

Optional:

  • Stevens, Tim, Boucher, Wayne. Python programming for biology, bioinformatics and beyond. Cambridge: Cambridge University Press, 2015, viii, 702 s. s. ISBN 978-0-521-72009-0.
  • Jorgensen, Adam. Microsoft SQL server 2012 bible. Indianapolis: Wiley, 2012, s. ISBN 978-1-118-10687-7.

Last update: Drábová Lucie (14.04.2025)
Teaching methods -

The course consists of expert lectures with exercises where students work on solving real case studies.

Last update: Drábová Lucie (14.04.2025)
Requirements to the exam -

The prerequisite for obtaining credit is:

Completion of all assigned practical tasks in the required quality.

Preparation of an individual project and its defence.

Last update: Drábová Lucie (14.04.2025)
Syllabus -

1. Data analysis tools

2. Databases - types of databases and their use

3. Relational databases and working with them - creating a simple database in MySQL

4. SQL data analysis - SQL basics, “and” and “or” operators

5. Working with databases and basic data analysis - retrieving, cleaning and preparing data files for statistical data processing

6. Data analysis in Excel - using Data Analysis in MS Excel

7. Descriptive statistics - basic parameters of descriptive statistics, mean, variance, skewness and peak coefficient

8. Different types of analysis - parametric and non-parametric statistical tests

9. Introduction to multivariate statistical methods

10. Visualization - creating graphs, communicating results

11. Using Python and R in data analysis - introduction to Python, data types, working with data

12. Introduction to machine learning - statistics in machine learning, preparing data for a model

13. Advanced machine learning models - random forests, boosting and bagging, neural networks

14. Independent work - consultation, credit

Last update: Drábová Lucie (14.04.2025)
Learning resources -

e-learning/ Základy datové analýzy

Last update: Drábová Lucie (14.04.2025)
Learning outcomes -

Student will be able to:

Create a simple database and work with it.

Statistically process the obtained data files using MS Excel, freely available Rstudio and MetaboAnalyst software and interpret the results correctly.

Last update: Drábová Lucie (14.04.2025)
Registration requirements -

Applied Statistics

Last update: Cibulková Jana (25.02.2025)
Teaching methods
Activity Credits Hours
Obhajoba individuálního projektu 0.1 3
Účast na přednáškách 0.5 14
Příprava na přednášky, semináře, laboratoře, exkurzi nebo praxi 1.5 42
Práce na individuálním projektu 0.5 14
Účast na seminářích 1 28
4 / 4 101 / 112
Coursework assessment
Form Significance
Homework preparation 20
Defense of an individual project 80

 
VŠCHT Praha