SubjectsSubjects(version: 978)
Course, academic year 2025/2026
  
Bioinformatic Data Processing - M320079
Title: Bioinformatické zpracování dat
Guaranteed by: Department of Biochemistry and Microbiology (320)
Faculty: Faculty of Food and Biochemical Technology
Actual: from 2025
Semester: winter
Points: winter s.:4
E-Credits: winter s.:4
Examination process: winter s.:
Hours per week, examination: winter s.:1/2, C+Ex [HT]
Capacity: unlimited / unlimited (unknown)
Min. number of students: unlimited
Qualifications: Expertise, Interdisciplinarity, Tools and instruments, Independence, Communication and presentation skills
State of the course: taught
Language: Czech
Teaching methods: full-time
Level:  
Note: course can be enrolled in outside the study plan
enabled for web enrollment
Guarantor: Strejček Michal Ing. Ph.D.
Examination dates   Schedule   
Annotation -
The course aims to introduce students to the fundamentals of processing small to medium-scale experimental data using automated tools. Emphasis is placed on cleaning raw data, transforming it, visualizing it, and exporting the results. Students will primarily work in the R language, with extensive use of the Tidyverse ecosystem. They will also learn the basics of using the shell (bash) and GitHub for version control and code sharing.
Last update: Strejček Michal (28.05.2025)
Course completion requirements -

Attendance at lectures is recommended but not required.

Last update: Strejček Michal (28.05.2025)
Literature -

Recommended:

  • Wickham, Hadley, Çetinkaya-Rundel, Mine, Grolemund, Garrett. R for data science, import, tidy, transform, visualize, and model data. : , , xxiii, 548 stran s. ISBN 978-1-4920-9740-2.
  • Wickham, Hadley. Advanced R. : , , s. ISBN 978-1-4665-8696-3.

Last update: Strejček Michal (28.05.2025)
Requirements to the exam -

The course is completed by submitting an individual project.

Last update: Strejček Michal (28.05.2025)
Syllabus -

1. Introduction to working with data

2–3. RStudio IDE, basics of R

4–6. Tidyverse – data transformation

7–9. Tidyverse – data visualization

10. Working with strings (Stringr, regular expressions)

11–12. Basics of Linux system administration

13. Remote access and installation of software tools (Anaconda, Pip, GitHub)

14. Project consultation

Last update: Strejček Michal (28.05.2025)
Learning outcomes -

Students will acquire foundational skills in working within a Linux environment and in data processing and visualization in R, with emphasis on the concept of “tidy data” and the Tidyverse ecosystem.

Last update: Strejček Michal (28.05.2025)
 
VŠCHT Praha