University of Warsaw - Central Authentication System
Strona główna

Algorithms for genomic data analysis

General data

Course ID: 1000-718ADG
Erasmus code / ISCED: 11.303 Kod klasyfikacyjny przedmiotu składa się z trzech do pięciu cyfr, przy czym trzy pierwsze oznaczają klasyfikację dziedziny wg. Listy kodów dziedzin obowiązującej w programie Socrates/Erasmus, czwarta (dotąd na ogół 0) – ewentualne uszczegółowienie informacji o dyscyplinie, piąta – stopień zaawansowania przedmiotu ustalony na podstawie roku studiów, dla którego przedmiot jest przeznaczony. / (0612) Database and network design and administration The ISCED (International Standard Classification of Education) code has been designed by UNESCO.
Course title: Algorithms for genomic data analysis
Name in Polish: Algorytmy analizy danych genomicznych
Organizational unit: Faculty of Mathematics, Informatics, and Mechanics
Course groups: (in Polish) Przedmioty z technologii w skali genomowej dla bioinformatyki
Elective courses for Computer Science
ECTS credit allocation (and other scores): 6.00 Basic information on ECTS credits allocation principles:
  • the annual hourly workload of the student’s work required to achieve the expected learning outcomes for a given stage is 1500-1800h, corresponding to 60 ECTS;
  • the student’s weekly hourly workload is 45 h;
  • 1 ECTS point corresponds to 25-30 hours of student work needed to achieve the assumed learning outcomes;
  • weekly student workload necessary to achieve the assumed learning outcomes allows to obtain 1.5 ECTS;
  • work required to pass the course, which has been assigned 3 ECTS, constitutes 10% of the semester student load.

view allocation of credits
Language: Polish
Short description:

Algorithmic problems and methods of analysis of high-throughput sequencing data and other large-scale experimental techniques of modern genomics. Topics will include the problems of mapping reads to reference genomes, reconstructing sequenced genomes from reads, classifying and quantifying reads. Methods handling data from different experiments and sequencing technologies, as well as approaches using different types of data together will be presented.

Full description:

1. Mapping of sequencing reads

◦ pattern matching algorithms, text indexing

◦ approximate pattern matching based on text indexes

◦ techniques for finding approximate occurrences of a pattern with low similarity

2. Structural variant calling

◦ based on sequencing reads

◦ based on optical mapping data

3. RNA-seq data processing

◦ read mapping vs determination of k-mer spectrum

4. Metagenomic data analysis

◦ composition- and homology-based read classification

◦ linked reads deconvolution

5. De novo genome assembly

◦ Overlap-Layout-Consensus approach

◦ de Bruijn graphs approach

◦ contig merging and scaffolding

6. Pangenomics

◦ pangenome models and their construction methods

◦ pangenome-based sequencing data analysis

Bibliography:

V. Mäkinen, D. Belazzougui, F. Cunial, A. Tomescu, Genome-Scale Algorithm Design. Cambridge University Press 2015.

X. Wang, Next-Generation Sequencing Data Analysis, CRC Press 2016.

Learning outcomes:

Knowledge:

- knowledge of algorithmic techniques used in DNA sequence analysis

- knowledge of methods of analysis of high-throughput DNA sequencing data

Skills:

- the ability to choose the proper sequencing technique for a given biological problem

- the ability to properly design experiments using large-scale genomic technologies and to analyze the output data

- the ability to implement selected algorithms for the analysis of data from next generation sequencing

Competences:

- knows the limitations of his own knowledge, is able to formulate questions to deepen the understanding of the issue under consideration

- understands the need for a critical analysis of the study he created

Assessment methods and assessment criteria:

Final assesment is based on lab projects and (optionally) oral exam.

Classes in period "Winter semester 2023/24" (past)

Time span: 2023-10-01 - 2024-01-28
Selected timetable range:
Navigate to timetable
Type of class:
Lab, 30 hours more information
Lecture, 30 hours more information
Coordinators: Norbert Dojer, Aleksander Jankowski
Group instructors: Norbert Dojer, Aleksander Jankowski
Students list: (inaccessible to you)
Examination: Examination

Classes in period "Winter semester 2024/25" (future)

Time span: 2024-10-01 - 2025-01-26
Selected timetable range:
Navigate to timetable
Type of class:
Lab, 30 hours more information
Lecture, 30 hours more information
Coordinators: Norbert Dojer, Aleksander Jankowski
Group instructors: Norbert Dojer, Aleksander Jankowski
Students list: (inaccessible to you)
Examination: Examination
Course descriptions are protected by copyright.
Copyright by University of Warsaw.
Krakowskie Przedmieście 26/28
00-927 Warszawa
tel: +48 22 55 20 000 https://uw.edu.pl/
contact accessibility statement USOSweb 7.0.3.0 (2024-03-22)