Introduction to Regulatory Genomics and Epigenomics

Lecture 1: Introduction to Regulatory Genomics and Epigenomics I: Introduction to the Biology of Gene Regulation 
Lecture 2: Introduction to Regulatory Genomics and Epigenomics II: Motif Finding
Lecture 3: Introduction to Regulatory Genomics and Epigenomics III: Hidden Markov Models in Gene Regulation 


This series of talks is part of the Algorithmic Challenges in Genomics Boot Camp


Speaker: Nadav Ahituv (UCSF), Uwe Ohler (MDC Berlin) & Yaron Orenstein (MIT)

The analysis and understanding of non-coding regulatory regions in the genome is a central area of computational biology. Nucleotide changes in gene regulatory elements can lead to various phenotypes, including morphological differences between species and human disease. In the first lecture, we will describe the different types of gene regulatory elements (promoters, enhancers, silencers, insulators), techniques to identify them and technologies that are used to functionally characterize them and better understand their regulatory code. The remaining lectures will focus on computational problems. With multiple biological mechanisms, a large search space and unknown underlying rules, algorithmic challenges arise at multiple points. For instance, a fundamental problem is the identification of non-coding regions that control the regulation of specific genes. 
 
Another classic problem is the identification of over-represented sequence motifs in such a set of DNA or RNA regulatory regions, which may indicate target sites of a specific regulatory protein. Popular algorithms have roots both in (unsupervised) machine learning (graphical models, expectation-maximization) as well as pattern matching and discrete search, and we will discuss the state of this research area in their context.