Genes, Genomes and Bioinformatics
Mandatory reading
- http://www.yourgenome.org/facts/what-is-a-genome
- http://en.wikipedia.org/wiki/Bioinformatics
- http://en.wikipedia.org/wiki/Genome
- http://en.wikipedia.org/wiki/Introduction_to_genetics
- http://en.wikipedia.org/wiki/Human_genome
- journal.pcbi.1002173
- What+is+a+genome_+_+Facts+_+yourgenome
Extra Reading
- http://collections.plos.org/translational-bioinformatics
- http://en.wikipedia.org/wiki/Saccharomyces_cerevisiae
- http://en.wikipedia.org/wiki/Genome_evolution
- http://en.wikipedia.org/wiki/Escherichia_coli
- http://en.wikipedia.org/wiki/Sequencing
Slides
Videos
- Introduction https://www.youtube.com/watch?v=wTlktv9YSxY
- Molecules of life https://youtu.be/imguwfQdEog
- Proteins https://youtu.be/cpQoAbzW390
- DNA SEQUENCING https://youtu.be/zHRDFosOa-k
- GENE PREDICTIONS https://youtu.be/pTXDyyzZVHs
- GENOMES https://youtu.be/5TAx-DZFrIE
- WHAT IS BIOINFORMATICS https://youtu.be/VYg7j8e9k3U
Discussion/lecture
Recommended watching on the web
https://www.youtube.com/watch?v=_eHz6qzTCfc
1. I would like to know more about ORFs. How are they exactly identified and what other factors are taken to account to identify the regions of genome coding for proteins?
2. Why DNA strand has 6 reading frames?
I would like to know more about the difference between eukaryotic and prokaryotic DNA, especially related to splicing.
Why does the Y cromosome contain about 20 times more genes than the X
cromosome in human?
Leucine has 6 st different codons. But some other amino acids have
only 2 codons.
Does the number of codons somehow reflect their relative frequency in
the genome?
what makes protein structure prediction actually so difficult? I understand that it has to do with the huge complexity of how amino acids can interact, but humans already have mastered complex problems. What is the main problem here?
Please explain the main ideas of “are there laws of genome evolution?”: for instance, the figure 1 and what does it mean by the distribution doesn’t change even the number of genes in compared organism differ?
THX
I would like to know more about how coding and non-coding regions are differenciated in the case of completely new sequences. The start/stop codon theory is clear but what about introns or retrotransposons?
1. Can we go trough 2nd generation sequencing because it was hard to understand it from the video.
2. How can we predict interactions between proteins.
I was wondering a bit about genome evolution and codon usage bias. Also, how could they relate to eachother?
I was also wondering if you could explain a bit more about the “laws of evolutionary genomics” discussed in the article.
1. For classification of organisms is it more common to compare sequence similarity between whole genome or specific regions and what are the advantages and pitfalls?
2. The different structrures of proteins (α , β etc) are predicted to appear equally often in the total of all proteins or is some type of structure favoured? Also, does each type of structure have a “preferance” regarding which other type of structure it is surrounded?
3. How accurate are the predictions of protein structure? Are there experimental data and of what kind that could give us hints of how successful a prediction was?
4. What is taken into account in order to predict gene expression in different cell types/tissues?
I have read something about codon bias in organisms : that even though one single amino acid can be coded by a number of different codons,in biology not all the codes are used with equal frequency in the genes of a specific species!
Can (and in that case how can) the quartenary structure of a protein be determined from the sequence?
What is the relation between crystallography and bioinformatics in order to predict a protein structre?
When sequencing the genome of an organism for the first time, how are the fragments of DNA put together?
Should the fragmentation procedure be done multiple times randomly, so that overlapping regions are put together?