Bioinformatics Education and Tutorials

image

Introduction

When we understand genetic sequences DNA, RNA and protein, plus how they relate to each other, how DNA acts as an information database on how to build all living things, we can start to ask deeper questions about a heritage, infections, allergy, diseases in general, genetic mutations, relationship of all species, how to increase food crop yield, how to design personalised medicine, tailor make genes and so on. The list is long.

The tutorials emphasize the underlying fundamental principles and ideas instead of first focusing on many details. The understanding of the concepts is the key to the learning process. One can always look up the details anytime, besides we would not remember all of them either.

Each tutorial is a self-contained entity. Because bioinformatics is a cross-disciplinary field, we have included pointers to relevant Biology and Biomedicine sections, e.g., such as immunology, microbiology, virology, genetics, infectious diseases and population biology.

Furthermore, each tutorial includes a list of prerequisites with links to those sections. Last but not least, many tutorials contain a wealth of images, animations or videos as required to enhance the learning experience and make it enjoyable.

The best part of it is that you can use the material for free for any non-commercial purpose. Go ahead and explore!

Don’t forget to study the history also! Why? Learn. To both study and value, the steps pioneers daringly took to where no one has gone before, to open new doors. Doors to brand new places for others to explore. To change the world for generations to come.

We label each tutorial as Starter S , Basic B , Intermediate I or Advanced A level.
image
History S

History from the 19th century to the 21st century. From Mendel and Miescher to the birth of bioinformatics and the completion of the human genome. The bold steps taken by great scientists to the unknown.

image
Introduction to sequence comparison S

A starter level primer introduces the ideas of a match, mismatch, gap, insertion, deletion, indel, global and local alignments.

image
Pair-wise sequence alignment B

A basic level tutorial, introducing DNA and protein sequence alignments, substitution matrices and discusses the bases of sequence similarity.

image
Pair-wise sequence alignment methods I

An intermediate level tutorial, introducing pair-wise global (Needleman-Wunch) and local (Smith-Waterman) sequence alignment methods without advanced mathematics and discusses the main implementation aspects.

image
R Tutorial Series by Dr. Khang A

Part one: Whole-genome viral phylogeny estimation without sequence alignment. Complete code and input data downloadable.

image
Construction of substitution matrices I

An intermediate level tutorial of BLOSUM and PAM substitution matrices describe their detailed, step by step construction.

image
DNA Sequence Alignment How to I

Detailed step-by-step instructions on how to construct DNA scoring matrices and how to optimize them to target different sequence similarity levels. After this tutorial, you can design your DNA scoring matrices.

image
How to select the right substitution matrix? I

A comprehensive tutorial explains the effects of selection of scoring matrices for pair-wise sequence alignments and database searches. This tutorial also explores the impact of a 'wrong' scoring matrix selection and how all scoring schemes have explicit or implicit optimal target similarity.

image
Homology, analogy, similarity I

This tutorial is an intermediate level tutorial on homology. We cover homology classes, hierarchies, analogous processes, and the concept of deep homology, among others.

image
Introduction to Information Theory and Its Applications to DNA and Protein Sequence Alignments B

A basic level tutorial describes what information and entropy is. Detailed step-by-step instructions on how to calculate information and entropy content for DNA and protein sequences. A basic introduction to sequence logos and their relation to information contained in multiple sequence alignments.

image
Multiple sequence alignment (MSA) Methods I

Currently working on this tutorial. I am planning to complete this by the end of the next week.
Dr M

image
The Principles of WGS Sequencing and Automated Fragment Assembly I

Although I wrote this a long time ago, the article provides a fascinating insight into the era when the International Human Genome Sequencing Consortium and Celera Genomics reported the drafts of the human genome project and later completed by April 2003.

image
Introduction to sequence assembly S

A starter-level tutorial. It introduces the essential principles of hows and whys of the sequence assembly.

image
Genome assembly Quality Metrics I

A sequence assembly can go wrong in many ways. Thus, we must assess both the correctness and completeness of an assembly. We go through the most common assembly quality metrics. N50, k-mers, and BUSCO.

image
Sequence Assembly Practical I

In this practical, we learn the intricacies of assembling a ~5 Mb haploid genome using long read PacBio sequencing technology.

image
Genome Sequencing S

Genome is the blueprint of life.
Coming soon...

image
Genome assembly II I

Coming soon...

image
Genome assembly III A

Coming soon...

image
Phylogeny I B

Coming soon...

image
Critical thinking I

Coming soon...

image
The scientific method B

Coming soon...
"The creative process moves through five stages. It begins with preparation–an analytical time when the basic information or skills are assembled. It continues on to incubation–a more intuitive and subconscious time in which you connect the dots in a default state. If you stick with it through perspiration, this process will eventually lead to revelation–the eureka experience when you literally feel the tumblers of your mind click into place and you say: ‘A-ha, I have found the solution!’ The creative process ends with production, a time when the insights are put into a useful form and shared with others." View Source