Professional Documents
Culture Documents
Bioinformatics
Bioinformatics
An overview
Soumitra Nath
mail: nath.soumitra1@gmail.com
What is Bioinformatics?
“The field of science in which biology,
Biological Computer
Data + Calculations
OUR Objectives
21ST centaury
1960
1965
First protein structure
1970
1975
1980
1985
1990
Yeast genome
2000
First human genome draf
The Human Genome Project
Initiated in 1986 Completed in 2003
CHIMP GENOME
Chimpanzees are similar to humans in so many
ways: they are socially complex, sensitive and
communicative, and yet indisputably on the animal
side of the man/beast divide. Scientists have now
sequenced the genetic code of our closest living
relative, showing the striking concordances and
divergences between the two species, and perhaps
holding up a mirror to our own humanity.
How humans
are chimps?
Functional sites
Annotation
Structure, function
CCTGACAAATTCGACGTGCGGCATTGCATGCAGACGTGCATG
CGTGCAAATAATCAATGTGGACTTTTCTGCGATTATGGAAGAA
CTTTGTTACGCGTTTTTGTCATGGCTTTGGTCCCGCTTTGTTC
AGAATGCTTTTAATAAGCGGGGTTACCGGTTTGGTTAGCGAGA
AGAGCCAGTAAAAGACGCAGTGACGGAGATGTCTGATG CAA
TAT GGA CAA TTG GTT TCT TCT CTG AAT ......
.............. TGAAAAACGTA
promoter TF binding site
CAAATTCGACGTGCGGCATTGCATGCAGACGTGCATG
Transcription
AAATAATCAATGTGGACTTTTCTGCGATTATGGAAGAA
Start Site
TTACGCGTTTTTGTCATGGCTTTGGTCCCGCTTTGTTC
GCTTTTAATAAGCGGGGTTACCGGTTTGGTTAGCGAGA
CAGTAAAAGACGCAGTGACGGAGATGTCTGATG CAA
GA CAA TTG GTT TCT TCT CTG AAT ..............................
......... TGAAAAACGTA
• 2-d chromatography
– First DNA Sequencing
– Obtained by acedemic researchers –
1970’s
–
• Chemical Procedure for Sequencing
– Developed by Alan Maxam and Walter
• Enzymatic Procedure
• Developed by Fredrick Sanger (1977)
• Pyro-Sequencing
– Currently the method of choice for most researche
indel
• Key aspect of sequence
Sequence U
comparison is
sequence alignment
•mismatch
• A sequence alignment
maximizes the number
of positions that are in
Sequence V match agreement in two
sequences
Copyright Conserved
2004 limsoonsites
wong
Phylogeny: An Example
• By looking at extent of conserved positions in the
multiple seq alignment of different groups of
seqs, can infer when they last shared an
ancestor
⇒Construct “family tree” or phylogeny
To the patient:
Better drug, better treatment
To the pharma:
Save time, save cost, make more $
To the scientist:
Better science