Unformatted text preview:

Bioinformatics,Bi1X‐2010Part,I:,Public,databasesArbel TadmorOverview• Obtaining,sequence,data,from,the,internet• Aligning,two,sequences• Using&the&biologist’s&google:,BLASTWe’ll&use&hemoglobin&as&a&case&studyRelated,reading:,Ch.,6,of,Stryer (Biochemistry,,7thedition),First,stop:,wikiStructure,of,human,hemoglobi nα&subunit(141,aa)β&subunit(146,aa)Iron‐containing,heme group• In&adults&hemoglobin&is&a&tetramer:&α2β2• Each,subunit,contains,a,non‐protein,heme group,(that,holds,an,iron)• The,iron,binds,to,an,oxygen,(shifting,absorbance,from,blue,to,red)• Multiple,subunits,give,rise,to,cooperatively,in,oxygen,binding,and,unbinding,allowing,this,protein,to,release,more,oxygen,in,the,tissues,(making,it,a,good,transporter.,Further,reading,Ch.,7,of,StryerLet’s&focus&on&the&α1,subunit,of,hemoglobin.,This,gene,is,called,HBA1On,which,chromosome,is,this,gene?Let’s&look&up&this&gene• Go,to,the,PubMed website:,google pubmed (http://www.ncbi.nlm.nih.gov/PubMed/)• Search,for,HBA1 under,gene categoryclickWe’ll&start&with&human&hemoglobi nclickRefSeq accession,number,XX_#cattlehumanfrog:etc.More,about,the,RefSeq database&here…&http://www.ncbi.nlm.nih.gov/bookshelf/br.fcgi?book=handbook&part=ch18http://www.ncbi.nlm.nih.gov/pmc/articles/PMC539979/pdf/gki025.pdfWhich,sequence,should,we,pick?,Genomic?,mRNA?,Protein?Genomic,view,of,HBA1Genomic,context,of,HBA1clickWhere,HBA1,falls,on,chromosome,16…our,genechromosome,16transcriptomeGenomic,contextclickWhat&is&the&length&of&…• The,genomic,sequence,,(NC_...)?• The&mRNA&sequence&(NM_…)?• The&protein&sequence&(NP_…)?&Hint:&Double&click&on&gene&then&Right&click→&properties…RefSeq codes:NC_12345 6 Genomic Mixed,Complete,genomic,molecules,including,genomes,,chromosomes,,organelles,,plasmids.NM_123456 mRNA Mixed,Transcript,products;,mature,messenger,RNA,(mRNA),transcripts.,NP_123456NP_123456789 Protein Mixed,Protein,products;,primarily,full‐length,precursor,products,but,may,include,some,partial,proteins,and,mature,peptide,products.double-clickGenomic,vs.,mRNA,vs.,protein,sequences,NM_000558.3: mRNA-hemoglobin, alpha 1total range: NC_000016.9 (226,679..227,520)total length: 842 processed length: 576mRNA product length: 576mRNA join(226679..226810,226928..227132,227282..227520)/gene="HBA1"/product="hemoglobin, alpha 1"/note="Derived by automated computational analysis usinggene prediction method: BestRefseq."/transcript_id="NM_000558.3"/db_xref="GeneID:3039"/db_xref="GI:14456711"HBA1total range: NC_000016.9 (226,679..227,520)total length: 842gene 226679..227520/gene="HBA1"/note="Derived by automated computational analysis usinggene prediction method: BestRefseq."/db_xref="GeneID:3039"/db_xref="HGNC:4823"/db_xref="MIM:141800"GenomicmRNANP_000549.1: alpha 1 globintotal range: NC_000016.9 (226,716..227,410)total length: 695processed length: 429 protein product length: 142CDS join(226716..226810,226928..227132,227282..227410)/gene="HBA1"/note="Derived by automated computational analysis usinggene prediction method: BestRefseq."/codon_start=1/product="alpha 1 globin"/protein_id="NP_000549.1"/db_xref="CCDS:CCDS10399.1"/db_xref="GeneID:3039"/db_xref="GI:4504347"proteinGenomic,vs.,mRNA,vs.,protein,sequences,NM_000558.3: mRNA-hemoglobin, alpha 1total range: NC_000016.9 (226,679..227,520)total length: 842 processed length: 576mRNA product length: 576mRNA join(226679..226810,226928..227132,227282..227520)/gene="HBA1"/product="hemoglobin, alpha 1"/note="Derived by automated computational analysis usinggene prediction method: BestRefseq."/transcript_id="NM_000558.3"/db_xref="GeneID:3039"/db_xref="GI:14456711"HBA1total range: NC_000016.9 (226,679..227,520)total length: 842gene 226679..227520/gene="HBA1"/note="Derived by automated computational analysis usinggene prediction method: BestRefseq."/db_xref="GeneID:3039"/db_xref="HGNC:4823"/db_xref="MIM:141800"GenomicmRNANP_000549.1: alpha 1 globintotal range: NC_000016.9 (226,716..227,410)total length: 695processed length: 429 protein product length: 142CDS join(226716..226810,226928..227132,227282..227410)/gene="HBA1"/note="Derived by automated computational analysis usinggene prediction method: BestRefseq."/codon_start=1/product="alpha 1 globin"/protein_id="NP_000549.1"/db_xref="CCDS:CCDS10399.1"/db_xref="GeneID:3039"/db_xref="GI:4504347"proteinQuestion,1Why,is,the,protein,product length,142x3,=,426bp,shorter,than,the,protein,processed length,=,429bp?The,stop,codon was,removedGenomic,vs.,mRNA,vs.,protein,sequences,NM_000558.3: mRNA-hemoglobin, alpha 1total range: NC_000016.9 (226,679..227,520)total length: 842 processed length: 576mRNA product length: 576mRNA join(226679..226810,226928..227132,227282..227520)/gene="HBA1"/product="hemoglobin, alpha 1"/note="Derived by automated computational analysis usinggene prediction method: BestRefseq."/transcript_id="NM_000558.3"/db_xref="GeneID:3039"/db_xref="GI:14456711"HBA1total range: NC_000016.9 (226,679..227,520)total length: 842gene 226679..227520/gene="HBA1"/note="Derived by automated computational analysis usinggene prediction method: BestRefseq."/db_xref="GeneID:3039"/db_xref="HGNC:4823"/db_xref="MIM:141800"GenomicmRNANP_000549.1: alpha 1 globintotal range: NC_000016.9 (226,716..227,410)total length: 695processed length: 429 protein product length: 142CDS join(226716..226810,226928..227132,227282..227410)/gene="HBA1"/note="Derived by automated computational analysis usinggene prediction method: BestRefseq."/codon_start=1/product="alpha 1 globin"/protein_id="NP_000549.1"/db_xref="CCDS:CCDS10399.1"/db_xref="GeneID:3039"/db_xref="GI:4504347"proteinQuestion,1Why,is,the,protein,product length,142x3,=,426bp,shorter,than,the,protein,processed length,=,429bp?The,stop,codon was,removedGenomic,vs.,mRNA,vs.,protein,sequences,NM_000558.3: mRNA-hemoglobin, alpha 1total range: NC_000016.9 (226,679..227,520)total length: 842 processed length: 576mRNA product length: 576mRNA join(226679..226810,226928..227132,227282..227520)/gene="HBA1"/product="hemoglobin, alpha 1"/note="Derived by automated computational analysis usinggene prediction method: BestRefseq."/transcript_id="NM_000558.3"/db_xref="GeneID:3039"/db_xref="GI:14456711"HBA1total range: NC_000016.9 (226,679..227,520)total length: 842gene 226679..227520/gene="HBA1"/note="Derived by automated computational analysis usinggene prediction method:


View Full Document

CALTECH BI 1 - Bioinformatics

Download Bioinformatics
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Bioinformatics and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Bioinformatics 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?