UMD CMSC 423 - Homework #1 - D889654


School name University of Maryland, College Park

Course Cmsc 423- Bioinformatic Algorithms, Databases, and Tools

Pages 2

This preview shows page 1 out of 2 pages.


Homework 1
Handed out: 9/4/08
Due: 9/11/08

For this homework you will need to do a bit of research online. You’ll have to look up IUB/IUPAC codes for DNA ambiguity codes as well as conversion tables between amino-acids and the corresponding codons (genetic code). Please list your bibliographic (online, library, etc.) source for this information.

1. What is the reverse complement of the following DNA sequences? Note that the third sequence contains IUB/IUPAC ambiguity codes (representing sets of 2 or more nucleotides):

ACAGGATGTTCATAGGCATTCCTCAGACTACAGTCACTTGCTAAGAATCTGATTCAGATTCTTAGCAAGTGGCATGTCWAGACCTAMCYGACTCVGTAGGCCATG

2. What is the amino-acid sequence encoded in the following DNA sequence? (Assume the gene starts at the first start codon and ends at the first stop codon.)

TTCGAGGGGCATGTTTGTTGCTATGAATGATAATAAAACAATGCTTTTTATTCCGGGGGCAACCAATTAAGTAATTC

(Trivia: this is a piece of one of the plague’s - Yersinia pestis - toxic factors)

3. Match the following amino-acid sequence to the corresponding location in the DNA string shown below.

KLFALTAVALMGGTATGAAAAAACTAAAATTGTTTGCTCTTACAGCTGTAGCCCTAATGGGTGTTTCAGGTGTA

(Trivia: this is a piece of a bacterial rhodopsin gene - one of the genes involved in photosynthesis. Until 2000 it was believed that only plants were capable of photosynthesis. The discovery of bacterial rhodopsin was done computationally and is one of the advances made possible by genomic analysis).

4. Define the following biological terms (look them up on the Internet and write out a one sentence definition in your own words):

• Frameshift mutation
• Silent mutation

5. Write a simple parser for FASTA files. Specifically, write a program, in your favorite programming language, that reads in a FASTA file, then identifies all records that contain more than 500 nucleotides and outputs their identifiers to the screen, one per line.

Note, for this assignment you are not allowed to use any of the publicly available bioinformatics libraries, the entire code must be written using standard constructs from the programming language of your choice.

Note: The code you write must compile and run on the glue machines.

Deliverables: (i) Your source code; (ii) The output obtained by running your code on the file
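
For question 5, one possible shape of such a parser is sketched below in Python, using only the standard library. This is an illustration under stated assumptions, not a reference solution: the function names `read_fasta` and `long_record_ids` are made up for this sketch, the identifier is assumed to be the first whitespace-delimited token after the `>` character, and "more than 500" is read as strictly greater than 500.

```python
import sys


def read_fasta(lines):
    """Yield (identifier, sequence_length) for each record in a FASTA stream."""
    identifier = None
    length = 0
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if identifier is not None:
                yield identifier, length
            # Assumption: the identifier is the first token after '>';
            # anything after the first whitespace is a free-text description.
            fields = line[1:].split()
            identifier = fields[0] if fields else ""
            length = 0
        elif identifier is not None:
            # Sequence data may span several lines; add up the pieces.
            length += len(line)
    if identifier is not None:
        yield identifier, length


def long_record_ids(lines, min_length=500):
    """Return identifiers of records with strictly more than min_length letters."""
    return [ident for ident, n in read_fasta(lines) if n > min_length]


if __name__ == "__main__":
    # Hypothetical invocation: python fasta_filter.py sequences.fasta
    if len(sys.argv) == 2:
        with open(sys.argv[1]) as handle:
            for ident in long_record_ids(handle):
                print(ident)
```

The sketch tracks only the running length of each record rather than storing the sequence, since the task needs nothing more than the count.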
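
Questions 1 to 3 can be checked mechanically once the lookup tables have been found. The Python sketch below is a hedged illustration on toy inputs, not the assignment's intended method: the function names are hypothetical, the complement table follows the usual IUB/IUPAC pairing (R with Y, K with M, B with V, D with H, while S, W and N map to themselves), the codon table is the standard genetic code, and `find_protein` searches the forward strand only.

```python
from itertools import product

# IUB/IUPAC complement table: each code maps to the code for the
# complementary set of nucleotides.
COMPLEMENT = str.maketrans("ACGTRYSWKMBDHVN", "TGCAYRSWMKVHDBN")

# Standard genetic code, codons enumerated in T, C, A, G order; '*' is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino
    for codon, amino in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def reverse_complement(dna):
    """Complement every base (ambiguity codes included), then reverse."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate_frame(dna, frame=0):
    """Translate every complete codon from offset `frame`; unknown codons give 'X'."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(frame, len(dna) - 2, 3)
    )


def translate_first_orf(dna):
    """Translate from the first ATG up to, but not including, the first in-frame stop."""
    start = dna.upper().find("ATG")
    if start == -1:
        return ""
    return translate_frame(dna, start).split("*")[0]


def find_protein(dna, protein):
    """Return the 0-based DNA offset where `protein` is encoded, or -1.

    Assumption: only the three forward-strand reading frames are searched.
    """
    for frame in range(3):
        index = translate_frame(dna, frame).find(protein)
        if index != -1:
            return frame + 3 * index
    return -1
```

For example, `reverse_complement("ACM")` gives `"KGT"`, because M (A or C) pairs with K (T or G), and `translate_first_orf("CCATGAAATTTTAGGG")` gives `"MKF"`.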
