Results 1 to 7 of 7

Thread: counting base pairs

  1. #1

    Thread Starter
    New Member
    Join Date
    Nov 2010
    Posts
    1

    counting base pairs

    Dear all,

    how to count the base pairs of a DNA and protein sequence?
    What are the codings in VB?

  2. #2
    Super Moderator si_the_geek's Avatar
    Join Date
    Jul 2002
    Location
    Bristol, UK
    Posts
    41,974

    Re: counting base pairs

    Thread moved from the 'CodeBank VB6' forum (which is for you to post working code examples, not questions) to the 'VB6 and earlier' forum

  3. #3
    Next Of Kin baja_yu's Avatar
    Join Date
    Aug 2002
    Location
    /dev/root
    Posts
    5,989

    Re: counting base pairs

    How does a protein sequence (your data) look like?

  4. #4
    Cumbrian Milk's Avatar
    Join Date
    Jan 2007
    Location
    0xDEADBEEF
    Posts
    2,448

    Re: counting base pairs

    I'm no biologist, I've just had a quick peek at wikipedia, do you have a bunch of strings which you need to match up like "(5)ATCGATTGAGCTCTAGCG(3)" pairs with "(5)CGCTAGAGCTCAATCGAT(3)" (I think I've just twigged where the film GATTACA gets its name.)
    W o t . S i g

  5. #5

  6. #6
    PowerPoster
    Join Date
    Jul 2006
    Location
    Maldon, Essex. UK
    Posts
    6,334

    Re: counting base pairs

    OK, on the assumption that this is a serious question, and taking baja_yu's question into account, what data do you have (an example would be helpful) and what code have you already tried ?

  7. #7
    PowerPoster
    Join Date
    Jul 2006
    Location
    Maldon, Essex. UK
    Posts
    6,334

    Re: counting base pairs

    I've done some research and I suspect you've just worded your request slightly incorrectly.

    Are you looking to take a protein, work out the amino acids and RNA / DNA condons, and then decode into nucleotides and count them ?

    eg (Somewhat simplified)
    Peptide -> amino acids = Met Alo Trp Thr Ser -> mRNA = A U G G C C U G G A C U U C A
    -> DNA Condons(Antisense) = {T A C} {C G G} {T C C} {T G T} {T G T}
    -> nucleotides = T A C C G G T C C T G T T G T
    from which you can work out the Sense strand (the Compliment) giving you the Base Pairings
    T A C C G G T C C T G T T G T -> Antisense
    A T G G C C A G G A C A A C A -> Sense
    ie 15 Base Pairs in this example.

    EDIT: Further reading suggests (to me at least) that the number of DNA Base Parings will always be 3 X number of amino acids (Number of condons). RNA might be different due to the start and stop condons but I've still to get my head round that.
    Last edited by Doogle; Nov 26th, 2010 at 07:52 AM.

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •  



Click Here to Expand Forum to Full Width