My DNA on my iPod?

If I wanted to store all the information contained in my DNA, how much space would it take up? Could I copy it onto my iPod and take it everywhere with me?

DNA is a double-stranded molecule. That means it is composed of two strands, i.e. two sequences of nucleotides (which, once again, are represented by the four letters A, C, G and T). Those two strands are complementary and face each other. The natural unit for measuring the length of DNA is the “base pair”, or “bp”. A base pair is just one nucleotide and its complement on the other strand.

The human genome contains about 3 billion base pairs. Since there are four nucleotides, storing the genome in a computer format means translating each letter into binary code, i.e. into 1’s and 0’s.

For example, let’s say:

  • A=00
  • C=01
  • G=10
  • T=11
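To make the table above concrete, here is a minimal sketch in Python of how a sequence could be packed four letters to a byte using exactly those codes. The names `CODES`, `pack` and `unpack` are made up for this illustration, and real genome file formats handle many details (unknown bases, headers, and so on) that are ignored here.

```python
# 2-bit codes taken from the list above.
CODES = {"A": 0b00, "C": 0b01, "G": 0b10, "T": 0b11}
LETTERS = "ACGT"  # LETTERS[code] gives the letter back


def pack(seq):
    """Pack a string of A/C/G/T into bytes, four nucleotides per byte."""
    out = bytearray((len(seq) + 3) // 4)  # round up to a whole byte
    for i, letter in enumerate(seq):
        shift = 6 - 2 * (i % 4)  # first letter lands in the top two bits
        out[i // 4] |= CODES[letter] << shift
    return bytes(out)


def unpack(data, length):
    """Recover the first `length` letters from packed bytes."""
    letters = []
    for i in range(length):
        shift = 6 - 2 * (i % 4)
        letters.append(LETTERS[(data[i // 4] >> shift) & 0b11])
    return "".join(letters)


packed = pack("GATTACA")
print(len(packed))        # 7 letters fit in 2 bytes
print(unpack(packed, 7))  # GATTACA
```

The length has to be stored separately, because the last byte may be only partly filled.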

Two bits are needed to code four letters, so each letter requires two bits of information. Eight bits make a byte, which is the usual unit of computer memory size. Don’t be afraid of the two lines of utterly simple math that lie below… In bytes, my genome requires:

3,000,000,000 bp x 2 bits per bp / 8 bits per byte = 750,000,000 bytes

A megabyte, or MB, is 2^20 bytes. In MB, my genome would need:

750,000,000 / 2^20 = 715.2557 MB
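For anyone who would rather let the computer do the two lines of math, the same calculation can be sketched in Python. The variable names are only illustrative, and the 3 billion figure is the rounded value used above.

```python
BASE_PAIRS = 3_000_000_000   # rounded size of the human genome, in bp
BITS_PER_NUCLEOTIDE = 2      # four letters need two bits each
BITS_PER_BYTE = 8

size_bytes = BASE_PAIRS * BITS_PER_NUCLEOTIDE // BITS_PER_BYTE
size_megabytes = size_bytes / 2**20  # one megabyte taken as 2^20 bytes

print(size_bytes)                # 750000000
print(round(size_megabytes, 4))  # 715.2557
```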

We can now answer the initial question: YES! I could store my whole genome on any iPod, even the smallest one. However, it would not fit on a standard CD-R, which holds only 700 MB.

For the most tenacious among you, I should point out that here we made the (sound) assumption that we wanted to record only one of the two strands of the DNA molecule. In fact, the two strands are redundant, so it is useless to store both of them. But if you wanted to do so, note that the smallest model of iPod (the 1 GB Shuffle) would not be able to store your genome!
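Here is a small sketch of why the second strand is redundant, and of what storing it anyway would cost. It assumes the usual pairing (A with T, C with G), takes one gigabyte as 2^30 bytes, and ignores the direction in which each strand is read. The names are hypothetical.

```python
# Each nucleotide pairs with exactly one partner on the other strand.
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def complement(strand):
    """Rebuild the facing strand, position by position (orientation ignored)."""
    return "".join(COMPLEMENT[n] for n in strand)


print(complement("GATTACA"))  # CTAATGT

one_strand_bytes = 3_000_000_000 * 2 // 8   # 2 bits per nucleotide
both_strands_bytes = 2 * one_strand_bytes
ipod_shuffle_bytes = 1 * 2**30              # assumption: 1 GB = 2^30 bytes

print(both_strands_bytes)                       # 1500000000
print(both_strands_bytes > ipod_shuffle_bytes)  # True
```

Since the second strand can always be recomputed from the first, storing it doubles the size without adding any information.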

Be cautious when choosing your next iPod!

June 19, 2008 at 6:02 pm

