Human Genome Salient Features and Its Applications
Genetic constitution of an individual lies in the sequence of DNA nucleotides. The differences in two individuals will naturally be reflected in the differences of their nucleotide sequences. They can be known only if the entire human genome is mapped. As the young science of genetic engineering developed, the technique to isolate and clone DNA fragments evolved. A fast technique of determining DNA sequence was also found out. In 1990, an ambitious project of mapping human genome was launched.Human genome project or HGP was a project of sequencing the entire human genome and find out its characteristic features. It was a mega project with an estimated cost of 9 billion dollars (3 dollars for a base pair with 3 x 109 base pairs). The data required its storage which would entail conventional entry in some 3300 books each with 1000 pages and each page having 1000 typed letters. However, high speed computational devices for storage, retrieval and analysis of data made it easier to do the same.The science of bioinformatics also developed and helped HGP. The project was estimated to be completed in thirteen years. It was coordinated by U.S. Department of Energy and National Institutes of Health. Soon Wollcome Trust (U.K.) joined the project as a major partner. France, Japan, Germany and China also contributed. The project of human genome sequencing (first phase) was completed on June 26, 2000.
Goals
- Sequencing. To sequence the whole human genome and find out the entire range of base pairs.
- Storage. To store the information in data bases.
- Analysis. To develop and conduct computer based analysis.
- Genes. Indentify all the genes (approximately 20,000-30,000).
- Technology. Transfer technologies developed during the project to other sectors like industry,agriculture and forestry.
- ELSI. To solve any ethical, legal and social issues (ELSI) that may arise from the project.
Methodologies. There are two approaches for sequencing human genome. (1) Expressed Sequence Tags (ESTS). Identify all the genes that are expressed as RNAS and sequence the same. (ií) Sequence Annonation. The whole genome, including both coding and noncoding regions is first sequenced. Later, functions are assigned to different regions. In HGP, sequence annonation has been carried out. (1)The whole DNA of a cell was isolated. (ii) DNA was broken into random fragments of relatively smaller sizes. (iii) The fragments were separated. (iv) The fragments were attached to vectors like BAC (bacterial artificial chromosomes) and YAC (Yeast artificial chromosomes). They were cloned in bacterial or yeast cells. Alternately, the fragments could be amplified through PCR (polymerase chain reaction). (v) The fragments were sequenced by using automated DNA sequences developed on the principle of a method developed by Fraderick Sanger (Nobel Prize for it, as well as the earlier Nobel Prize for sequencing insulin). (vi) The sequences were then arrange with the help of some overlapping fragments. The alignment of the sequences was carried out with the help of special computers based programmes. (vii) The sequences were later annonated and assigned to individual chromosomes (22 autosomes, X and Y chromosomes). The last annonation was for chromosome I. It was completed in May, 2006. (viii) Assignment of genetic and physical maps on the genome was carried out with the help of information on polymorphism of restriction endonuclease sites and microsatellites or repetitive DNA sequences.
Salient Features of Human Genome
(Revelation of HGP)
- Human genome contains 3- 1647 billion base pairs. Fruitfly Drosophila melanogester has 132 million base pairs or roughly 1/24 of total length of human genome.
- The total number of genes is about 30,000. The number is the same as present in mouse, double the number in fruitfly and about 6 times the genes contained in Escherichia coli. Previously it was believed that human beings carry 80,000-140,000 genes.
- The size of genome or number of genes is unconnected with the complexity of body organisation. Lily has 18 times more DNA than human genome. Inspite of this fact, Lily produces fewer proteins than a human being
- The function of over 50% of the discovered genes is unknown.
- Less than 2e of the genome contains coded information for protein synthesis.
- By differential spicing, a gene may code for two or more proteins.
- Part of DNA which contains repeated sequences is called satellite DNA. Satellite DNA is of several types. One of them is moderately repetitive sequences of 150- 300 base pairs long. They include transposons and Alu elements. There are several thousand Alu repeats in human genome Mini-satellite sequences are 11-60 base pairs long hypervariable repeat sequences. They are used in DNA finger printing. Over a million copies of simple sequence repeats (SSR) microsatellites occur in the genome. They have 5-8 pairs. These sequences are clustered around centromere and other areas of constitutive heterochromatin. These repeated sequences are called junked DNA. Repetitive sequences have no direct coding functions. They, however, provide information about chromosome structure, dynamics and evolution.
- 99.9% of DNA of human population is identical so that genomically speaking all human beings are exactly the same. The genetic difference is shared by various ethnic groups or races found in various geographical regions and the variability that allows us to differentiate between Ram and Shyam or Sita and Gita is 0-1% of the gene.
- The length of different human genes varies widely. The average size is 3000 bp. B-globin Or insulin genes are less than 10 kilo base pairs long. The longest gene of human body is that of Duchenne Muscular Dystrophy on X chromosome. It is 24,000 kilo base pair long. The smallest gene is that of TDF (Testis Determining Factor), a holandric gene, present on ‘Y’ chromosome. It is only 14 base pairs long.
- Chromosome Y carries the minimum number of genes (231). Chromosome I has the most gene(2968)
- About 1-4 million single base DNA differences or single nucleotide polymorphisms (SNP pronounced as ‘snips”) occur in humans. This will be helpful in finding chromosomal locations with disease associated sequences and tracing human history.
Applications and Future Challenges
(Prospects and Implications of Human Genome Project)
1 Completion of first phase of human genome project has been compared to discovery of antibiotics because it has opened a vast database of knowledge about various aspects of human
- Soon we shall be mapping all the human genes, Alu sequences, transposons and junk DNA.
- There are more than 1200 genes that cause common cardiovascular ailments, endocrine diseases like diabetes, Alzheimer’s disease and other neurological ailments, cancers, etc. After taking ther snapshots, it will be possible to know the method to alter them and remove the possibility of the disorders.
- Single gene defects produce a number of hereditary diseases. Their exact location, reason for defect and method of correcting the same can be found out.
- Studying the genes that will cause reversion of cancerous cells to normal cells.
- It will be possible to study interactions between various genes, proteins and as well as mechanism of forming tissues, organs, tumours or switch over to different developmental stages.
- It holds the promise of healthier and longer living, designer drugs and genetically modified diets according to needs of individual human beings.
REFERENCES