These include obtaining the remaining sequence and correcting errors (proofreading the genome), filling sequence gaps and then sequencing the 7-15 per cent of the genome that contains heterochromatin.
Heterochromatic regions of the genome were not sequenced initially because they contain long stretches of repetitive DNA sequences. Further, it was initially considered that heterochromatin does not contain genes. But the genome sequence of Drosophila revealed that heterochromatic regions do contain a small number of genes (about 50 in Drosophila).
As a result of this discovery, heterochromatic regions of the human genome have to be sequenced to ensure that all the genes in the human genome are identified. Once the genome of an organism is sequenced, compiled, and proofread, the next stage of genomics, viz., annotation, begins.