In the process of characterizing putative genes, the genome under study becomes increasingly well understood as more interactions can be identified.
[5] Identification of putative genes is necessary to study genomic evolution, as significant proportion of genomes make up larger families of related genes.
Putative genes can be identified by clustering large groups of sequences by patterns and arranging by mutual similarity[8] or can be inferred by potential TATA boxes.
[10] Software tools have been developed in order to automatically identify putative genes.
[11] Protein products can be identified and used to characterize the putative gene that codes for it.