Recent genome-wide surveys on ncRNA have revealed that a substantial fraction of miRNA genes is likely to form clusters. However, the evolutionary and biological function implications of clustered miRNAs are still elusive. After identifying clustered miRNA genes under different maximum inter-miRNA distances (MIDs), this study intended to reveal evolution conservation patterns among these clustered miRNA genes in metazoan species using a computation algorithm. As examples, a total of 15–35% of known and predicted miRNA genes in nine selected species constitute clusters under the MIDs ranging from 1 kb to 50 kb. Intriguingly, 33 out of 37 metazoan miRNA clusters in 56 metazoan genomes are co-conserved with their up/down-stream adjacent protein-coding genes. Meanwhile, a co-expression pattern of miR-1 and miR-133a in the mir-133-1 cluster has been experimentally demonstrated. Therefore, the MetaMirClust database provides a useful bioinformatic resource for biologists to facilitate the advanced interrogations on the composition of miRNA clusters and their evolution patterns.
Highlights
► A novel database for miRNA cluster discovery and visualization. ► An efficient machine learning approach for the discovery of miRNA clusters. ► An extensive study of miRNA cluster properties in fifty-six metazoan species. ► A tool for interrogating recruitments of miRNAs in paralogous cluster families.