Keywords:-
Article Content:-
Abstract
The ample availability and importance of large-scale protein-protein interaction (PPI) data demand a flurry of research efforts to understand cells' organization, processes, and functioning by analyzing these data at the network level. In the bioinformatics and data mining fields, network clustering requires a lot of attraction to discover clusters of interacting proteins. Clustering proteins in a PPI network has been an excellent method for discovering functional modules, disclosing functions of unknown proteins, and other tasks in numerous research over the last decade. In this research, a unique graph mining approach is proposed to detect dense neighborhoods (highly connected regions) in an interaction graph, including protein complexes. Our technique first finds size-3 cliques and then expands these size-3 cliques based on their affinity to produce maximal dense regions. To highlight the efficiency of our suggested strategy, we present experimental results using yeast and human protein interaction data. Our predicted complexes match or overlap much better with the gold standard protein complexes in the CYC-2008 and CORUM benchmark databases than other existing approaches.
References:-
References
E.Wong, B. Baur, S. Quader, and C.H. Huang. Biological network motif detection: principles and practice. Briefings in Bioinformatics, 13(2):202-215, 2011.
N.T. Tran, S. Mohan, Z. Xu, and C.H. Huang. Current innovations and future challenges of network motif detection. Briefings in Bioinformatics, 16(3):497-525, 2014.
Guimin Qin and Lin Gao. An algorithm for network motif discovery in biological networks. International journal of data mining and bioinformatics, 6(1):1-16, 2012.
Bjorn H Junker and Falk Schreiber. Analysis of biological networks, volume 2. John Wiley & Sons, 2011.
Martin G Grigorov. Global properties of biological networks. Drug discovery today, 10(5):365-372, 2005.
Anil K Jain and Richard C Dubes. Algorithms for clustering data. Englewood Cliffs: Prentice Hall, 1988.
Liang Yu, Lin Gao, Kui Li, Yi Zhao, and David KY Chiu. A degree-distribution based hierarchical agglomerative clustering algorithm for protein complexes identification. Computational biology and chemistry, 35(5):298-307, 2011.
Saeed Jalili, Sayed-Amir Marashi, et al. Camwi: Detecting protein complexes using weighted clustering coefficient and weighted density. Computational biology and chemistry, 58:231-240, 2015.
Ioannis Xenarios, Danny W Rice, Lukasz Salwinski, Marisa K Baron, Edward M Marcotte, and David Eisenberg. Dip: the database of interacting proteins. Nucleic acids research, 28(1):289-291, 2000.
Gary D Bader and Christopher WV Hogue. An automated method for finding molecular complexes in large protein interaction networks. BMC bioinformatics, 4(2), 2003.
Gergely Palla, Imre Derenyi, Illes Farkas, and Tamas Vicsek. Uncovering the overlapping community structure of complex networks in nature and society. nature, 435(7043):814{818, 2005.
Md Altaf-Ul-Amin, Yoko Shinbo, Kenji Mihara, Ken Kurokawa, and Shigehiko Kanaya. Development and implementation of an algorithm for detection of protein complexes in large interaction networks. BMC bioinformatics, 7(207), 2006.
Andrew D King, Natasa Przulj, and Igor Jurisica. Protein complex prediction via cost-based clustering. Bioinformatics, 20(17):3013-3020, 2004.
Mokhtarul Haque, Rosy Sarmah, and Dhruba K Bhattacharyya. A common neighbour based technique to detect protein complexes in ppi networks. Journal of Genetic Engineering and Biotechnology, 16(1):227-238, 2018.
Salwinski L, Miller CS and Smith AJ, et al., The database of interacting proteins: 2004 update, Nucleic acids research, vol 32, number suppl_1, pp. D449–D451, 2004.
Oughtred R, Stark C and Breitkreutz BJ, et al., The BioGRID interaction database: 2019 update, Nucleic acids research, vol 47, number D1, pp. D529–D541, 2019.
Pu S,Wong J and Turner B, et al., Up-to-date catalogues of yeast protein complexes, Nucleic acids research, vol 37, number 3, pp. 825–831, 2009.
Giurgiu M, Reinhard J, Brauner B, et al., CORUM: the comprehensive resource of mammalian protein complexes—2019, Nucleic acids research, vol 47, number D1, pp. D559–D563, 2019.