Introduction

The formation of biomolecular condensates by liquid‒liquid phase separation has become a universal mechanism for spatiotemporal coordination of biological activities in cells and has been widely observed to directly regulate the key cellular processes involved in cancer cell pathology. However, the complexity of protein sequences and the diversity of conformations are inherently disordered, which poses great challenges for LLPS protein calculations and experimental research.

Herein, we proposed a novel predictor (PredLLPS_PSSM) for LLPS protein identification based only on sequence evolution information. PredLLPS_PSSM provides the option of predicting not only LLPS proteins but also PS-Self and PS-Part proteins. the source code for PredLLPS_PSSM are freely available at here.

Dataset

We built two new independent LLPS protein test datasets, which were collected from LLPSDB V2.0. We first filtered out proteins with posttranslational modification sites, invalid amino acids and lengths less than 50 or greater than 5000. Then, we weeded out those sequences with a similarity greater than 0.4 from the database of LLPS+, LLPS-, SaPS, and PdPS applying CD-HIT. Finally, there were 48 PS-Self proteins named Ind_Test_I and 140 PS-Part proteins named Ind_Test_II.

We collected LLPS proteins from the latest version of PhaSepDB V2.1, LLPSDB V2.0, and PhaSePro to construct the training data for the final prediction model. Through The CD-HIT algorithm with a minimum similarity threshold of 0.4 was used to remove proteins with high similarity. we obtained 578 LLPS proteins, 246 PS-self proteins, and 410 PS-part proteins. The same number of non-LLPS proteins (negative dataset) were randomly selected from PDB* correspondingly. For the convenience of description, we named the three final training datasets LLPS_Training, PS_Self_Training and PS_Part_Training, which are used to train the models.

In addition to the data in the ready-made database, we obtain nine protein sequences from Insect data that have been experimentally verified to undergo LLPS.

Contact Us

If you have any questions, please feel free to contact us.

Name:

Shengming Zhou

Location:

No. 1, Linghai Road, Dalian City, Liaoning Province, China