Data sets

cTP-containing data set used for training the ChloroP networks

ChloroP was trained on a set of 150 sequences, of which 75 contained a chloroplast transit peptide (cTP). All 75 cTP-containing proteins were checked against the papers that originally presented them, and a few database annotation errors were corrected. The 75 sequences were also redundancy-reduced (with regard to their annotated cTP sequence) using Hobohm algorithm 2 (Hobohm, U., et al., Protein Science 1:409-417 (1992)).
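As a rough illustration of the redundancy-reduction step, the sketch below follows the general idea of Hobohm algorithm 2: build a list of "neighbours" (sequences that are too similar to each other), then repeatedly remove the sequence with the most neighbours until none are left. The similarity test here is an assumption made for the example: it uses Python's difflib ratio with a hypothetical threshold of 0.8, not the alignment-based measure or cutoff used when the ChloroP set was prepared. The names `similar` and `hobohm2` are illustrative only.

```python
from difflib import SequenceMatcher


def similar(a, b, threshold=0.8):
    """Stand-in similarity test (assumption): difflib ratio, not an alignment score."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio() >= threshold


def hobohm2(seqs, is_similar=similar):
    """Return the indices of the sequences kept after greedy neighbour removal."""
    n = len(seqs)
    # Two sequences are neighbours if the similarity test says they are too alike.
    neighbours = {i: set() for i in range(n)}
    for i in range(n):
        for j in range(i + 1, n):
            if is_similar(seqs[i], seqs[j]):
                neighbours[i].add(j)
                neighbours[j].add(i)
    # Repeatedly drop the sequence with the most remaining neighbours
    # (ties are broken in favour of removing the lowest index).
    while neighbours:
        worst = max(neighbours, key=lambda k: (len(neighbours[k]), -k))
        if not neighbours[worst]:
            break  # no similar pairs remain
        for other in neighbours.pop(worst):
            neighbours[other].discard(worst)
    return sorted(neighbours)


# Made-up toy peptides, not taken from the ChloroP data set.
toy = ["MASSTLSSAA", "MASSTLSSAV", "MASSTLSSAT", "GKLPWQERTY"]
kept = hobohm2(toy)
print([toy[i] for i in kept])
```

In this toy input the first three peptides are mutual neighbours, so only one of them survives alongside the unrelated fourth peptide. In practice the similarity test would be replaced by a pairwise alignment score applied to the annotated cTP regions.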

Here are the 75 sequences containing cTP:

Here are the 75 sequences not containing cTP:

