The U.S National Library of Medicine states proteins play critical roles in the body, they are required for the function of the body’s tissues and organs. Several smaller units of twenty different types of amino acids are combined to make a protein. A unique feature of eukaryotic cells is that specific functions are performed in spatially defined membrane bound compartments, thus the subcellular location of protein gives information about its function. The use of this information makes strides in several fields like drug design. While several papers exist on identifying the subcellular locations of proteins, there has been no significant improvement in prediction accuracy over the years as will be discussed in this paper. A general issue is how proteins are represented and what algorithms can be used to improve prediction accuracy. This paper uses the Pseudo amino acid composition to represent proteins sequence and the classifications algorithms used includes Random Forest, AdaBoost, and SAMME, Support Vector Machines and Artificial Neural Networks. To improve prediction accuracy, gradient boosting algorithms is applied as well as some techniques to tackle class imbalance.