
A Comparative Study of Machine Learning and Evolutionary Computation Approaches for Protein Secondary Structure Classification 19
Martin, B. (1995). Instance-based learning: Nearest neighbor with generalisation, Master’s thesis,
Department of Computer Science, University of Waikato, Waikato, New Zealand.
Matthews, B. (1975). Comparison of the predicted and observed secondary structure of T4
phage lysozyme, Biochimica et Biophysica Acta 405(2): 442–451.
Mitchell, T. M. (1997). Machine Learning, McGraw-Hill, New York, USA.
Nelson, D. & Cox, M. (2008). Lehninger Principles of Biochemistry,5
th
edn, W.H. Freeman, New
York, USA.
Nölting, B. (2006). Protein Folding Kinetics,2
nd
edn, Springer-Verlag, Berlin, Germany.
Ohkawa, T., Namihira, D., Komoda, N., Kidera, A. & Nakamura, H. (1996). Protein structure
classification by structural transformation, Proc. of IEEE International Joint Symposia on
Intelligence and Systems, IEEE Computer Society Press, Piscataway, USA, pp. 23–29.
Pauling, L., Corey, R. & Branson, H. (1951a). Configurations of polypeptide chains with
favored orientations of the polypeptide around single bonds: two pleated sheets,
Proceedings of the National Academy of Sciences of the USA 37(11): 729–740.
Pauling, L., Corey, R. & Branson, H. (1951b). The structure of proteins: two hydrogen-bonded
helicel configurations of the polypeptide chain, Proceedings of the National Academy of
Sciences of the USA 37: 205–211.
Platt, J. C. (1998). Fast training of support vector machines using sequential minimal
optimization, in C. B. B. Schoelkopf & A. Smola (eds), Advances in Kernel Methods,
MIT Press, Cambridge, USA.
Quinlan, J. (1993). C4.5: Programs for Machine Learning, San Francisco, USA.
Rabiner, L. (1989). A tutorial on hidden Markov models and selected applications in speech
recognition, 77(2): 257–286.
Scapin, M. & Lopes, H. (2007). A hybrid genetic algorithm for the protein folding problem
using the 2D-HP lattice model., in A. Yang, Y. Shan & L. Bui (eds), Success in
Evolutionary Computation,Vol.92ofStudies in Computational Intelligence,Springer,
Heidelberg, Germany, pp. 205–224.
Seymore, K., McCallum, A. & Rosenfeld, R. (1999). Learning hidden Markov model structure
for information extraction, AAAI 99 Workshop on Machine Learning for Information
Extraction, pp. 37–42.
Shmygelska, A. & Hoos, H. (2005). An ant colony optimisation algorithm for the 2D and 3D
hydrophobic polar protein folding problem, BMC Bioinformatics 6: 30.
Sing, T., Sander, O., Beerenwinke, N. & Lengauer, T. (2005). ROCR: visualizing classifier
performance in R, Bioinformatics 21: 3940–3941.
Sunde, M. & Blake, C. (1997). The structure of amyloid fibrils by electron microscopy and
X-ray diffraction, Advances in Protein Chemistry 50: 123–159.
Tavares, L., Lopes, H. & Lima, C. (2008). A comparative study of machine learning methods
for detecting promoters in bacterial DNA sequences, in D.-S.Huang,D.S.L.
Donald C. Wunsch II & K.-H. Jo (eds), Advanced Intelligent Computing Theories
and Applications, Vol. 5227 of Lecture Notes in Computer Science,Springer-Verlag,
Heidelberg, pp. 959–966.
The UniProt Consortium (2010). The universal protein resource (UniProt) in 2010, Nucleic
Acids Research 38: D142–D148.
Tsunoda, D. & Lopes, H. (2006). Automatic motif discovery in an enzyme database using a
genetic algorithm-based approach, Soft Computing 10: 325–330.
257
A Comparative Study of Machine Learning
and Evolutionary Computation Approaches for Protein Secondary Structure Classification