Pattern Analysis and Neural Networks Group Department of Computing University of Exeter UK (s.singh@exeter.ac.uk)
ABSTRACT
Spiral structures are one of the most difficult patterns to classify. Spiral time series data has a helical movement with time that is both difficult to predict as well as classify. This paper focusses on how structural information about spirals can be useful in providing critical information to a neural network for their recognition. Results are presented on neural network solutions to the classical two-spiral problem by extracting structural and rotational information from the spiral training data. The results show that in both two and three dimension, the spirals can be easily recognised by neural networks if they are trained on the temporal structural changes.
1
1. SPIRAL STRUCTURES
In our previous paper in Pattern Recognition Letters[11] it was highlighted that spiral classification is a difficult problem which can be reasonably well solved using a fuzzy nearest neighbour classifier. Our earlier reservation for not using a neural network was that neural networks have convergence problems with raw coordinate data as inputs for recognising spirals. The two spiral benchmark from Carnegie Mellon repository in particular has been highly researched and it has been found that sophisticated neural networks have difficulty in classifying spirals with reasonable accuracy. In this paper we summarise the problem in short again for new readers and develop an input selection method for neural networks that allows them to recognise spirals in two or more dimensions with relative ease.
Spiral data is found in several natural and physical domains. The classic double helix DNA, the motion of particles in cyclotrons, spiral feed in manufacturing, spiral galaxies and spiral movement of financial stocks are some of the well-known examples. Spirals are particularly
References: [1] Touretzky, DS, and Pomerleau, DA. What’s hidden in the hidden layers? Byte 1989; August issue:227-233. [2] Fahlman, SE. Faster-learning variations on back-propagation: An empirical study. In Proceedings of the 1988 Connectionist Models Summer School, Morgan Kaufmann, 1988. [3] Fahlman, SE and Lebiere, C. The cascade-correlation learning architecture, In Advances in neural information processing systems 2, Touretzky, DS (ed.), Morgan Kaufmann, 1990. [4] Lang, KJ and Witbrock, MJ. Learning to tell two spirals apart, In Proceedings of the 1988 Connectionist Models Summer School, Morgan Kaufmann, 1988. [5] Tay, LP and Evans, DJ. Fast learning artificial neural network (FLANN II) using the nearest neighbour recall. Neural, Parallel and Scientific Computations 1994; 2(1):17-27. [6] Sun, CT and Jang, JS. A neuro-fuzzy classifier and its applications, In Proceedings of the IEEE International conference on fuzzy systems, 1993, vol. 1, pp. 94-98. [7] Chua, H, Jia, J, Chen, L and Gong, Y. Solving the two-spiral problem through input data encoding, Electronics letters 1995; 31(10):813-14. [8] Jia, J and Chua, H. Solving two-spiral problem through input data representation, In Proceedings of the IEEE International conference on neural networks, 1995, vol. 1, pp. 132-135. [9] Ulgen, F, Akamatsu, N and Iwasa, T. The hypercube separation algorithm: a fast and efficient algorithm for on-line handwritten character recognition, Applied Intelligence 1996; 6(2):101-116. [10] Singh, S. A single nearest neighbour fuzzy approach for pattern recognition, (in press, International Journal of Pattern Recognition and Artificial Intelligence, 1999). [11] Singh, S. 2D spiral recognition using possibilistic measures, Pattern Recognition Letters 1998; 19(2):141-147. 15 [12] Singh, S. Effect of noise on generalisation in Massively Parallel Fuzzy Systems, Pattern Recognition, vol. 31, issue 9, 1998. [13] Singh, S. Massively Parallel Fuzzy Systems: The case of the three spiral recognition, In Proceedings of the IEEE International Conference on Fuzzy Systems FUZZ-IEEE '98, IEEE Press, 1998, vol. 2, pp. 1066-1071. 16