Complexity Reduction of Local Linear Models Extracted from Neural Networks
Keywords:
model reduction, model transformation, knowledge discoveryAbstract
Nonlinear black-box models have become more and more important not only in research but also in industrial practice. However, their main disadvantage is that they are often too complex and not interpretable; therefore it is a hard and complex task to validate them by human experts. It is a challenge how a priori knowledge can be utilized and integrated into the black-box modeling approach. This could be a difficult multi-stage process. One of these steps can be the reduction of the identified model. It is also important from the viewpoint of overparameterization and to reduce the time and computational demand of the model. This article would like to show how model reduction techniques can be used for complexity reduction purposes by local models from neural networks. A possible method family is orthogonal techniques. These methods can roughly be divided into two groups: the rank revealing ones like SVD-QR algorithm and those that evaluate the individual contribution of the rule or local models, like the orthogonal least-squares approach (OLS). This later technique requires more computations, but for system identification purposes it is preferable as it gives a better approximation result. Apart from that, other methods can also be used to reduce the number of local models: the most similar models can be merged together. The analyzed methods are used for knowledge discovery purposes from neural networks.
References
Abonyi, J., Feil, B. (2007). Aggregation and Visualization of Fuzzy Clusters based on Fuzzy Similarity Measures. Advances in Fuzzy Clustering and its Applications, John Wiley & Sons, 95–123. https://doi.org/10.1002/9780470061190.ch5
Abonyi, J. (2003). Fuzzy Models of Dynamical Systems. Fuzzy Model Idetiftcation for Control, Birkhauser, 60–61. https://doi.org/10.1007/978-1-4612-0027-7_3
Benitez, J. M., Castro, J. L., Requena, I. (1997). Are artifical neural networks black boxes? IEEE Transactions on Neural Networks, 8(5), 1156–1164. https://doi.org/10.1109/72.623216
de Castro, L. N., Iyoda, E. M., Zuben, F. J. V., Gudwin, R. (1998). Feedforward Neural Network Initialization: an Evolutionary Approach. Proceedings of the 5th Brazilian Symposium on Neural Networks, 43–49. https://doi.org/10.1109/SBRN.1998.730992
Cun, Y. L., Denker, J., Solla, S. (1990). Optimal brain damage. Advances in neural information processing systems, 2. 598–605.
Duch, W. (2003). Coloring black boxes: visualization of neural network decisions. Int. Joint Conf. on Neural Networks, Portland, Oregon, IEEE Press, 1. 1735–1740. https://doi.org/10.1109/IJCNN.2003.1223669
Duch, W. (2004). Visualization of Hidden Node Activity in Neural Networks: I. Visualization Methods. Lecture Notes in Artificial Intelligence, 3070. 38–43 https://doi.org/10.1007/978-3-540-24844-6_5
Duch, W. (2004). Visualization of hidden node activity in neural networks: II. Application to RBF networks. Lecture Notes in Artificial Intelligence, 3070. 44–49. https://doi.org/10.1007/978-3-540-24844-6_6
Harris, T. J., Yu, W. (2007). Controller assessment for a class of non-linear systems. Journal of Process Control, in press. https://doi.org/10.1016/j.jprocont.2007.01.011
Hassibi, B., Stork, D., Wolff, G. (1992). Optimal brain surgeon and general network pruning. Technical Report 9235, RICOH California Research Center, Menlo Park, CA.
Henrique, H. M., Lima, E. L., Seborg, D. E. (2000). Model structure determination in neural network models. Chemical Engineering Science, 55. 5457–5469. https://doi.org/10.1016/S0009-2509(00)00170-6
Huang, D. S., Zhao, W. B. (2005). Determining the centers of radial basis probabilistic neural networks by recursive orthogonal least square algorithms, Applied Mathematics and Computation, 162. 461–473. https://doi.org/10.1016/j.amc.2003.12.105
Markowska-Kaczmar, U., Chumieja, M. (2003). Opening neural network black box by evolutionary approach. Design and application of hybrid intelligent systems, 147–156.
Mastorocostas, P. A., Theocharis, J. B., Petridis, V. S. (2001). A constrained orthogonal least-squares method for generating TSK fuzzy models: Application to short-term load forecasting. Fuzzy Sets and Systems, 118. 215–233. https://doi.org/10.1016/S0165-0114(98)00344-3
Mastorocostas, P. A., Theocharis, J. B. (2003). An orthogonal least-squares method for recurrent fuzzy-neural modeling. Fuzzy Sets and Systems, 140. 285–300. https://doi.org/10.1016/S0165-0114(02)00523-7
Nelles, O. (2001) Nonlinear system identification. Springer-Verlag. https://doi.org/10.1007/978-3-662-04323-3
Setiono, R., Leow, W. K., Thong, J. Y. L. (2000). Opening the neural network black box: an algorithm for extracting rules from function approximating artificial neural networks. Proceedings of the 21st International Conference on Information systems, Queensland, Australia, 176–186.
Setnes, M. (2001). Complexity reduction in fuzzy systems. PhD Thesis, TU Delft.
Downloads
Published
Issue
Section
License
Copyright (c) 2007 Kenesei Tamás, Feil Balázs, Abonyi János

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

