Neuro-fuzzy Modeling and Fuzzy Rule Extraction ... - Tshilidzi Marwala

Comment

Report 2 Downloads 65 Views

Neuro-fuzzy Modeling and Fuzzy Rule Extraction Applied to Conﬂict Management Thando Tettey and Tshilidzi Marwala University of the Witwatersrand School of Electrical and Information Engineering Private Bag 3 Wits 2050, Johannesburg, South Africa {t.tettey, t.marwala}@ee.wits.ac.za

Abstract. This paper outlines all the computational methods which have been applied to the conﬂict management. A survey of all the pertinent literature relating to conﬂict management is also presented. The paper then introduces the Takagi-Sugeno fuzzy model for the analysis of interstate conﬂict. It is found that using interstate variables as inputs, the Takagi-Sugeno fuzzy model is able to forecast conﬂict cases with an accuracy of 80.36%. Furthermore, it found that the fuzzy model oﬀers high levels of transparency in the form of fuzzy rules. It is then shown how these rules can be translated in order to validate the fuzzy model. The Takagi-Sugeno model is found to be suitable for interstate modeling as it demonstrates good forecasting ability while oﬀering a transparent interpretation of the modeled rules.

1

Introduction

With the frequency at which wars are occurring, it has become imperative that more research eﬀort be directed towards conﬂict management. The focus of this study over the years has been centered on ﬁnding improved approaches to conﬂict forecasting at the same time not neglecting causal intepretation of interstate interactions [1]. Understanding the reasons why countries go to war is just as signiﬁcant as forecasting the onset of war because it proposes the steps that can be taken to avoid conﬂict. Therefore a successful interstate conﬂict tool is one which is able to forecast dispute outcomes fairly accurately at the same time allowing for an intuitive causal intepretation of interstate interactions. International conﬂict has been studied using mainly techniques found in the ﬁelds of Statistics and Computational intelligence. These techniques have been applied on quantitative measures which have been collected over the years. It is widely accepted that improvements in conﬂict forecasting can mainly be acheived in two ways [1]. The ﬁrst improvement that can be made is with the data and measures of interstate interactions [2] and secondly by improving the forecasting of interstate conﬂict involves ﬁnding models which approximate interstate interactions better. I. King et al. (Eds.): ICONIP 2006, Part III, LNCS 4234, pp. 1087–1094, 2006. c Springer-Verlag Berlin Heidelberg 2006

1088

T. Tettey and T. Marwala

In this paper a survey of the work performed on international conﬂict is presented. The paper focuses mainly on the techniques applied to analyse quantitative descriptions of interstate interactions. An outline of the methods together with their shortcomings is discussed. Finally, neuro-fuzzy modeling is presented as an alternative technique which addresses the current shortcomings that exist in international conﬂict studies.

2

Background and Literature Survey

Militarised interstate dispute (MID) is deﬁned as a set of interactions between or among states that can result in the actual use, display or threat of using military force in an explicit way [3]. Projects such as the Correlates of War (COW) facilitate the collection, dissemination and use of accurate and reliable quantitative data in international relations [4]. The collected data, called interstate variables, are used to study the conditions associated with MID. The measures used in MID studies are People Oriented Government, Dependency, Capability, Alliance, Contiguity, Distance and Major power. Any set of measures describing a particular context has a dispute outcome attached to it. The dispute outcome is either a peace or conﬂict situation. Statistical methods such as logit and probit have been used in the analysis of interstate variables. However, it has been found that these methods have several shortcomings [1]. Some of the problems associated with the use of logit and probit is that they require the use of a priori knowledge usually obtained from the analyst. A problem then arises when the analyst pushes their data analyses extremely hard in search of eﬀects they believe exist but are diﬃcult to discover [1]. The consequence of this is that the results vary from researcher to researcher and are therefore not exactly repeatable. The other problem, as one might expect, is that conﬂict cases occur far less frequently than peace cases. Interstate conﬂict is therefore a rare event and the processes which drive it are likely to be diﬀerent from those found elsewhere. This has led quantitative researchers to conclude that the relationship between the interstate variables and dispute outcomes is highly nonlinear and highly correlated [1]. The conclusion is further conﬁrmed by the studies performed by Lagazio and Russet [5]. This means that statistical techniques, linear-normal models in particular, would perform poorly at modeling the relationship between interstate disputes and their outcomes. The neural network has also been applied to interstate conﬂict modeling and forecasting. The neural network was ﬁrst introducted by Schrodt [6] in 1995 and by Zeng [7] in 1999 as a method of analysing conﬂict without the need for the researcher to incorporate qualitative a priori knowlege or make assumptions about the problem space. The neural network was presented as a function approximator which is able to model highly nonlinear and interdependant relationships. However the neural network itself suﬀers similar problems to statistical methods in that a model selection technique must be considered. In recent studies, Beck et al [1] make use of a neural network, which is trained using the Bayesian framework outlined in [8]. The Bayesian training of neural networks involves

Neuro-fuzzy Modeling and Fuzzy Rule Extraction

1089

the use of Bayesian framework to identify the optimal weights and biases in a neural network model. It is found that the use of neural networks yields results expressed in the form of classiﬁcation accuracy. This interpretation of the results is found to be unambigious compared to previous methods. However, the resulting neural network model is regarded as a black box due to the fact that it does not provide a way of obtaining a causal interpretion of dispute outcomes. The weights extracted from the neural network oﬀer no understanding as to why countries go to war. In [9], Marwala and Lagazio propose the use of Automatic Relevance Detection (ARD) as a means to making the neural network more transparent. The result of ARD reveals that the importance of the interstate variable in predicting dispute outcomes is as follows (listed in decreasing relavance): Peoples Oriented Government, Capability, Dependancy, Allies, Contiguity, Distance and Major power. From this work on neural networks we can conclude that neural network models have a fairly strong forecasting ability but only a limited amount of knowledge can be extracted. In [10], Habtemariam et al introduce support vector machines (SVMs) to the study of conﬂict management. It is found that SVMs oﬀer an improved forecasting ability over neural networks. However, a sensitivity analysis which aims to determine the inﬂuence of each variable on a dispute outcome reveals that results obtained from neural networks are much more intuitive. Therefore, while SVMs oﬀer better forecasting ability they lack the ability to give an intuitive causal interpretation of the results. As stated earlier on in the paper, the main focus of studies in international conﬂict has been on the ability of a model to accurately forecast dispute outcomes while at the same time allow the analyst to extract knowledge from the model. In the next section a Neuro-fuzzy model is proposed as a method of modeling interstate interaction which has a fairly accurate forecasting ability and at the same time oﬀers intuitive causal explainations of disputes obtained from a fuzzyrule extraction process.

3

Fuzzy Systems and Neuro-fuzzy Modeling

Fuzzy logic concepts provide a method of modelling imprecise models of reasoning, such as common sense reasoning, for uncertain and complex processes. Fuzzy set theory resembles human reasoning in its use of approximate information and uncertainty to generate decisions. In fuzzy systems, the evaluation of the output is performed by a computing framework called the fuzzy inference system. The fuzzy inference system is a computing framework that maps fuzzy or crisp inputs to the output - which is usually a fuzzy set [11]. The inference system performs a composition of the inputs using fuzzy set theory, fuzzy if-then rules and fuzzy reasoning to arrive at the output. More speciﬁcally, the fuzzy inference involves the fuzziﬁcation of the input variables, evaluation of rules, aggregation of the rule outputs and ﬁnally the defuzziﬁcation of the result. The are two popular fuzzy models: the Mamdani model and the Takagi-Sugeno (TS) model [12].

1090

T. Tettey and T. Marwala

A fuzzy rule-based system can be viewed as a layered network similar to Radial basis function (RBF) artiﬁcial neural networks [12]. When training RBF networks several parameters of the kernel have to be optimised. Similarly when setting up a Fuzzy rule-based system we are required to optimise parameters such as membership functions and consequent parameters. In order to optimise these parameters, the neuro-fuzzy system relies on training algorithms inherited from artiﬁcial neural networks such as gradient descent [12]. There are two approaches to training neuro-fuzzy models [12]: 1. Fuzzy rules may be extracted from expert knowledge and used to create an initial model. The parameters of the model can then be ﬁne tuned using data collected from the operational system being modelled. 2. The number of rules can be determined from collected numerial data using a model selection technique. The parameters of the model are also optimised using the existing data. The Takagi-Sugeno model is mostpopular when it comes to data-driven identiﬁcation and has been proven to be a universal approximator [11]. The major motivation for using a neuro-fuzzy model in this work is that it is considered by many to be a ‘gray-box’ [12]. Unlike neural networks, once the neuro-fuzzy model has been optimised it is possible to extract the fuzzy rules and perhaps interpret the obtained results in an intuitive and qualitative manner. With neural networks e.g. the multilayer perceptron, it is not possible to intuitively explain input-output relationships using the weights of the network. In this study the neuro-fuzzy hybrid system is used to model the relationship between interstate (input) variables and the outcome of a dispute (output) from MID data.

4

Conﬂict Forecasting Using a Neuro-fuzzy Model

In our study, the TS neuro-fuzzy model has been optimised to map the relationship between the input variables (interstate variables) and the output i.e. the dispute outcome. The neuro-fuzzy model is trained using a balanced set of 500 peace cases and 500 conﬂict cases. The remaining 26845 peace instances and 392 conﬂict instance are then used for testing the forecasting ability of the model. The model selection process involves selecting the optimum number of rules of the TS model. This is done by creating models with rules ranging from 2 to 7 and evaluating them using 5 fold cross-validation. It is found that the optimum model consists of 2 fuzzy rules. The forecasting ability of the neuro-fuzzy model is then tested using the test examples. The prediction ability of the neuro-fuzzy model is evaluated based on how well it is able to predict both conﬂict and peace outcomes. The receiver operating characteristic (ROC) curve in ﬁg 1 is used to illustrate the results. As the output of the neuro-fuzzy is expressed as a decision value, an optimum threshold needs to be determined as the decision point which allows the overall

Neuro-fuzzy Modeling and Fuzzy Rule Extraction

1091

Fig. 1. A ROC curve illustrating the performance of the neuro-fuzzy model Table 1. Results for the TS neuro-fuzzy model

Correctly predicted Incorrectly predicted

Conﬂict cases Peace cases 315 17967 77 8378

maximum peace and conﬂict prediction accuracy. This threshold is found to be 0.488 and the results are expressed as a confusion matrix shown in Table 1. The results show the TS neuro-fuzzy model predict conﬂict cases with an accuracy of 80.36% while predicting peace cases with an accuracy of 66.93%. In the work done by Habtemariam et al [10], it was found that SVM predicts peace and conﬂict with 79% and 75%, respectively. From these results it is clear that the TS neuro-fuzzy model predicts conﬂict with a much higher accuracy but sacriﬁces its performance on peace outcomes. This biased result will be accepted as the prediction of conﬂict is considered more important than the prediction of peace.

5

Fuzzy Rule Extraction

The TS neuro-fuzzy model used for forecasting in the previous section can also be used for rule extraction. Two fuzzy rules can be extracted from the model and they are shown below. 1. If u1 is A11 and u2 is A12 and u3 is A13 and u4 is A14 and u5 is A15 and u6 is A16 and u7 is A17 then y(k) = −1.86 · 10−1 u1 − 1.33 · 10−1 u2 + 0.00 · 100 u3 − 6.05 · 10−1 u4 − 1.26 · 10−1 u5 − 1.33 · 100 u6 + 4.71 · 10−1 u7 + 8.95 · 10−1

1092

T. Tettey and T. Marwala

Fig. 2. A ROC illustrates the performance degradation of the neuro-fuzzy model when several inputs are pruned

2. If u1 is A21 and u2 is A22 and u3 is A23 and u4 is A24 and u5 is A25 and u6 is A26 and u7 is A27 then y(k) = −2.79 · 10−1 u1 + 6.26 · 10−2 u2 + 2.47 · 10−1 u3 − 7.56 · 10−1 u4 − 8.85 · 10−1 u5 − 9.04 · 100 u6 + 0.00 · 100 u7 + 3.73 · 10−1 The symbols from u1 to u7 are the input vector which consists of People oriented Government, Dependancy, Capability, Alliance, Contiguity, Distance and Major power. The rest of the symbols are as previously deﬁned. It is clear that the rules are quite complex and need to be simpliﬁed in order to obtain a didactic interpretation. In fact it is often found that when automated techniques are applied to obtaining fuzzy models, unnecessary complexity is often present [13]. Setnes et al [13] present a similarity measure for fuzzy rule based simpliﬁcation. Two methods of simplifying rules that are proposed are the removal and/or merging of similar fuzzy sets. Removal of a fuzzy set is proposed in the event that it is similar to the universal set i.e. μ ≈ 1 and the merging of fuzzy sets is proposed in the event that fuzzy sets from diﬀerent rules but belonging to the same premise are similar. In our case the TS fuzzy model contains only two fuzzy rules. The removal of a fuzzy sets similar to the universal set leaves only one remaining fuzzy set. This results in the input being partitioned into only one fuzzy set and therefore introduces diﬃculty when expressing the premise in linguistic terms. To simplify the fuzzy rules and avoid the redundant fuzzy sets the number of inputs into the TS neuro-fuzzy model have been pruned down to four variables. These variables are People oriented Government, Dependancy, Alliance and Contiguity. Fig 2 illustrates how the output deteriorates when three of the inputs are pruned. The ROC curve shows that the performance degradation is minimal. The rules extracted can then be converted so that they are represented in the commonly used linguistic terms. However it is only possible to translate the

Neuro-fuzzy Modeling and Fuzzy Rule Extraction

1093

antecedent of the fuzzy statement into english. The consequent part together with the ﬁring strength of the rule are still expressed mathematically. The translated fuzzy rules with the ﬁring strengths omitted can be written as shown below. 1. If Government orienting towards people is low and Alliance is strong and Contiguity is true then y(k) = −3.87 · 10−1 u1 − 9.19 · 10−1 u3 − 7.95 · 10−1 u4 + 3.90 · 10−1 2. If Government orienting towards people is high and Alliance is weak and Contiguity is false then y(k) = −1.25 · 10−1 u1 − 5.62 · 10−1 u3 − 2.35 · 10−1 u4 + 4.23 · 10−1 To validate the model we can then apply expert knowledge of the problem domain. For instance if the level of people oriented Government of two countries is low, they have a weak alliance and they share a border there is a reasonable chance that the countries can ﬁnd themselves in a conﬂict situation. If we ﬁnd values of Majority rule, Alliance and Contiguity which have a membership value of one, we can then use these as inputs to the model to see if it conﬁrms our existing knowledge. It is found that by using these values and and an arbitrary Dependency value the model gives a prediction of 0.6743 which is above the conﬂict threshold of 0.5360 calculated from the ROC curve. By validating the model with similar statements, we can get a feel for how much conﬁdence we can put in the system. The neuro-fuzzy model therefore oﬀers an improved method of forecasting international conﬂict as it allows for accurate prediction of dispute outcomes while also catering for the cases where causal interpretations are required.

6

Conclusion and Recommendataion

A background on conﬂict management has been presented in the form of a literature survey. The methods that have been used in the past to predict interstate disputed have been highlighted. The performance criteria that is expected of a forecasting model has been stated i.e. a satisfactory model must be able to accurately predict dispute outcome and at the same time be able to give a causal explaination of the results. It has been found that previous models have either lacked a suﬃcient prediction ability or transparency. A Takagi-Sugeno fuzzy model which is trained using concepts from neural network learning is then proposed. It is shown that the TS model is able to predict conﬂict cases with an accuracy of 80.36%. Further, the TS model is expressed as a set of fuzzy rules which are made readable by expressing them using common linguistic terms. The TS model therefore is able to meet the speciﬁed peformance criteria. However, we recommend the Takagi-Sugeno model for prediction only as it is able to predict the conﬂict cases accurately. The transparency of the fuzzy model allows for improved validation and therefore increases the chances of user acceptance. It is not advisable for the system to be used to source expert knowledge because of its accuracy. A prediction accuracy of 80.36% is considered good

1094

T. Tettey and T. Marwala

especially on a rare-event prediction problem. However, this means that under certain conditions the fuzzy rules may oﬀer incorrect knowledge. To increse the validity of the model we also recommend that the way some of the interstate variables are expressed be reviewed. For instance, variable such as contiguity have values of either true or false. In the fuzzy domain these are considered exceptions and are commonly termed as fuzzy singleton. Therefore, expressing these fuzzy sets as Gaussian membership functions might introduce ﬂaws into the model.

References 1. N. Beck, G.K., Zeng, L.: Improving quantitative studies of international conﬂict: A conjecture. American Political Science Review 94(1) (2000) 21–35 2. Jones, D., Bremer, S., Singer, J.: Militarized interstate disputes, 1816-1992 rationale, coding rules and empirical patterns. Conﬂict Management and Peace Science 15(2) (1996) 585–615 3. Gochman, C., Maoz, Z.: Militarized interstate disputes 1816-1976. Conﬂict resolution 28(4) (1984) 585–615 4. : Correlates of war. (Internet Listing) URL: http://www.correlatesofwar.org/. 5. Lagazio, M., Russett, B.: A Neural Network Analysis of Militarised Disputes, 1885-1992: Temporal Stability and Causal Complexity. In: Temporal Stability and Causal Complexity. University of Michigan Press, New Jersey (2003) 28 –62 6. Schrodt, P.: Patterns, rules and learning: Computational models of international behaviour. (Unpublished manuscript available at) URL: http://www.ku.edu/simkeds/papers.dir/Schrodt.PRL.2.0.pdf. 7. Oneal, J., Russet, B.: Prediction and classiﬁcation with neural network models. Sociological Methods and Research 4(3) (1999) 499–524 8. Mackay, D.: A practical bayesian framework for backpropagation networks. Neural Computing 4(3) (1992) 448–472 9. Marwala, T., Lagazio, M.: Modeling and controlling interstate conﬂict. In: Proceedings of the IEEE International Joint Conference on Neural Networks, Budapest, Hungary, IEEE (2004) 1233–1238 10. Habtemariam, E.A., Marwala, T.: Artiﬁcial intelligence for conﬂict management. In: Proceedings of the IEEE International Joint Conference on Neural Networks, Montreal, Canada, IEEE (2005) 2583–2588 11. Jang, J., Sun, C., Mizutani, E.: Neuro-fuzzy and soft computing: A computational approach to Learning and Machine Intelligence. First edn. Prentice Hall, New Jersey (1997) 12. Babuska, R., Verbruggen, H.: Neuro-fuzzy methods for nonlinear sytem identiﬁcation. Annual Reviews in Control 27 (2003) 73–85 13. Sentes, M., ska, R.B., Kaymak, U., van Nauta Lemke, H.: Similarity measures is fuzzy rule based simpliﬁcation. IEEE Transactions on Systems, Man and Cybernatics-Part B: Cybernatics 28(3) (1998) 376–386

Recommend Documents

Neuro-fuzzy Modeling and Fuzzy Rule Extraction Applied to Conflict ...

Supervised fuzzy clustering for rule extraction - Fuzzy ... - CiteSeerX

Fuzzy Data Mining by Heuristic Rule Extraction ... - Semantic Scholar

Rule-based Mamdani-type fuzzy modeling of skin ... - Semantic Scholar

Rule-based Mamdani-type fuzzy modeling of skin permeability