Pictorial analysis: a multi-resolution data visualization ...
-
Upload
khangminh22 -
Category
Documents
-
view
5 -
download
0
Transcript of Pictorial analysis: a multi-resolution data visualization ...
Pictorial analysis: a multi-resolutiondata visualization approach for monitoring
and diagnosis of complex systems
V.G. Grishin a, A.S. Sula a, Mihaela Ulieru b,*
a View Trends Ltd., ElectroCardioVision, Inc., 250 Chatham Way, Ste. 845, Cleveland,
OH 44124, USAb Faculty of Electrical and Computer Engineering, The University of Calgary,
2500 University Drive NW, Calgary, Alberta, Canada, AL T2N 1N4
Received 10 March 2001; accepted 27 November 2002
Abstract
This paper aims to describe the prospective methodology (Pictorial Analysis) of a
human–computer interaction capable to select and adjust visual representations for
better feature and pattern selection. As an adequate tool for on-line monitoring and
diagnosis of very complex systems which require human supervision in addition to the
computerized one, our approach exploits human capabilities of pattern recognition in
doing data analysis and selecting appropriate representations, features and class de-
scriptions by means of interactive human–computer learning. It is based on the analysis of
multidimensional relations through transforming the initial data (heterogeneous arrays,
signals and fields) into artificial pictures. Practical applications of this approach prove it
extremely useful in critical areas of safety, such as flight control, power plant moni-
toring, etc.
� 2003 Published by Elsevier Science Inc.
Keywords: Data modeling; On-line monitoring; Process safety; Data visualization;
Interactive human–computer learning; Fuzzy sets; Neural networks
Information Sciences 152 (2003) 1–24
www.elsevier.com/locate/ins
*Corresponding author. Tel.: +1-403-220-8976; fax: 1-403-282-8406.
E-mail addresses: [email protected] (V.G. Grishin), [email protected] (M. Ulieru).
URL: http://isg.enme.ucalgary.ca/people.
0020-0255/03/$ - see front matter � 2003 Published by Elsevier Science Inc.
doi:10.1016/S0020-0255(03)00044-6
1. Introduction
It has often been said that a picture is worth a thousand words or a thou-
sand numbers as the case may be. This is the basic premise behind the ap-proach outlined in this article for presenting vast amounts of process
information to operators or researchers that must solve complex decision
making problems associated with a system (plant, engine, process) model
choice as well as with monitoring, diagnosis and control tasks in industrial
systems [1,2].
Visual representations of data are very important for the automation and
control of large-scale industrial processes, such as fossil fuel or nuclear power
generation and distribution systems, as well as stand alone gas-turbine engines,powerful pumps for gas or oil pipelines, chemical, mining and metallurgical
processes and many other complex systems of different industries. In the de-
velopment of a control system for such complex processes one needs to develop
adequate models that enable quick visualization of any critical misbehavior.
Decision-making and control tasks are often organized in a hierarchical
configuration that includes direct level and supervisory level decision and
control functions. Direct level control functions are intended to respond to
disturbances and commands in the natural time-scale of the process to main-tain safe and acceptable system performance. The decision-making and control
tasks in the supervisory level can include planning for operation, process
monitoring, fault detection and isolation, and adaptive and self-organizing
control functions.
Human operators are usually not involved at the direct level of the auto-
mation and control hierarchy but they are an integral part of the supervisory
automation and control functions, especially during abnormal system opera-
tion, for example during large transients resulting from disturbances, equip-ment malfunctions and emergency operating situations [3,14].
Typically, control systems include distributed sensors, actuators and control
processors supplying a vast amount of data on control panels monitored by
human operator. Consequently, one of the important goals of control system
design is a synthesis of visual representations that can on one side underline the
distinction between normal and abnormal situations and on the other side
uniquely identify each abnormal state of the controlled process. To achieve this
goal we need an adequate model capable to outline the variables which candiscriminate/differentiate among these behaviors. These models are higher level
models than the ones for the direct level control and they do not require the
same level of precision.
In this article we introduce a data visualization technique, which we named
pictorial analysis, suitable for on-line monitoring and diagnosis of complex
systems. The method proved useful in the following circumstances which will
be exposed in the sequel:
2 V.G. Grishin et al. / Information Sciences 152 (2003) 1–24
1. in process monitoring and equipment diagnostics to specify the set of differ-
ent important states requiring human operator decision making and how
these states connect to the same parameters;
2. in direct level control to find how important ‘‘outputs’’ of the process, forexample product quantity and quality, environment pollution, etc. depend
on a system�s measured parameters, including its ‘‘inputs’’,
2. A review of data analysis tasks involved in monitoring and diagnosis and their
geometrical interpretation
2.1. Model selection for state monitoring and equipment diagnostics
For both monitoring and diagnosis the sought model usually has to connect
some limited set of process or system states, possible faults or equipments mal-
functions to the limited amount of classes fXigk of measured parameter vectorsor matrices Xi. These states, faults, etc. are selected by experts or by experimental
data to provide a required effectiveness, reliability and safety of the process.
Two types of data units are useful: primary data units and sample structures.
2.1.1. Primary data units
In the case of a static system, monitoring and control are completely spec-
ified by the current values of system parameters. If xd denotes the value of the
nth measured parameter of an system, then its state will be described by meansof a vector Xi ¼ fxdðiÞ; d ¼ 1; 2; . . . ;Dg. Possible variations of a state could berepresented by these vectors collection fXig, i.e. a matrix of generalized state.Examples of simple representations of these data vector Xi are given in Fig. 1.
These are different images of just one sample of a gas-turbine engine state.
The current state of a dynamic system is defined by some history of para-
meter dynamics (e.g. a technological process, market trends). In this case, each
state of a system, instead of a vector Xi, is specified by a data matrix Xi with a
separate row for the time series of each parameter value (temperatures, pres-
sures, flows, etc.). This matrix is ‘‘the digital image of a state sample’’ (see Fig. 2).
2.1.2. Data sample structures
Pattern recognition learning: So-called learning sample [1,5,17,20] is made of
K classes (collections) of vectors or matrices for dynamic systems ffXigkg,k ¼ 1; 2 . . . ;K, where k denotes the sequential number of the class and i denotesthe specific realization of the class. Every class consists of data vectors which
are known to be in similar system states for a particular operating condition
that is to be monitored and diagnosed. The system state can be verified by
experts, by direct possibly invasive inspection of systems, or it can be determinedfrom historical data about the process. In the latter instance, it is necessary to
V.G. Grishin et al. / Information Sciences 152 (2003) 1–24 3
learn how to distinguish between various system states by monitoring param-
eters. Every state that is to be monitored for diagnostic or analysis purposes is
represented in the learning sample by a set of its variations. Collectively, these
realizations are used to account for the variability of the process data and states.
Clustering or pattern recognition self-learning: If there are new classes of
process states, unknown in advance and indistinguishable by available meth-ods, it is necessary to automatically identify coherent classes for these states.
The learning sample is not divided into classes a priori and the analysis is
expected to indicate those decompositions that follow from the structure of the
data and make possible a meaningful interpretation of the data in terms of the
features of the process data. Once such decomposition has been determined,
the process model can be chosen or updated, the process dynamics can be
described in detail, and monitoring, detection and diagnosis functions can be
improved.
2.2. A data modeling approach for direct level control
The above mentioned discrete models of system state classification by theirobserved parameters are used for automatic control, but for a continuous
Fig. 1. (a) Linear and (b) polar (star) contour representation of 32 parameters vector image of the
current values for a gas-turbine engine.
4 V.G. Grishin et al. / Information Sciences 152 (2003) 1–24
process control and optimization the control system needs some equations as amodel. The model has to give some analytical dependencies of process ‘‘out-
puts’’, as productivity, quality, etc. from its ‘‘independent inputs’’ which can be
changed for outputs control. The modeling process goes through several stages
involving different techniques, as explained in the sequel.
Multiple regression analysis is used first to get a static model of the system by
its experimentally measured ‘‘input-outputs’’ dependencies. It could be linear
or non-linear, multidimensional, etc. [5,17,20].
Systems identification approach is used to model the dynamic system forcontrol and forecasting, as differential equations representing well the con-
nections between inputs and outputs. It could be non-stationary, non-linear
and multidimensional simultaneously. In both approaches to get an appro-
priate quantitative model we have to select its correct qualitative properties
such as structure, types, and power of non-linearity, etc. [17]. This structural
identification often starts from:
Fig. 2.
V.G. Grishin et al. / Information Sciences 152 (2003) 1–24 5
• clustering of input/output mappings of data samples calculated with the raw
model;
• a classification of relationships between these clusters for different model
structures;• selection of final model structure by matching the appropriate clusters rela-
tions.
Both clustering and pattern recognition learning prove to be extremely
useful in modeling and this is a main point where data visualization methods
can help for a structure selection of the equation model. Therefore our focus
will be on these techniques here.
3. Geometrical representation of data
Consider Xi a vector of parameters for some system state, and a point in
EðDÞ representing it. We represent values of the parameters from
Xi ¼ fxdðiÞ; d ¼ 1; 2; . . . ;Dg by means of orthogonal axes and get a D-dimen-sional space EðDÞ in which Xi is a vector or point and a class fXigk, corre-sponding to kth system state, covers some region Sk (Fig. 3) which can consistof a few unconnected regions. The goal of pattern recognition learning usuallyis to find a decision, e.g. a description of boundaries between structures Sk inEðDÞ that is effective enough to ensure a correct definition of current systemstates by new vectors or Xi which were not in the learning sample. Usually, a
part of the verified data vectors of learning sample are randomly excluded from
the learning process and are used instead to check its result by an examination
sample. A clustering approach can be useful in finding and describing unknown
Fig. 3. Geometrical interpretation; data structure in data space.
6 V.G. Grishin et al. / Information Sciences 152 (2003) 1–24
structures Sk in EðDÞ. When connected with a new systems classification
they can become in turn useful in the fault detection and identification process.
An illustration of a possible Sk complexity is shown in Fig. 3.
4. On human–computer interaction during the modeling process
Practically the modeling process (including selection and evaluation of the
most appropriate model) involves human–computer interaction even if the
approach is called ‘‘automatic’’. In these Sections we will review the most
popular approaches.
4.1. Discrimination methods of pattern recognition and clustering
In discrimination methods of pattern recognition and clustering [5] a human
being searches for some combinations of N analytical hyper surfaces (potentialfunctions, functional correspondences, etc.) which together with some discri-
minant or probabilistic decision rule provide an acceptable accuracy in de-
scribing the boundary separating different Sk from the learning sample.
Usually, it is a search of a system of hyperplanes:
fPn ¼ a1ðnÞx1 þ a2ðnÞx2 þ � � � þ aDðnÞxD ¼ 0g; n ¼ 1; 2; . . . ;Nand a decision rule defines which current vector Xi belongs to which Sk when itis in different places of EðDÞ relatively to these hyper planes.Below, we primarily consider this analysis approach as it provides the most
vivid example essential for understanding of this article. The computational
complexity C of searching for these hyper surfaces is directly proportional tothe problem dimension D, the problem non-linearity N (amount of hyper
planes in simplest case) and the number of classes K, i.e.C ¼� D Nk ðK 1Þ;
where Nk is some average amount of hyper planes for one Sk separation. Dcould be up to a few thousands (as in the case of nuclear power plant diag-
nostics), K––up to hundreds and N usually unknown and often very large.Human expert role: To build the effective discriminating function for a
complex system a human expert has to select:
• informative features providing either a way of decreasing dimension D or a‘‘linearization’’ problem, i.e. decreasing of N in a new features space and soon;
• the most appropriate features combinations forming different state patterns
which may be useful for a following decision rule selection and for human
operator monitoring;
• the structure of a decision rule, i.e. a quantity and types of hyper surfaces for
Sk boundary approximation;• initial positions of these hyper surfaces;
V.G. Grishin et al. / Information Sciences 152 (2003) 1–24 7
• the algorithm and criteria for their position adjustment in the learning pro-
cess in order to get an acceptable recognition accuracy for each learning and
examination sample.
The role of the computer: According to specialists [5], most methods ofcomputer pattern recognition are almost equitably effective, and the overall
performance of a particular approach depends most critically on the expertise of
the problem domain experts. For example, if the expert cannot successfully solve
the above listed problems of feature and model structure selection the computer
will be wrongfully guided to check a huge amount of the local extremes and will
not reach an acceptable solution. Moreover, in this case it could be very difficult
to understand the solution obtained. In any case, and especially if domain
knowledge is not enough to avoid these problems, experts usually analyzegraphical representations of data like that shown in Fig. 1(a) and (b). However,
as shown below, separately from matrix representations as in Fig. 2 they are not
effective for visual analysis of multidimensional data and complicated time se-
ries. Sometimes color matrix representations of stand-alone vectors as well as of
parameter dynamics matrix are used, which could be very profitable for this
problem solution, but unfortunately are used poorly. The reason is that the vast
majority of researchers and system developers are not oriented towards emu-
lating the human pattern recognition process because they do not understand thehuman vision system in depth and mainly do not know how to use its strength
properly. Further many researchers face a wide spread modern prejudice that
computers can do everything. At most they try to minimize the human functions
by a computer substitution. Sometimes this may work but not for the problems
considered here. As a result the abilities of the control system are decreased.
In response to the above mentioned problems we propose a methodology
able to develop pictorial representations for complicated non-linear problems
and non-stationary process data that are designed to take advantage of theunique processing and feature extraction capabilities of the human vision
system. To emphasize the high value of our method we will first discuss the
drawbacks of two existing techniques which aimed to overcome the mentioned
drawbacks by means of a human task simplification:
Artificial neural networks: One of the automated techniques that has gained
wide popularity recently is artificial neural networks (ANNs). Through train-
ing, usually formulated as a mathematical programming problem, ANNs can
learn a non-linear discrimination rule to separate features and Sk in the data.However for large data sets with strong non-linear dependencies in the data,
the networks are very difficult to train and computational time can be pro-
hibitively large. Another drawback of the ANN approach is that it is very
difficult, if not impossible, to gain insight into the clustering mechanism by
examination of the network parameters, for example the weights and thresh-
olds of the network. Therefore, it is particularly difficult for an expert to
interpret the results of the network solution and to make sense of it.
8 V.G. Grishin et al. / Information Sciences 152 (2003) 1–24
Computer search of reduced data representations: A second direction of a
data analysis enhancement is targeted on a computer search of reduced data
representations, which may simplify a visual analysis. They will be critically
analyzed in the next subsection.
4.2. Projection pursuit and visualization methods
For a choice of qualitative models and decision rules, we need to understandsomething about multidimensional spatial structures fSkg of data samples inEðDÞ and their boundaries. Because human vision can not perceive more thanthree-dimensional space structures many methods of EðDÞ and X projection
visualization have been developed. Some of them involve projecting the initial
data sample onto various planes of its main (statistically optimal) components
(this is commonly referred to as principal component analysis). Methods of this
type are linear and can only be applied to problems with D < 40–50. Theprimary restriction is that a particular data matrix is of full rank and an inversemust be computed.
The ‘‘multidimensional scaling’’ type of methods automatically arrange a set
of L ¼ jfXgj data points on a plane. This can be accomplished by optimizingsome performance function, e.g. a standard linear least squares problem that
captures the important characteristics of a sample. This reduces to the problem
of multiextremal optimization for L variables.Other methods are based on systems of the Peano-Hilbert type with em-
bedded developments (scans), which generate, for example, the density curvesof the sample vectors distributed along regular trajectories. Evidently, it is very
difficult to estimate visually the features of multidimensional, even two- or
three-dimensional structures by such a curve. Other visualization methods that
are also used can be found in [2,6], but all of them use only a small part of the
overall capability of the human vision systems.
All these type of visualization are based on the misunderstanding, that be-
cause human vision cannot see more than 3D space, we have to represent
multidimensional data structures by their 2D or 3D projections. This is notcorrect because human vision can really analyze very high dimensional struc-
tures without seeing their space by means of visual analysis of data relation-
ship. In the following we will plead for this argument.
5. Pictorial analysis
5.1. Direct visual exploration of multidimensional relationships
Multidimensional structures and relationships can be visually analyzedwithout using spatial 2D–3D projections, if to properly map X onto visible
V.G. Grishin et al. / Information Sciences 152 (2003) 1–24 9
features of two or three-dimensional pictures. Although diagrams, bar graphs
and other graphical representations of data have been used for many years [2],
they are empirical constructions useful in the estimation of a few simple fea-
tures of low dimensional data. In fact, the questions of their use for compli-cated problem solutions were not considered in depth first of all because of
absence of a quantitative model of the human visual perception and secondly
because of the very complicated connections between picture features and the
multidimensional data structures represented by them. In the following we will
discuss essential human vision capabilities which pictorial analysis exploits.
Features hierarchy: Humans are capable of distinguishing and comparing of
many pictures in parallel with hundreds of local attributes and features relating
to the shapes, textures, colors, or brightness of the images (see Fig. 4). A standalone spot on a picture or a spot cluster has certain boundary shapes which can
be segmented by simple local features such as ‘‘straight line’’, ‘‘concave’’,
Fig. 4. Plane picture analysis by human vision.
10 V.G. Grishin et al. / Information Sciences 152 (2003) 1–24
‘‘convex’’, ‘‘angle’’, ‘‘hole’’, etc. These features have attributes such as ‘‘sizes’’,
‘‘orientation’’, ‘‘symmetry’’, etc. and are connected by means of ‘‘upper-
lower’’, ‘‘left-right’’, ‘‘inside-outside’’ positioning. Local features are visually
joined in more complicated shapes as ‘‘wave’’, ‘‘leaf’’, ‘‘a face profile’’ andothers associated with real world systems and also have above mentioned and
more complicated attributes synthetically expressed easily in a linguistic
manner. This features generalization continues until obtaining a holistic image
of the spot which integrates features such as ‘‘complexity’’, ‘‘symmetry’’,
‘‘elongation’’ besides the usual size, orientation, position on the picture, etc.
Plus, color and textural properties can be described.
These attributes and features are then organized into a multilevel hierarchy
that can be partially verbalized. If a picture contains many different separateshapes, a nested hierarchy can be applied which zooms out from shapes to
clusters to groups to pictures. Additionally, combinatorial and statistical fea-
tures can be visually detected and estimated. The capability of human vision to
rapidly process through this hierarchy, searching for more details or general-
izing the attributes, allows for the simultaneous examination of many facets
of the image in terms of a variety of attributes and features.
Linguistic and formal representations: A human-linguistic description of
these features and patterns consists of some combinations of features Qi andattributes Ai connected via AND/OR operators. For example:
‘‘<middle size ðQaÞ shape> AND <with pretty symmetrical ðQbÞ aroundhorizontal axis ðAcÞ> AND <compact ðAhÞ>. . .. AND <small Qk (concave)with AiðkÞ (for example, a position)> OR Qm AND QjðmÞ AND QkðmÞ (withothers attributes)’’, and so on. Last disjunction ‘‘OR’’ describes at least two
shapes and a few of them some class of shapes. Using ranges of attribute values
we will get different classes of continuous shape variations. These descriptions
are invariant for global rotations, shifts and projective transformations ofwhole shapes as well as their parts (up to some limit). This result is in tune with
the well-proven robustness of fuzzy reasoning systems [15].
The relationships: visual features––data structures: For any representation, a
certain shape sample is a point in the space EðDÞ. A shape pattern including itsvariations covers some region around this point. In general, these regions for
different shapes are not similar, convex or symmetrical because even for simple
contour displays as in Fig. 1(a) and (b) the stability of a complicated feature�svisual perception depends on the relationships among its components andbetween them and the surrounding features. For example, if the values of
neighbor parameters Xk, Xkþ1, Xkþ2 and Xkþ3 in Fig. 1(b) form a ‘‘concave part’’,
this will be visible so for Xk > Xkþ1 < Xkþ2 < Xkþ3, for Xk > Xkþ1 > Xkþ2 < Xkþ3and others of their combinations depending on the current values of each
parameter, on the differential and absolute thresholds of vision, etc. This
subjective factor is quite well encapsulated linguistically as presented in the
previous subsection. For the corresponding color matrix representations of
V.G. Grishin et al. / Information Sciences 152 (2003) 1–24 11
structures in data space EðDÞ even simple patterns are more complicated anddepend on mapping of Xi or Xi into picture.
The exceptional flexibility and power of visual features hierarchy is the best
argument for using our approach. It allows detecting and describing verycomplicated high dimensional non-linear system models and structures Sk inEðDÞ depending on the appropriate mapping of initial data onto pictures.
5.2. Visual representation classes
There are several possibilities of mapping fXig into pictures fPig. Eachrepresentation has its own advantages and disadvantages and will fit someproblem classes well, but an estimation of their capabilities is a difficult task.
For example, at first glance, it seems promising to map each data parameter
value into some visible feature attribute to distinguish essential Sk by con-ventional pictures. For example [8], statisticians tried to find such mapping of
patients parameters into features of face parts which map ‘‘smile’’, ‘‘quiet’’
faces for healthy humans and ‘‘sad’’, ‘‘angry’’, ‘‘bad’’ ones for different dis-
eases. During a decade, many attempts were made to develop and apply this
technique, but these attempts were not successful. Why? An analysis shows thatstructures corresponding to ‘‘smile’’, ‘‘sad’’ or other expressions of faces in an
Euclidean space of their features are not simple and often consist of a few
unconnected regions with complicated unknown boundaries. Consequently,
this approach requires finding effective ways of mapping unknown complicated
Sk structures of data into other complicated non-linear also unknown structures
of facial expressions. Researchers using this approach found some essentially
non-linear representations of a few classes identifying different facial expres-
sions through certain learning samples but did not check if these representa-tions were validated by the examination samples.
Our approach is based on the analysis of multidimensional relations through
transforming the initial data (heterogeneous arrays, signals and fields) into
artificial pictures fPg. It exploits human capabilities of pattern recognition indoing data analysis and selecting appropriate representations, features and
class descriptions by means of interactive human–computer learning––the very
essence of ‘‘pictorial analysis’’ [9].
5.3. Human–computer interaction in pictorial analysis
The concept: The essence of our approach is to conduct visual analysis prior
to a computer analysis. No data is rejected without either a priori knowledge
about its uselessness for following model choice or the same assurance from in
depth visual analysis. Computer methods based on reduced data representation
(such as projections or principal component analysis, as well as pattern rec-ognition learning or model identification) have to be directed by holistic images
12 V.G. Grishin et al. / Information Sciences 152 (2003) 1–24
of the system�s states figured out via a full visual analysis of all data. Fortu-nately, modern monitors display simultaneously more than 4096� 4096 in-dependent color dots (pixels) and with color printing these values could be tens
times more on the same display size. Consequently, complete information oftens and hundreds of vectors X with dimension D up to many thousands can besimultaneously represented on the screen by separate pictures via the ‘‘matrix
representation’’.
Criteria for choosing a representation for visual analysis: Color matrix rep-
resentation is the most effective choice to start feature and model selection for
monitoring of highly dimensional, non-stationary systems (such as power
plants). In Fig. 5, visual representations of data are hierarchically classified
according to the power of their usage of different sub channels (sub modalities)of vision as well as the dimension and connectivity of formed images. It is very
important to correctly choose effective mapping of each system’s parameter intopicture features. It could be not only matrix cell color-intensity or star ray
length as on Fig. 1(b) but as well other local and integral features of a picture,
as listed in Fig. 4. A complexity of connection between structures in the state
Fig. 5. Static visual representations of data.
V.G. Grishin et al. / Information Sciences 152 (2003) 1–24 13
space EðDÞ and visually selected features Q depends on this mapping. This
connection can be fairly complicated even for the simple stars and matrix
representations of static vectors. Therefore, to effectively apply visual analysis
results to pattern recognition, identification and other model selection, we haveto do an analysis for extracting representations providing a well interpretable
connection of EðDÞ with a picture space P ðQÞ.From visual perception psychology it is known that on the average it is best
to equally share data between sub modalities, features and attributes of a
picture and to represent with each of them no more than 5–7 different values of
some data parameter. This restricts the representation choice.
5.4. Human and computer roles in pictorial analysis
The role of the human: We refer to Fig. 6. In general, at each step of the
dialog the human-researcher performs the following tasks:
• specifies a current representation and its parameters (for example, color
scale, an order of Xi coordinates placement in matrix cells or along contour,
etc. . .), as a set of vectors for on screen representation and may be some oftheir previous state transformation of X (for example, statistical or spectral,for signals);
• visually compares displayed pictures P ðiÞ and selects features separatingsubgroups of Pi;
• builds descriptions Qfqg allowing to distinguish classes fPig of the learningsample, corresponding to different system states or equipment malfunctions;
• determines informative Pi clusters, if this is a self-learning, and a search ofnew important system states, dangerous situations and so on;
• selects a representation change to enhance a picture�s visibility and a class�sseparation;
• checks with the computer the qualitative model selected.
The role of the computer: A computer has to process the human requests by:
• data transformations and analyses (if needed);
• data representations and their possible computational enhancement;
• creation and development of selected representations, fqg and fQg databases;
• testing these features and descriptions for the separation of a class fPig inthe feature space EðQÞ;
• automatic testing of selected models, discriminating decision rules and their
quantitative adjustment by means of conventional pattern recognition or
identification algorithms.
Through an iterative approach of these steps, the most effective pictorial
representation, i.e. the one that leads to the most effective set of discriminants,
is selected. Indeed, any reliable a priori knowledge should be used to facilitatethe choice of the above transformations and the corresponding features.
14 V.G. Grishin et al. / Information Sciences 152 (2003) 1–24
However, to get effective algorithms for real-life control problems, the ob-
tained fqg and fQkg have to be from pictures well interpreted in terms of thefXigk data structures in EðDÞ (recall Section 3). This is not always possible.Because of the subjective nature and fuzziness of human perception and verbal
descriptions of the patterns recognized by classification in picture classes, theirmathematical expression or even algorithmic approximation can be very dif-
ficult. A computer model will be derived from a pictorial analysis in any case
and if it does not provide a complete problem solution, a control system has to
use it together with human-operator monitoring to cover those detected rep-
resentations, situation features and descriptions that could not be modeled. Of
course in this case the control system has to have a database of standard
patterns of important situations and human operators have to be trained for
visual comparison of these patterns with an image of the current situation.
Fig. 6. Pictorial analysis.
V.G. Grishin et al. / Information Sciences 152 (2003) 1–24 15
5.5. Main issues in pictorial analysis theory
In Fig. 4 possible structures of Q were described. Their search complexityCðQÞ depends on the number of inherent features, their attributes, relationsand logical operations. A subset of the shortest descriptions fQ : CðQÞ ¼ Cming exists for each sample fXg and human can construct descriptions of thecomplexity no larger then some l-max in reasonable time. Therefore, at leastfor comparison with considered conventional methods, the first question of our
theory is: ‘‘What structures of data can be detected and described by visual
perception on different representations in a reasonable amount of time?’’ The
answer requires:
• human vision modeling, i.e. mathematical definition of the human-used al-phabet fqg, their attributes and relations frðqðiÞ; qðjÞ; . . . ; qðzÞÞg;
• search of the sets of probable descriptions of various pattern structures
(properties, characteristics), of an estimation of different Q search time;• analyze which structures in EðDÞ are corresponding to different q, fqg, Q forspecific pictorial representation and dependencies of CðQÞ-min on sample Xproperties.
Unfortunately the decades of work by many scientists have led to the
construction of formal models of only some specific features of perception forvery simplified stimulus. After years of experimental research and modeling of
human vision capabilities of different pictorial representations analysis we have
just a partial solution to estimate learning prospects and results for different
data structures [1]. Fortunately, this only limits the algorithmic implementa-
tions of the above analysis for automatic control but not for a human-operator
process monitoring. If stable and distinguishable patterns of important situa-
tions were found, they could be used for visual system monitoring without
knowledge of corresponding fXig structures in EfDg. Moreover, this knowl-edge is not needed to figure out what real events in a system generate different
features and patterns of its states. These relationships usually have a simpler
interpretation than with EðDÞ. So we also get a system model, which gives thevisual and verbal description of system�s states by its parameters.A few decades of our experience has shown the pictorial analysis always helps
to better understand a controlled process and always gives information about
data structures for enhancement of regression, pattern recognition and identi-
fication models, via this approach and using an appropriate dialogue interface.
5.6. Methodology of pictorial analysis
The essence of our methodology of pictorial analysis is based on sequential
looking through some contour and matrix representations specified by the
problem, data, above considered criteria and principles and intermediateanalysis results.
16 V.G. Grishin et al. / Information Sciences 152 (2003) 1–24
Recalling Section 4.2, the simplest data, containing a few tens of vectors Xi
(L < 100) with up to 30–40 parameters (D < 40) can be visualized and com-pletely analyzed with only simple contour representations as illustrated in Fig.
1(a) and (b). However, even for this simple case, a linear display (Fig. 1(a)) isworse than a polar one (‘‘star’’) which has two dimensionally organized con-
nected shape more closely resembling natural shapes than the linear ones and
therefore allowing to better estimate and compare integral features as sym-
metries, elongations, inside figure relations, etc. (See Fig. 4). This stands even
more for greater D and L. Therefore in the sequel we will consider just onecontour display––the star.
We have to visually compare stars of a few vectors of one state with stars of
another state and to look for features, state patterns and their descriptions todistinguish these states with needed accuracy. Logical combinations Q of
shapes features [1] can probably describe any structures in EðDÞ, but forpractice it is important to know which structures in E for the given represen-tation can be described by sufficiently short combinations of features and at-
tributes known to be easily found in a reasonable time. For stars, we
experimentally found that Q from 10–15 features can be selected by the spe-cialist for a few hours on average and this is usually one of the shortest problem
solutions [1,3]. Due to the flexibility and diversity of visual features and de-scriptions, a set of Q with this length can cover a majority of real problems.Practically, many multidimensional problems for which we have to start the
data analysis from the color matrices, as in Fig. 2, can be reduced to their small
subspace visualization by contour displays [1].
For ‘‘stars’’ to interactively search the state�s patterns by the means of datarepresentation, the following main tools can do it:
• Mapping of xi in to ray length should map the most important xi value rangesinto the best visible ray changes. It could be specific non-linear mapping foreach xi.
• Coordinates permutations of Xi on the star permits to make shorter Qfor structures with the long initial descriptions, and this is a main goal of
human–computer dialogue, i.e. this is the synthesis of the better ‘‘visual’’
representation.
• Origin shifts: For example, the stars of EðDÞ vectors laying in tubes aroundstraight lines crossing coordinate axes near their origin will have similar
shapes because of close coordinates xi proportions. Therefore, by joiningthe origin with the ends of different Xi, we can detect all sample tubes pass-
ing near Xi and all other tube structures in EðDÞ by moving the origin to an-other Xi. Other easily visible features, which can be detected by means of
their appropriate mapping, are shape transformations such as shifts, rota-
tions, etc.
Matrix representations: Even built from data samples consisting of 20–30
vectors Xi, a matrix representation, as on Fig. 2, can be a fruitful supplement to
V.G. Grishin et al. / Information Sciences 152 (2003) 1–24 17
stars in order to map all vectors in to one matrix. The matrix gives a pattern of
all learning sample giving some general connections between its vectors, which
could be useful for star parameters choice. Sometimes, this technique will yield
the problem solution. Three interactive tools were applied for a solution visi-bility enhancement: a selection of effective mapping xi values into gray inten-sities of cells, binarization of these intensities for further application of special
algorithms and then permutations of rows and columns for picture quality
criteria maximization by these algorithms. We proved that for most samples
fXg one of these algorithms, based on the analysis of X graph structure, en-sures the improvement of picture connectivity up to L D times by permu-tations of columns and rows of the picture [10].
When D > 50 and/or L > 100 the stars shape can be too complicated to startanalysis from them. In this case, the matrix representations obviously have
Fig. 7.
18 V.G. Grishin et al. / Information Sciences 152 (2003) 1–24
priority because they allow simultaneous illustration of a large amount of in-
formation on one screen. If we were to represent all realizations fXigk of each kstate of a system by a separate matrix, these state�s portraits would show theirinternal variations and stable patterns, from which we have to interactivelyselect patterns and features for state�s separation.Except for the most important tool of dialogue, (e.g. considered above matrix
elements permutations) it is important to consider the mapping of xn values intogray scale or colors [11], picture improvement by smoothing, edge sharpening,
and many other methods (see Fig. 7). On these pictures, besides the contour
features, many other ones appear (textural, statistical, space spectral, color,
etc.). The dependencies between data structures and picture properties become
considerably more complicated in this case. Simultaneously, the possible deci-sion variety increases many times. Each choice of state�s patterns and theircomparative variations estimations, i.e. for static systems some general vector
Xk for each state, can be more precisely compared by means of stars again.
For dynamic systems, when an image of a state sample is already a matrix of
parameter dynamics, many of these matrices represent each state variation. To
detect state patterns, other informative features and descriptions of the same
methods of a representation adjustment are applied, except column permuta-
tion to avoid time dependencies destruction [2,12,13].For time series, and especially vibrations and acoustic signals for equipment
diagnostics, the spectral analysis is an additional powerful tool for the detection
of useful information in noisy data/environments. The human–computer-
interaction goal is to find the appropriate basic functions, intervals, weights,
etc.
6. Critical areas of application where pictorial analysis is most suitable
6.1. Visual system monitoring and diagnostics
After obtaining all possible information for the control model choice, the
state patterns, features and visual decision rules can be applied for human
operator decision-making. These results of a pictorial analysis usually containthe information, which cannot be exhausted by algorithms and formal models
due to its fuzziness and absence of appropriate models of human perception.
Only the subjective character of this fuzzy information can encapsulate the
complexity of a visual representation required by monitoring and control in
complicated situations. From the answers to a survey done in the Soviet Union
(in 1986–87, after the Chernobyl�s disaster), leading developers and chief en-gineers of nuclear power plants considered as either impossible or inexpedient a
complete automation of the following functions, for which pictorial analysisproves very promising:
V.G. Grishin et al. / Information Sciences 152 (2003) 1–24 19
• transient modes, preparation and initiation of an automatic mode, start–
stops;
• choice of reserves (mechanisms or channels) and the majority of tasks deal-
ing with system structure changes;• account of non-formal characteristics of a situation (technical, natural and
others) and corresponding corrections of control;
• control of effectiveness in normal plant functioning, in particular at the cost
of its ‘‘individuality’’;
• early diagnostics, forecasting and troubleshooting of different types of faults
in process equipment (e.g. small leaks, vibrations induced by mechanical
failures in rotating equipment, actuator or sensor failures) and control sys-
tem malfunctions; preventing• equipment failures and emergency situations through proper recognition
of abnormal operating conditions and the timely initiation of appropriate
decision and control strategies, etc.
Each of these tasks requires decisions involving a human operator for the
information and control loops of the process. In order to present the state
patterns obtained by pictorial analysis and simultaneously represent a current
situation image generated by the same representation, trained operators can
match real-time patterns with the templates for fault detection, diagnosis orcontrol tasks [3].
6.2. Conventional human operator decision making
Today, the design of information displays (a combination of tables, text,
hard panels, etc.) requires an operator to solve many complex formal-logic typeproblems in order to reach the desired conclusion. For example, consider the
problem of an operator attempting to estimate the operating state of an in-
dustrial process from real-time plant data in order to make a control decision.
Information and decisions might include: Valve A is open, pressure B is in somedefinite range, temperature C is decreasing, etc.; this means that variable Tneeds to be decreased to maintain acceptable operating performance. Fuzzy
control [18] has proven extremely useful in dealing with this kind of problems,
however there are cases when the human operator is needed to make the finaldecision (especially in high risk situations). This is often cumbersome and
difficult for human operators, especially during abnormal operating conditions
of a complex non-stationary process. By transforming plant data via pictorial
analysis and representation, decision making by the operator is enabled in the
image domain.
It is well known from engineering psychology investigations that operators
cannot simultaneously handle more than 7–12 variables for logical decision-
making tasks. For more complicated problem solving, either a hierarchy (tree)of conventional windows and/or an expert system is required. The first en-
20 V.G. Grishin et al. / Information Sciences 152 (2003) 1–24
hancement to the operator information/decision making system requires that
the operator looks through many levels/resolutions of windows to obtain rel-
evant information. The operator is required to search through various bran-
ches of a tree-structured database. This increases the decision making time andvery detailed operator training is required.
By means of a pictorial representation, an operator can perceive the current
situation as a holistic image and quickly estimate its general state. Typically, an
operator works in a supervisory mode about 40% of the time, i.e. he observes
whether ‘‘all is in order’’ or if it is necessary to interfere with the process. If so,
he/she can compare a current image with standards by means of learned fea-
tures and rules and make a decision. When more complicated situation occurs,
then this approach could be more profitable because the pictorial analysisforms an open semiotics system of situation descriptions [16]. It can generate
patterns and descriptions for states unrepresented in the learning sample.
Pictorial features obtained in the analysis together with interpretation of their
connections with real events in the system create a system of a pictorial de-
tection of new combinations of these features and their generating events. They
can be easily integrated with fuzzy control systems to enable human inter-
vention whenever needed. We are currently working on the development of a
system that integrates pictorial analysis within a fuzzy control software [19].Pictorial methods of data representation, recognition and analysis can be
used as a tool to interact with domain experts and also to incorporate the
history of past operational data into the knowledge base of the expert system.
Fuzzy expert systems primarily use verbalized expert knowledge and heuristic
expert skills and ‘‘know-how.’’ Difficulties often encountered include differ-
ences of opinions from two experts commenting on the same aspect of a be-
havior and the difficulty for a novice in the problem domain, the expert system
builder for example, to ask the correct questions to obtain the necessary in-formation to construct an appropriate knowledge base. Pictorial representa-
tions, state patterns and decision rules give new tools and such an enhancement
to the knowledge base of an expert system may help to increase the eventual
practical uses of such systems in real-time industrial decision-making and
control problems.
7. Pictorial functional diagnostics for gas-turbine engines (GTE)
We have applied pictorial analysis to solve the following problems: electro-
and phono-cardio diagnostics; early prediction of throwing of oxygen steel
converter by acoustic noises: checking of cutting regimes of mining machines;
search of informative features of dolphin signals; clustering of geological
sample; visualization of seismic cuts for geophysicists; pictorial diagnosticsof nuclear power plants (not published due to confidentiality agreement
V.G. Grishin et al. / Information Sciences 152 (2003) 1–24 21
bindings.) In the following we present the most relevant application which
emphasizes the radical advantages of pictorial representations for non-sta-
tionary process monitoring. Binded by non-disclosure agreements we cannot
give as many details as we would like to.Purpose: Early preventive functional diagnostics and forecasting of GTE
technical state during flight operation.
Basic Method:
• Pictorial representation of GTE thermal, gas and dynamic parameters (op-
erating process parameters) in transient mode of flight; current fuel con-
sumption; oil system parameters; positions of passage section controlled
elements and engine controls.
• Visual analysis and comparison of pictorial representations of these processdata by trained human-operator on the basis of special methodology and
supporting expert system.
• Decision-making, checking and verification by the technician using an ex-
pert system and knowledge base.
Advantages: Pictorial visualization allows a technician to effectively use the
human vision system capabilities to analyze the transient modes of the engine in
repeated flights. As a result, it is possible to observe the effect of incipient failures
through observable symptoms of the malfunctions in the transient modes ofengine operation. Therefore, improvements in the reliability of diagnostic
conclusions were achieved as compared to conventional analysis techniques.
Performance evaluation:
• Average time of obtaining a diagnostic conclusion: 3–5 min.
• Diagnostic period: 1–3 flights.
• Reliability of correct classification using deviations from the norm which are
expressed by recorded parameters (probability of correct detection of symp-
toms) better than 0.99.• Systematic reliability of early detection of malfunction 0.8–0.95 (probability
of detecting a malfunction no less than a flight before the critical event).
• Forecasting interval: 1–5 flights.
• Using the technology of a pictorial dialog allowed for the early diagnosis of
engine malfunctions with almost 100% reliability resulting in timely mainte-
nance of the engine, which significantly increased to engine�s availability.Practical results: The diagnostic system was implemented on one type of by-
pass GTE that had fuel control equipment with an electronic corrector, anadjustable compressor and an adjustable nozzle. Real-time measurements were
recorded for ten parameters of the process. After the development and im-
plementation of the pictorial functional diagnostic system for interflight pro-
cessing of the recorded data in a ground-based unit, the following results were
obtained:
• in the early stages, before 1–20 engine hours-to-failure, 80% of the operating
malfunctions were detected;
22 V.G. Grishin et al. / Information Sciences 152 (2003) 1–24
• preprocessing of the data and the use of correlations practically eliminated
false detections: the probability of a false detection was decreased by 15
times;
• the average flight time till in-flight failure was increased by 40%;• seven new types of malfunctions were detected and classified during opera-
tion; this establishes the high adaptability of the system to any deviations
from the norm;
70% of the detected malfunctions were correctly forecasted for 3–7 flights.
8. Conclusions
The essence of our approach is to conduct visual analysis prior to a com-
puter analysis. No data is rejected without either a priori knowledge about its
uselessness for following model choice or the same assurance from in depth
visual analysis. Computer methods based on reduced data representation (suchas projections or principal component analysis, as well as pattern recognition
learning or model identification) have to be directed by holistic images of
the system�s states figured out via a full visual analysis of all data. The methodhas proven useful in several critical domains. Current work is focused on the
integration of our method within a fuzzy control software [4,7,19].
Acknowledgements
Our most grateful thoughts go to Dr. A.S. Soula, Chief Developer with
View Trends Ltd., who led the work for the gas turbine engine diagnostics
presented in Section 7.
References
[1] V.G. Grishin, Pictorial Analysis of Experimental Data (in Russian), Nauka Publishing,
Moscow, 1982.
[2] P.C. Wong, R.D. Bergeron, 30 Year of Multidimensional Multivariate Visualization, in:
Scientific Visualization––Overview, Methodologies, Techniques, IEEE Computer Society
Press, 1997.
[3] V.G. Grishin, CAM operator functions and pictorial representation of information, in:
Preprints IFAC/IFIP/ Int. Conference on Analysis, Design & Evaluation of Man–Machine
Systems, vol. 2, 1988, pp. 422–425.
[4] D.J. Guilmore, Interface design: Have we got it wrong?, in: Human–Computer Interaction:
Interact �95, Chapman & Hall, London, 1995.[5] V.N. Vapnik, The Nature of Statistical Learning Theory, Springer, London, 1995.
[6] J. Foley, A. van Dam, S. Feiner, J. Hughes (Eds.), Interactive Computer Graphics: Principles
and Practice, Addison-Wesley, Reading, MA, 1995.
[7] B.M. Velichkovsky, D.M. Rumbaugh (Eds.), Communicating Meaning: The Evolution and
Development of Language, Lawrence Erlbaum Assoc, Mahwah, NJ, 1996.
V.G. Grishin et al. / Information Sciences 152 (2003) 1–24 23
[8] H. Chernoff, The use of faces to represent points in k-dimensional space graphically, J. Am.Stat. Assoc. 68 (1973) 361–368.
[9] V.G. Grishin, Multivariate Data Visualization for Qualitative Model Choice In Learning
Systems, in: Proceedings of IEEE ISIC/CIRA/ISAS International Symposium on Intelligent
Control/Systems, 1998, pp. 622–627.
[10] L.L. Verin, V.G. Grishin, Algorithm for interactive forming matrix data representation and
estimation, Pattern Recogn. Lett. 4 (1986) 193–200.
[11] V.G. Grishin et al., The Color Coding Models For Data Matrix and Scalar Fields (in Russian),
Institute of Control Sciences Publishing, Moscow, 1991.
[12] V.G. Grishin, Dynamic spectroscopy in problems of visual analysis and identification of
compound acoustical signals, Automat. Rem. Control (2) (1967) 71–77.
[13] SAP AG Collaborates With Human-Intelligence Scientists on the Management Cockpit for R/
3, Press Release of SAP AG, Walldorf, Germany, 30 March 1998.
[14] D. Roverso, Soft computing tools for transient classification, Inform. Sci. 127 (2000).
[15] M. Ulieru, in: J. Baldwin (Ed.), Fuzzy Logic, Fuzzy Logic in Diagnosis: Possibilistic Networks,
invited Chapter, John Wiley and Sons, 1996, ISBN 0471 962813.
[16] A. Meystel, Semiotic Modeling, 1995.
[17] T. S€ooderstr€oom, P. Stoica, System Identification, Prentice Hall, London, 1989.
[18] H. Hellendoorn et al., Fuzzy Control, Springer Verlag, 1994.
[19] TransferTech, 2000 �––Fuzzy Control Manager, Version 5.1, User�s Manual, TransferTechGmbH, Germany www.transfertech.de.
[20] M. Ben-Horim, H. Levy, Statistics, Random House, New York, 1984, p. 870.
24 V.G. Grishin et al. / Information Sciences 152 (2003) 1–24