Post on 21-Feb-2023
ENABLING KNOWLEDGE CREATION THROUGH ASSOCIATIVENETWORKS AND SEMANTIC WEB TECHNOLOGIES
L.Lella1,A.F.Dragoni1, G.Giampieri2
1D.E.I.T., Università Politecnica delle Marche, Ancona, Italy2A.R.C.H.I. – Advanced Research Center for Health Informatics, Ancona, Italy
ABSTRACTNowadays the organizations operate in an evolving and dynamic environment and to keep anadvantage over the other companies they need new management techniques as well as improvedapplications, where inter-operability can support innovative thinking.The goal of these tools is twofold. To improve the information retrieval processes and the finding ofcompetencies within the organization in order to establish the conditions for the creation ofknowledge and the generation of communities of practice.This objective cannot be resolved without flexible knowledge representation structured asassociative networks, which are capable to define user models, i.e. representations of theirknowledge, their competencies and their objectives.But this subsymbolic representation must be formalized to optimize the search of information. Wethink that, for knowledge management purposes, the better way is to reify the employees’conceptualizations of the read documents. The obtained representations of the knowledge acquiredevery day by the workers can be further improved by the use of existing ontologies.We provide some examples of researches performed with the aid of these representations.
1.INTRODUCTION
The growing interest towards the knowledge management field is justified by the new issues and
challenges that the actual organizations have to deal with (Smith and Fatquhar, 2000).
First of all companies are extending worldwide, leading to the establishment of a collaborative
space that is now more virtual than physical. This inevitably creates new problems in the
collaboration between peers that have to be solved in order to keep efficient the entire organization.
Secondly the intellectual capital of a firm has become an important differentiation factor with
respect to the other organizations. While in the 70s a key differentiator was the optimization of the
working environment, in the 80s the quality, in 90s the customer satisfaction, nowadays an
organization must be able to learn quickly and continuously from the environment where it operates
and must be capable of reusing efficiently the acquired knowledge to sustain competition.
Luckily information technologies through Internet, intranet and Web technologies, have all the
necessary potentialities to capture, share and leverage the knowledge of a company.
But they have to be used in the right way (Stenmark, 2003). This substantially means that the
implemented solutions must consider all the principal characteristics of knowledge.
The nature of knowledge has been well debated and investigated in literature (Nonaka, 1994;
Nonaka and Takeuchi, 1995; Alavi and Leidner, 2001).
Knowledge evolves continually. It is used to reflect upon data and information and when these have
been interpreted a new state of knowledge arise.
Moreover researchers in the field of KM seem to confirm the subjective and specific nature of
knowledge that adapts itself to every situation to which it is applicable. So, despite of the attempts
to manage knowledge centrally in organizations, knowledge is decentralized and it is best managed
in the area in which it is created and exchanged.
So, from these first considerations we can conclude that knowledge is dynamic, contextualized and
goal oriented and therefore the modern tools and systems for the creation of knowledge have to take
into account these principal features.
It must also be considered that organizations are particularly interested in the acquisition of the so
called “tacit knowledge” (Polanyi, 1967), a particular kind of knowledge deeply rooted in individual
actions and experiences that is very difficult to formalize.
According to Polanyi tacit knowledge is linked to the act of focusing the attention on particular
perceived aspects of reality. He distinguishes (Polanyi, 1975) between “proximal” and “distal”
aspects (by the analogy with the adjectives “proximal” and “distal” taken from anatomy) of
attention by stating that the proximal aspects are the ones we associate with ourselves and with our
perceptions, while the distal ones comprehend the world beyond ourselves.
When solving a comprehension or a recognition problem, the human cognitive processes of
association relate proximal situational knowledge through a process of cognition, selection and
mapping, which can trigger a tacit leap to distal situational knowledge.
The identikits used by police work exactly in this manner. A collection of different eyes, noses,
faces and ear-shapes that represent the proximal situational knowledge can serve as cues to compile
an identikit of the face of the searched person which is the distal situational knowledge.
This example made by Polanyi suggests that tacit knowledge should be structured as an associative
network, i.e. a graph that links nodes, representing proximal situational knowledge aspects through
weighted and not labelled links. Within this framework the distal knowledge should be related with
the most activated part of the network.
For example, within a knowledge management system, the distal knowledge represented by the
searched organizational documentation or the employees having the requested skills for a given task
should emerge from the proximal knowledge represented by the description of the situational
context (goals, requested competencies etc.).
The representation of the tacit knowledge through a formal language could facilitate and otpimize
the processes of information retrieval and the search of competencies. But this formalization should
follow the dynamic process of knowledge acquisition in order to achieve better results.
2.KNOWLEDGE NETS
The first problem to be solved is to find the most suitable representation schema for knowledge.
Classic knowledge representation forms like semantic networks, frames and scripts are static
(Kintsch, 1998b). Instead, as we have just explained, human mind generates contextualized
structures that are adapted to the particular context of use.
Given the fact that we are interested in systems capable to acquire knowledge from textual data, we
have choosen a psychologically valid model of discourse’s comprehension in order to update a
dynamic representation of the extracted knowledge.
“Networks of propositions” or “knowledge nets” (Kintsch, 1998a) are a formalism that combines
and extends the advantages of classic representation forms used in AI.
As can be seen in figure 2.1, networks of propositions link atomic propositions consisting in
predicate-argument schemas through weighted and not labeled arcs. According to this formalism the
meaning of a node is given by its position in the net.
From a psychologic point of view only the nodes that are active (i.e. that are maintained in the so
called working memory) contribute to specify the sense of a node. Hence the meaning of a concept
is not permanent and fixed but is built every time in the working memory by the activation of a
certain subset of propositions in the neighbour of the node that represents the concept. The context
of use (objectives, accumulated experiences, emotional and situational state etc.) determines which
nodes have to be activated.
Figure 2.1 – An example of network of proposition knowledge representation.
For the definition of retrieval modalities Ericsson and Kintsch have introduced the concept of long
term working memory (LTWM) (Kintsch, Patel and Ericsson, 1999).
According to this model the working memory, that can be seen as a repository where all the
knowledge for the correct reconstruction of the meaning of the analysed information is stored, can
be subdivided in a short term part (STWM) that has a limited capacity and a LTWM that is a part of
the long term memory represented by the network of propositions. The content of STWM
automatically generates the LTWM, because some objects present in the STWM are linked to other
objects in the LTM by fixed and stable memory structures called “retrieval cues”.
The approach of the network of propositions yields two project problems. The creation of the
LTWM and the activation of LTM nodes, i.e. the creation of the retrieval cues.
Kintsch has developed two methods for the definition of the LTWM.
The first, defined with Van Dijk (Van Dijk and Kintsch, 1983), is a manual technique that cannot be
automatized.
The second is based on the latent semantic analysis (LSA) (Landauer, Foltz and Laham, 1998). This
technique can infer, from the matrix of co-occurrence rates of the words, a semantic space that
reflects the semantic relations between words and phrases.
But this latter solution lacks of precise criteria for the correct definition of the LTWM by the
retrieval of concepts from the semantic space.
After the creation of the LTWM the integration process begins i.e. the activation of the nodes
correspondent to the meaning of the phrase. Kintsch developed a diffusion of an activation signal
procedure that is a simplified version of the one developed by McClelland and Rumelhart (1986).
Firstly an activation vector is defined whose elements are indexed over the nodes of LTWM. Any
element’s value is “1” or “0” depending on the presence or the absence of the corresponding node in
the analyzed phrase (i.e. in the STWM). This vector is multiplied by the matrix of the correlation
rates (the weights of the links of the LTWM) and the resulting vector is normalized. This becomes
the new activation vector that must be multiplied again by the matrix of the correlation rates. This
procedure goes on until the activation vector becomes stable. After the integration process, the
irrelevant nodes are deactivated and only those that represent the situation model remain activated.
3.AN ALTERNATIVE IMPLEMENTATION OF THE LTWM MODEL
The adoption of a network of propositions for the knowledge representation presents certainly great
advantages in comparison with the classic formalisms. While semantic networks, frames and scripts
organize knowledge in a more ordered and logical way, the networks of propositions are definitely
more disorganized and chaotic, but present the not negligible advantage that are capable to vary
dynamically not only in time, on the basis of the past experiences, but also on the basis of the
perceived context.
But the limits presented by the technique worked out by Kintsch and Ericsson have lead us to define
a different implementation of the LTWM model.
Unfortunately the lack of adequate textual parsers able to convert the paragraphs of a text in the
correspondent atomic propositions has driven us to develop, at least in this initial phase of the
project, simple dynamic models of associative networks of words.
Figure 3.1 - A possibile architecture of a system for the dynamical acquisition of knowledge.
The part of the document that is analysed, which is stored in a buffer, must be codified on the basis
of the context before being elaborated by the working memory block. The context represents the
theme, the subject of the processed text and for its correct characterization is necessary to take into
account the representation of the textual information (textbase) together with a part of the
knowledge acquired from previous documents (Long Term Memory), that can be retrieved by an
activation signal diffusion procedure similar to that one developed by Kintsch.
For the implementation of the working memory block, self organizing networks with suitable
procedures for the labeling of their nodes could be used, but this solution requires a lot of
computational time, especially for the analysis of entire repositories of documents. So we
considered alternative models based on the theory of scale free graphs for the implementation of an
associative network.
Recently it has been found that human knowledge seems to be structured as a scale free graph
(Steyvers and Tenenbaum, 2005). Representing words and concepts with nodes, some of these (the
hubs) establish much more links compared with the other ones.
This particular conformation seems to optimize the communication between nodes. Thanks to the
presence of the hubs, every pair of nodes can be connected by a low number of links in comparison
with a random network with the same dimensions.
The definition and the eventual updating of a scale free network does not require a lot of time and
the execution of particular processes, as the diffusion of the activation signal, is very fast.
The textual analysis is performed through the following steps.
The new text is analysed paragraph by paragraph. The buffer contains not only the words of the
analysed paragraph, but also words retrieved from the long term memory using the diffusion of the
activation procedure where the activation signal starts from the nodes in the LTM that represents the
words in the paragraph.
The buffer, the working memory and the activated part of the LTM block can be compared to the
LTWM defined by Kintsch and Ericsson.
During the acquisition of the content of the paragraph a stoplist of words that must not be
considered (as articles, pronouns etc.) is used.
For any word in the text, the paragraphs where it has appeared, or where it has been inserted after
the retrieval procedure, are stored. When the entire text has been parsed and the data of all the N not
filtered words have been memorized, the formation of the scale free graph of words begins within
the WM block.
The LTM is an associative network that is updated with the content of the WM. Whenever a link of
the WM corresponds to a link present in the LTM, the weight of this one is increased by “1” while
other links that are not present in the LTM are created from scratch.
Since the scale free network that represents the content of the WM is used to update the content of
LTM, this associative networks keeps the form of a scale free graph (Dragoni, Tascini, Lella and
Giordano, 2004).
The semantic validity of the LTM representation has been proved by comparing it with other
associative networks obtained from human subjects who read the same textual information (Licata,
Lella and Giordano, 2004).
4.THE KNOWLEDGE MANAGEMENT SYSTEM
The knowledge repository created through the model of discourse’s comprehension can indeed be
used to create a cognitive model of an employee.
The knowledge acquisition module can mine the textual content of the document daily visioned by
the workers of an organization as e-mails, newsgroups, reports and other documents in digital
format. The resulting representation can be considered as a good approximation of the knowledge
acquired by the employer’s, since the analysis takes into account not only the context, i.e. the
particular semantic domain of the texts and the objectives of the reader, but also his past cumulated
experience. Obviously the accuracy of the representation could be initially quite inaccurate, but it
improves in time after the analysis of new documents.
Even if each node within the LTM associative network is related to a string, in fact it represents a
concept because the pattern of links established by the word constrains its meaning.
For example the string “house” can be associated to the meanings of building, organization or
dynasty. But if the corresponding node within the activated part of the network is linked to the ones
related to the strings “product”, “international” and “turnover”, surely the words belong to the
economy domain.
The meaning of the activated part of the network could be automatically disambiguated by a
technique similar to the one developed by Veronis and Ide (1998), that is based on the electronic
dictionary WordNet (Miller, 1993).
Figure 4.1 – The disambiguation procedure. LTM words are linked to WordNet synsets.
In WordNet all the possible meanings of a word are represented through synsets, i.e. sets of
synonims. Even in this case the sense of a word is constrained by the presence of the synonims, but
this association is not weighted as happens in the LTM associative network which is more suitable
to represent the cognitive model of an employee.
This is due to the pattern of links that can be considered as a set of fuzzy relations. For example
there can be two representations related to different employees where the word “software” is linked
to the words “open” and “freeware”. But in the first representation “software” could be more
strongly connected with “open”, while in the second case the same couple of words could establish
a weaker connection with respect to the other one.
So for the first employee’s knowledge representation the static meaning of the word “software” is
more linked to the open source world. Probably the diffusion of an activation signal starting from
“software” could also lead many times to the activation of the word “open” with a lower activation
level for the word “freeware”.
But the meaning of a word within the associative network keeps a fuzzy form, while certain
processes that require a more clear disambiguation, as the information retrieval or the search of
competencies, need the word to be associated with a precise and shared meaning.
In these cases we can use the following procedure.
First the existing links with the associative network must be normalized i.e. the weight of each link
starting from a node must be divided by the greatest one. After that each node is linked to all the
possible related WordNet synsets and if a word within a given synset is present in the associative
network a link with weight 1 must be added between the node representing the synset and the node
representing the word as shown in figure 4.1.
Furthermore all the possible synsets related to a given word must be connected by inhibitory
connections with a negative (-1) weight , because they represent disjoined meanings of the same
word.
Finally an activation signal procedure similar to the one defined by Kintsch could be used to select
the most activated synset.
Once the words have been disambiguated, the knowledge of an existing ontology built over
WordNet, as the alignments of SUMO (Niles and Pease, 2003) and DOLCE (Gangemi, Guarino,
Masolo and Oltramari, 2003), can be recalled in order to provide a symbolic description of their
meanings.
A better representation particularly suited to represent the knowledge of an entire organization
could be obtained reifying the conceptualization of the document.
As depicted in fig. 4.2 the personal interpretation of the resource A made by the employee E1 could
be represented as a blank node connected by a relation “#regards” with the resources representing
the synsets of WordNet corresponding to that conceptualization.
The resource representing the analysed document could be connected with the blank node by a
relation “#viewedAs”.
The resource representing the employee E1 could be associated with the blank node by a relation
“#knows” and with the synset resources by the relation “#interestedIn”.
This schema is justified by the consideration that the conceptualization of a given information is a
process influenced by the knowledge of the reader and his objectives.
This representation is updated every time new documents are visioned by the emoployees, aiding to
track the interests and the competencies of the employees.
After this knowledge acquisition phase a user can select a set of arguments of interest coded as
WordNet synsets and the system can return a set of documents that according to the view of a
particular employee regards that particular argument.
Figure 4.2 – The reification of a conceptualization made by the employee A.
In this way the users can find useful information and they can also know the personal interpretations
made by the others.
This is a manner to keep track of the activities of the other workers favouring the creation of
communities of practice (Wenger and Snyder 2000).
By the selection of the corresponding synsets it could be also possible to know all the employees
whose activities are involved within a given research area or knowledge domain.
The presence of an external ontology could make the information search more effective and
accurate. For example the selected synset WNS_4 in figure 4.1 could be identified as a subclass of a
synset WNS_3 that is mapped as a part of the conceptualization related to the employee E1.
Other selected synsets could be linked by meronimy relations with the mapped synsets.
We can consider the following case study regarding the medical domain that takes into account
DOLCE, a particular external ontology built over WordNet.
A user is interested in the concept of “$fungal_infection”. The ontology could retrieve the synsets
“$coccidioidomycosis” , a mycotic infection of lungs and skin that is a specialization of the
searched concept, and “$zymosis” that is the process of the development and spread of an infectious
disease caused by a fungus, that is a generalization of the searched concept.
These two retrieved synsets could be mapped within the conceptualization of two different medical
articles, the first one known by a specialist of mycotic infections and the second one known by
physician who made a search for a patient that presents zymotic symptoms.
The user could retrieve all the articles read by the specialist and the physician, knowing their
personal conceptualizations and so their interests and their actual pursuits and activities. After that
he could decide to contact one of the two mapped users.
This operation could also be performed automatically by the system, that could present a list of
documents and users interested in the same topic of the last analysed document, enhancing the
awareness of the users on the competencies of the others.
Suitable user interfaces for the treatment of the query are needed. A symple one could provide the
user with a list of all the possible synsets related to the exposed query to be selected.
But this solution is indeed slightly user-friendly.
Another solution could perform a disambiguation of the words constituent the query by the same
activation signal diffusion process presented before.
CONCLUSIONS
The knowledge acquisition module can be used to create and update the models of a group of
employees. The models are dynamic knowledge representations structured as associative networks
of words.
These particular schemas can be analysed in a static or dynamic manner.
The static or potential meaning of a word is given by the position that it occupies within the
network. It represents all the acquired knowledge regarding that word.
The actual meaning of a word can be determined by the diffusion of an activation signal starting
from the nodes which represent the content of the last information analysed by the user.
This process can be used to detect the actual interests of an employee, that can be better described
by the use of an external ontology.
The reification of the conceptualizations enables an intelligent search of information based on the
personal interpretations of the other workers.
This solution can favour the reflection on all the possible implications of the analyzed information
and the awareness of the actual activities and competencies of the other workers. These processes
are generally considered as the basis for the knowledge creation (Nonaka, Toyama and Konno,
2000) and the developing of communities of practice (Dieng, 2000) within an organization.
Future tests are needed to evaluate the effectiveness, the efficiency and the scalability of the
knowledge acquisition system.
ACKNOWLEDGEMENTS
The authors are grateful to Dr. Alessandro Oltramari (Laboratory for Applied Ontology, Institute of
Cognitive Science and Technology, Italian National Research Council, Trento) for helpful
discussions, comments and criticisms.
Special thanks goes to Dr. Ebe Tartufo and Dr. Cristiana Machella for their kind help and support.
REFERENCIES
Alavi M. and Leidner D. (2001), “Knowledge Management and Knowledge ManagementSystems: Conceptual Foundations and Research Issues”, MIS Quarterly, 25 (1).
Dieng R. (2000), “Knowledge Management and the Internet”, IEEE Intelligent Systems.Dragoni A., Tascini G., Lella L. and Giordano W. (2004), "Knowledge Extraction Using
Dynamical Updating of Representation", Proceedings of the 3rd workshop on RObust Methods inAnalysis of Natural Language Data.
Gangemi A., Guarino N., Masolo C. and Oltramari A. (2003), “Sweetening WordNet withDOLCE”, AI Magazine 24(3): Fall 2003, 13-24.
Kintsch W. (1998a). “The Representation of Knowledge in Minds and Machines”. InternationalJournal of Psychology, 33(6), pp.411-420.
Kintsch W. (1998b). “Comprehension. A Paradigm for Cognition”. Cambridge University Press,1998.
Kintsch W., Patel V.L. and Ericsson K.A. (1999). “The role of long-term working memory in textcomprehension”. Psychologia, 42, pp.186-198.
Ide N. and Veronis J. (1998), “Word Sense Disambiguation : The State of the Art”.Computational Linguistics, no.24(1).
Landauer T.K., Foltz P.W., Laham D. (1998). “An Introduction to Latent Semantic Analysis”.Discourse Processes, 25, pp.259-284.
Licata I., Lella L. and Giordano W. (2004), "From Ontologies to Ontogenetic Models: KnowledgeExtraction Using Dynamical Updating of Representation", Proceedings of Dynamic Ontology.
McClelland J.L., Rumelhart D.E. (1986). “Parallel distributed processing”. Cambridge, MA: MITPress.
Miller G. A. (1993). “Five papers on WordNet”. Cognitive Science Laboratory Report 43.Niles, I. and Pease, A. (2003), “Linking Lexicons and Ontologies: Mapping WordNet to the
Suggested Upper Merged Ontology”. In Proceedings of the 2003 International Conference onInformation and Knowledge Engineering (IKE ’03), Las Vegas, Nevada, June 23-26, 2003.
Nonaka I. (1994), “A Dynamic Theory of Organizational Knowledge Creation”, OrganizationScience, Vol.5, No. 1, February 1994.
Nonaka I. and Takeuchi H. (1995), “The Knowledge-Creating Company : How JapaneseCompanies Create the Dynamics of Innovation”, Oxford University Press, New York, 1995.
Nonaka I., Toyama R. and Konno N. (2000), “SECI, ba and leadership: a unified model ofdynamic knowledge creation”, Long Range Planning (32), pp. 5-34.
Polanyi M. (1967), “The Tacit Dimension”, in Knowledge in Organizations, L. Prusak, Ed.Boston, MA: Buttreworth-Heinemann, 1997, pp. 135-146.
Polanyi M. (1975), “Personal Knowledge”, In Meaning. M.Polanyi and H Prosch (eds.),University of Chicago Press, Chicago, 1975, pp. 22-45.
Smith R.G. and A. Fatquhar (2000), “The Road ahead for Knowledge Management”, AIMagazine.
Stenmark D. (2003), “Knowledge Creation and the Web: Factors Indicating Why Some IntranetsSucceded Where Others Fail”, Knowledge and Process Management, vol.10, n.3, pp. 207-216.
Steyvers M. and Tenenbaum J. (2005), “The Large-Scale Structure of Semantic Networks:Statistical Analyses and a Model of Semantic Growth”, Cognitive Science n.29, pp. 41-78.
Van Dijk T.A., Kintsch W. (1983). “Strategies of discourse comprehension”. New York:Academic Press.
Wenger E.C. and Snyder W.M. (2000), “Communities of Practice: The Organisational Frontier”,In Harvard Business Review, Harvest Business School Publishing, Boston, MA, January-February,pp. 139-145.