Thesis Summary

 Michael T. Cox
Computer Science Department
Carnegie Mellon University
Pittsburgh, PA 15213-3891





Abbreviated Statement


Abstract

My dissertation research investigated goal-driven learning by creating a computational model of introspective multistrategy learning implemented in a system called Meta-AQUA. I used this specification to model the learning performed by humans during complex reasoning tasks, and I applied the model to the task of selecting a learning algorithm in machine learning contexts. The focus of my research has been on the strategy selection task in learning. That is, given some computational performance task specified by the system's goals, context and some input, if a failure occurs during the task, the problem is to choose a learning algorithm with which to repair the background knowledge of the system. The background knowledge is considered repaired if, given a similar future situation, the failure does not recur. The system I developed is a hybrid model having two parts. Given a reasoning failure, the first half uses case-based methods to retrieve meta-explanation patterns (Meta-XPs) that support self-reflective blame assignment of the reasoning failure and that assist in the generation of a set of learning goals. Given such learning goals (representing desired changes in the system's background knowledge), the second part of the hybrid model uses a nonlinear planner to construct a partially ordered sequence of calls to various learning algorithms.
 
 

The Central Problem

The overall extent of my research has been to develop a case-based model of introspection and to explore its benefits and limitations in learning, understanding and problem-solving. More than any other aspect of cognition that separates humans from the rest of the animal (and artificial) world is the ability to deliberately formulate goals, develop plans for achieving them, and to reflectively reason about why such goals and plans fail. Self-knowledge in the form of memories of one's own reasoning and knowledge of one's own memory system, as well as confidence in one's own cognitive abilities and knowledge, is a key leverage point from which to reason about these tasks. The specific focus of my dissertation is that of strategy selection in multistrategy learning, particularly, in failure-driven learning. That is, given some task specified by a system's goals (e.g., story understanding or problem solving), context and some input, if a failure occurs during the task, the computational problem is to choose a learning algorithm or strategy with which to repair the background knowledge of the system. The background knowledge is considered repaired if, given a similar future situation, the failure will not recur. My solution to this problem is to maintain a declarative trace of reasoning that leads to or supports a particular choice of goals or plans, to retrieve past cases of meta-reasoning that can explain the reasoning failure, and then to directly inspect and manipulate such explanations. The system's resultant analysis of its failure is then used as a basis for generating specific learning goals. Subsequently, a nonlinear planner chooses a learning strategy to achieve these goals. This method has proved a fruitful way in which to apply computational introspection to both understanding and learning.

 Yet introspection has a number of disadvantages including considerable resource overhead, and with regard to human learning, a severe lack of reliability. So my research does not pretend to claim that introspection is a computational panacea. Rather, I have investigated the role of introspection in learning and the conditions under which it is either a gain or a loss. Adding introspection to a machine, however, does allow it to have an idea of what it is doing and why. Moreover, in some cases (for example when errors are due to forgetting) it is necessary for the agent to consider that reasoning failures may originate with the reasoner's own cognitive abilities, and not simply with the world about it. In such cases introspection is a necessary precondition to learning. Furthermore, to justify the importance of introspection in multistrategy learning systems, I have advanced the following argument: To choose a learning strategy, a system needs to know what is supposed to be learned (that is, it must have a learning goal); to decide what needs to be learned, it must know the cause of failure; to determine the cause of the failure, it must perform blame assignment; and to perform complete blame assignment in many situations, it must reflect upon its own reasoning.

Given such a model, though, a number of questions arise: What exactly can go wrong when reasoning? That is, what are the sources of failure or what kinds of failures should a system consider? Then, given a failure taxonomy, how can a system declaratively represent these failures and how can it make use of such knowledge representation? That is, what constitutes a sufficient formal language for expressing instances of failure and how can the system perform blame assignment upon them? How do these structures then help in generating the goals or help it decide what to learn? What are the types of such learning goals? In what way can a system use self-knowledge to analyze its problem-solving performance? Finally, how can a system choose an algorithm or strategy, given a learning goal, and what are the possible learning-algorithm interactions not found in single-strategy systems?
 
 

Solution Outline

A taxonomy of reasoning failures

 To answer the first of these questions requires only a few assumptions. Given that reasoning is the goal-directed processing of a given input using the reasoner's knowledge, then only a limited number of classes of faults can be responsible for a given failure: the reasoning failure can originate in either the reasoner's goals, its processing strategies, the input, or the domain knowledge. Given an additional assumption that knowledge is memory-based and subject to retrieval and organizational problems, then the organization of suspended goals (indexes), processing strategy associations (heuristics), or the organization (indexes) of the domain knowledge may also be to blame. In my taxonomy of reasoning failures, if one of these categories is responsible for an error, the item corresponding to the category is either absent or incorrect. If an item is correct, then that category contributes nothing to the failure. The taxonomy is comprehensive and goes beyond systems that limit the cause of error by assuming, for instance, noise-free input or other simplifications.

 Blame assignment

 To reason about these types of failures I have extended Schank and Ram's notion of an explanation pattern (XP). A meta-explanation pattern (Meta-XP) is an explanation of how and why an explanation goes awry in a reasoning system. I have developed two classes of Meta-XPs that facilitate a system's ability to reason about itself and to assist in selecting a learning algorithm or strategy. A Trace Meta-XP (TMXP) explains how a system generates an explanation about the world or itself, and an Introspective Meta-XP (IMXP) explains why the reasoning captured in a TMXP fails. The TMXP records the structure of reasoning tasks and the reasons for decisions taken in processing in a chain or series of decide-compute nodes. The IMXP is a general causal structure composed of primitive, network structures that represent various failure types identified in the failure taxonomy above. They are retrieved and applied to instance of reasoning captured in TMXPs, and assist in forming the learning goals of the systems after failure occurs.

Deciding what to learn

 Once a system has identified the causes of a given reasoning failure, it must decide what it needs to learn. To represent these desires explicitly, it posts a series of learning goals that, if achieved, will reduce the likelihood of repeating the failure. Some learning goals seek to add, delete, generalize or specialize some concept or procedure. Others deal with the ontology of the knowledge, that is, with the kinds of categories that constitute particular concepts. Many learning goals are unary in that they take a single target as argument. For example, a knowledge acquisition goal seeks to determine a single piece of missing knowledge, such as the answer to a particular question. A knowledge refinement goal seeks a more specialized interpretation for a given concept in memory, whereas a knowledge expansion goal seeks a broader interpretation that explores connections with related concepts. Other learning goals are binary in nature since they take two arguments. A knowledge differentiation goal is a goal to determine a change in a body of knowledge such that two items are separated conceptually. In contrast, a knowledge reconciliation goal is one that seeks to merge two items that were mistakenly considered separate entities. Both expansion goals and reconciliation goals may include/spawn a knowledge organization goal that seeks to reorganize the existing knowledge so that it is made available to the reasoner at the appropriate time, as well as modify the structure or content of a concept itself. Such reorganization of knowledge affects the conditions under which a particular piece of knowledge is retrieved or the kinds of indexes associated with an item in memory.

Strategy selction

 Given a learning goal, then, a system must also decide which learning strategy is most appropriate to achieving it. The approach I take is to treat the learning task like a traditional planning problem, creating a learning plan that is composed of a series of executions of learning algorithms that will achieve its learning goals. However, unlike learning algorithms executed by single-strategy systems, the learner must dynamically consider possible interactions that may occur between the learning strategies. It is therefore important to recognize that when multiple items are learned from a single episode, the changes resulting from one learning algorithm may affect the knowledge structures used by another algorithm. Such dependencies destroy any implicit assumption of independence built into a particular learning algorithm used in isolation. For instance, if one algorithm modifies a conceptual definition, thus introducing or altering constraints on an attribute of the concept, any re-indexing based on this attribute must occur after the modification rather than before it, in order for the indexing to be effective. A non-linear planner is thus used to resolve these types of dependencies and goal interactions.
 
 

Implementation

To investigate and evaluate the theoretical problems, I have implemented an introspective version of AQUA, called Meta-AQUA. AQUA (Ram, 1991; 1993) is a question-driven story understanding system that learns about Middle Eastern terrorist activities. Its performance task is to "understand" the story by building causal explanations that link the individual events into a coherent whole. Meta-AQUA adds introspective reasoning and learning using Meta-XP structures. Its domain consists of using knowledge of terrorist activities (taken from the original AQUA system) to understand stories of drug smuggling. Meta-AQUA is programmed in Symbolics Common LISP under Genera Version 8.1.1 on a Symbolics MacIvory-3 LISP processor embedded in a Macintosh II computer. The LISP source files use 764,201 bytes of disk space. To support the main system, I have written a large frame system that manages Meta-AQUA's knowledge representation. I have also incorporated the Tale-Spin story generation system (Meehan, 1981) into Meta-AQUA's front end to supply automatically-generated input and the Nonlin planning system (Ghosh, Hendler, Kambhampati, & Kettler, 1992; Tate, 1976) on Meta-AQUA's back-end to generate the final learning plan.

The program has been evaluated in a number of contexts. In machine-learning environments, my emphasis has been on the problem of choosing computational learning algorithms, given some learning task. In cognitive science evaluation, the emphasis has been on modeling how human learners choose particular approaches or metacognitive strategies given some problem-solving task. Most of my results have been published in machine learning and other artificial intelligence forums, including an invited chapter in the latest volume of Michalski and Tecuci's Machine Learning: A Multistrategy Approach (Ram & Cox, 1994). I am also working on the relationship between learning goals and the notion of bias in learning systems. In my theory, learning goals further extend the analysis of bias by providing a system with tendencies to select particular changes to its background knowledge and selective filtering of learning examples. The theory has also been applied to human data in real-world problem-solving tasks. In collaboration with colleagues from the Industrial and Systems Engineering Department of the College of Engineering at Georgia Tech, we have successfully used this approach in the domain of diagnostic repair of circuit boards, modeling the behavior of expert troubleshooters at NCR's electronics assembly plant in Atlanta. This research has been accepted for publication in the journal Cognitive Science (Ram, Narayanan, & Cox, in press).
 
 

Summary

In summary, I have a developed a novel theory of introspective multistrategy learning that has many implications and which incorporates a number of processes that are often considered necessary for intelligent behavior. My process theory addresses learning in both problem-solving and comprehension contexts. I have also developed a formalism with which to represent metacognitive processes.

References

Cox, M. T., & Ram, A. (in press). Learning-goal interactions: Treating learning
as a planning task. In M. Keane & J.-P. Haton (Eds.), Topics in case-based
reasoning (Lecture notes in artificial intelligence). Berlin: Springer-Verlag.

Ghosh, S., Hendler, J., Kambhampati, S., and Kettler, B. (1992). UM Nonlin [a
Common Lisp implementation of A. Tate's Nonlin planner]. Maintained at the
Dept. of Computer Science, University of Maryland, College Park, MD. Available
by anonymous ftp from cs.umd.edu in directory /pub/nonlin.

Meehan, J. (1981) Talespin. In R. C. Schank & C. Riesbeck (Eds.), Inside
computer understanding: Five programs plus miniatures. Hillsdale, NJ: Lawrence
Erlbaum Associates. 

Ram, A. (1991). A Theory of Questions and Question Asking. The Journal of the
Learning Sciences, 1(3&4):273-318.

Ram, A. (1993). Indexing, Elaboration and Refinement: Incremental Learning of
Explanatory Cases. Machine Learning. 10:201-248.

Ram, A., & Cox, M. T. (1994). Introspective reasoning using meta-explanations
for multistrategy learning. In R. S. Michalski & G. Tecuci (Eds.), Machine
learning: A multistrategy approach IV (pp. 349-377). San Francisco: Morgan
Kaufmann.

Ram, A., Narayanan, S., & Cox, M. T. (in press). Learning to trouble-shoot:
Multistrategy learning of diagnostic knowledge for a real-world problem solving
task. Cognitive Science. 

Tate, A. (1976). Project planning using a hierarchic non-linear planner (Tech.
Rep. No. 25). Edinburgh, UK: University of Edinburgh, Department of Artificial
Intelligence.