Cognitive Load and Discovery Learning

 

 

Mr. Juhani E. Tuovinen

Charles Sturt University

School of Education

P O Box 588, Wagga Wagga, NSW, 2678

Ph 02 6933 2461

Fax 02 6933 2888

Email jtuovinen@csu.edu.au

 

 

 

ABSTRACT

 

Innovative forms of instructional provision such as discovery learning

are discussed in the context of cognitive load theory. Recent

educational experiments with school and university students are

described where methods based on cognitive load theory were used to

measure the educational effectiveness of instruction in computer

education, contrasting discovery learning and other forms of

instruction. The implications of the results of these experiments

suggest considerable deficiencies in discovery learning with data

favouring new, more effective forms of instruction based on cognitive

load theory. These new learning approaches reduce cognitive load by

eliminating the extraneous working memory load caused by the use of

some problem solving strategies during learning, or the elimination of

split-attention and redundancy effects for material that imposes a high

working memory load.

 

Cognitive Load Theory

 

Cognitive load theory (Sweller, 1988; 1994) derives instructional

design principles from aspects of our cognitive architecture. The

theory assumes a very limited working memory (Miller, 1956), an

effectively unlimited long-term memory (Simon and Gilmartin, 1973)

holding large numbers of schemas (Chi, Glaser & Rees, 1982) that can

vary in their degree of automaticity (Kotovsky, Hayes & Simon, 1985).

This architecture interacts with instructional material in various

ways.

 

First, different learners will process the material in different ways.

If the elements of material that require processing are incorporated in

an automated schema, working memory load (or cognitive load) will be

low. Schemas allow many elements to be treated as a single element in

working memory and automatic processing limits working memory demands

compared to controlled, conscious processing (Schneider & Shiffrin,

1977; Shiffrin & Schneider, 1977). As a consequence, if a learner has

acquired appropriate automated schemas, cognitive load will be low and

substantial working memory resources are likely to be free. In

contrast, if the elements of material that require processing must each

be considered as a discreet element in working memory because no schema

is available, cognitive load will be high. Working memory may be

entirely occupied in processing large numbers of individual elements.

 

Second, the characteristics of the instructional material is important.

Some material can be learned element by element without relating one

element to another. Learning basic word pairs in foreign language

provides an example. Each vocabulary item can be learned without

reference to any other item. Such material is low in element

interactivity and low in intrinsic cognitive load. It imposes minimal

demands on working memory. Alternatively, situations where a number of

elements must be considered simultaneously for the successful execution

of a task, are called high element interactivity tasks. Learning the

order of words in English provides an example. Word order cannot be

learned without considering several words simultaneously. Under these

circumstances, intrinsic cognitive load is high because of high element

interactivity. These situations occur often in mathematics, computer

programming, design development, etc., where no individual component

can be considered in isolation, since any action on a given component

will have complex and far-reaching effects on the task outcome.

 

Third, the characteristics of the learner and the material to be

learned, interact. Material which imposes a heavy cognitive load for

some people because they must deal with large numbers of interacting

elements may impose less of a cognitive load for other people because

they have acquired automated schemas that incorporate the individual

elements. An expert in elementary algebra will treat the equation, (a +

b)/c = d, as a single, automated schema requiring limited working

memory resources. A novice who has just commenced learning algebra may

need to treat each symbol and relations between symbols as individual,

interacting elements, resulting in a working memory overload.

 

This theory has proved beneficial for the improvement of the planning,

organisation and implementation of learning in many fields. It is

argued that in the process of dealing with information, working memory

has only a limited processing capacity available to deal with distinct

items at any given time, and that the capacity of working memory is

often overloaded due to inappropriate presentation of material or

inappropriate learner activities, leading to a reduction in learning

and the capacity to solve problems. Thus new material is learned most

effectively and efficiently if the unnecessary cognitive load is

reduced to a minimum.

 

The cognitive load associated with any task consists of two parts.

There is the intrinsic or natural cognitive load, i.e. the inherent

aspects of the mental task that must be understood for the learner to

be able to carry out the task. Intrinsic cognitive load is determined

by levels of element interactivity. However, in addition, there is

usually a range of extraneous matters associated with the way the

instructional material is taught, that may add to the inherent nucleus

of the intrinsic cognitive load (Sweller, 1988, 1994). This category of

cognitive load is classed as extraneous cognitive load.

 

In an effort to reduce the extraneous cognitive load associated with

the use of problem solving search strategies, several researchers have

tested the effects of using worked examples. Worked examples require

the reader to attend to problem states and their associated moves,

rather than searching for the right moves involved in conventional

problem solving. In situations where an extraneous cognitive load due

to problem solving search existed, worked examples were found to

effectively reduce that load and enhance learning (e.g. Carroll, 1994;

Cooper & Sweller, 1987; Paas, 1992; Paas & van Merrienboer, 1994;

Sweller & Cooper, 1985; Trafton and Reiser, 1993; Zhu & Simon, 1987).

 

This presentation deals with some comparisons of approaches to learning

which have a bearing on discovery learning.

 

Discovery Learning

 

Discovery learning principles have been widely promoted since the

1960's at various levels of education. Let us begin with three

statements by Jerome Bruner, the father of discovery learning.

 

"Consider now what benefits might be derived from the experience of

learning through discoveries that one makes oneself. I shall discuss

these under four headings: (1) the increase in intellectual potency,

(2) the shift from extrinsic to intrinsic rewards, (3) the learning of

the heuristics of discovering, and (4) the aid to conserving memory."

(Bruner, 1979, p. 83)

 

"By recognizing the legitimacy of intuition as an intellectual

operation, schools could spare their students the painful relearning

that is required of them later when , for example, they "really get

into" physics and are required not to prove a given solution but to

find a solution." (Bruner, 1971, p. 92)

 

"You cannot consider education without taking into account how culture

gets passed on. It seems to me highly unlikely that given the

centrality of culture in man's adaptation to his environment - the fact

that culture serves him in the same way as changes in morphology served

earlier in the evolutionary scale - that, biologically speaking, one

would expect each organism to rediscover the totality of its culture -

this would seem most unlikely. Moreover, it seems equally unlikely,

given the nature of man's dependency as a creature, that this long

period of dependency characteristic of our species was designed

entirely for the most inefficient technique possible for regaining what

has been gathered over a long period of time, i.e. discovery." (Bruner,

1966, p.101).

 

In these three statements Bruner makes three major claims about

discovery and learning. In the first statement he argues that learning

by discovery is beneficial. In the second statement he asserts that it

is important to learn to discover for oneself, that is, he sees

discovery as an end or a goal of learning, rather than the means, as

suggested by the first statement. In the final statement he cautions

that learning by discovery is inefficient, and should not be expected

to be the main means of education.

 

 

Discovery Learning and Cognitive Load

 

Work in the Cognitive Load Theory framework has resulted in two major

findings that help to clarify some aspects of the questions regarding

the usefulness of discovery learning.

 

Goal-free problem solving

When students studying kinematics in physics were given goal-free

problems, instead of the conventional problems, which required them to

find one specific answer, their learning improved (Sweller & Levine,

1982). In this situation the students had available a small number of

kinematics equations describing the action of bodies under constant

acceleration. Conventionally after a lesson introducing the equations

they were given problems of the type:

 

"If a stone begins to move from rest under the action of constant

acceleration of 3 m/s2, find the final velocity after 4 seconds."

 

The alternative form of goal-free problem used was:

 

"If a stone begins to move from rest under the action of constant

acceleration of 3 m/s2, find all about it after it has moved for 4

seconds."

 

In this situation the students were discovering or exploring all the

possible alternative variables describing the object's motion and their

values. The goal-free problem solving produced better learning than the

conventional problem solving.

 

Why was this format of practice better? On reflection it became

apparent that in the conventional goal-directed practice the students

were employing a problem solving strategy called 'means-ends analysis'.

During the means-ends analysis the students had to attend to the

possible actions for problem solving, the given information, and the

final goal, and how they might reach the final goal, i.e. the

intermediate goals. In the goal-free condition they only had to

process in their limited working memories the possible actions, the

given information and a simpler goal - to find an equation to derive

any new variable value. Thus instead of overloading their working

memories with an extraneous load as described above, they were able to

deal with a smaller number of variables more effectively. Thus instead

of concentrating on the problem solving to the exclusion of the

learning the schema for the subject matter to be learned, they were

able to develop better schemas, due to a smaller cognitive load in the

goal-free problem solving.

 

However, in a subsequent experiment involving geometry learning the

goal-free problem superiority effect over conventional problem solving

vanished. It became apparent that in the physics situation there was

only a small number of possible equations that could be used, and only

a small number of variables and their values that could be derived.

This is sometimes termed the size of search space in problem solving.

When the search space became larger, the greater variety of options

available produced an intolerable load on the working memory and the

processing advantage disappeared.

 

Instead the heavy use of worked examples was found to be a more

effective way to teach geometry (and many other things as noted above)

than conventional problem practice.

 

Exploration learning vs worked examples practice

In a more recent experiment the worked examples practice approach was

directly contrasted with exploration (discovery) practice for learning

to use a computer program, a database (Tuovinen & Sweller,

unpublished). The university students were given common lessons in the

development of database files and their manipulation. Then they were

asked to practice the operation before sitting for a test of the

database operations a week later. In the practice stage they either

freely explored the database operations or read through multiple worked

examples before working through practice tasks.

In this work there was no difference in the learning by the two groups

on low element interactivity material. However, with high element

interactivity content, i.e. the construction of database field

formulas, interesting results were obtained. If the students had

previously experienced some database work, there was no statistically

significant difference between the forms of learning. However, for the

students with no previous database exposure before this unit of work,

the worked examples practice was significantly more effective. In fact

the mean test scores for the 4 groups were:

 

 

DATABASE EXPERIENCE

NO PREVIOUS EXPERIENCE PREVIOUS EXPERIENCE

WORKED EX 29.6 30.9

EXPLORATION 15.1 35.9

 

This indicates that the exploration practice was only useful if the

student already possessed the schema required to be used in exploration

or discovery. On the other hand if the students did not possess the

schema, they were significantly disadvantaged by the exploration or

discovery practice format in comparison with the worked examples

practice.

 

 

General Discussion

 

What does this say about discovery or exploration learning in general?

Firstly, it appears that if the material has low element interactivity,

most common practice methods are equivalent. Secondly, it would appear

that if the schema for the area is known, the exploration or discovery

method may be as beneficial as the worked examples approach. However,

if the schema are poorly known the exploration or discovery approach is

more time consuming and the learning is less effective than the worked

examples approach for high element interactivity material.

 

Similarly if the search space is small, i.e. the number of elements to

be manipulated is low, the exploration of discovery method is more

effective than conventional problem solving practice, due to the

reduction of the extraneous cognitive load. However, even when the

search space is much larger, the worked examples approach is more

effective than the conventional problem solving.

 

Thus the worked examples approach produced at least as good or better

learning than the exploration or discovery approach.

 

This work has mainly addressed Bruner's first contention, with respect

to the benefits derived from learning by discovery in specific

contexts, and supported his third cautionary comment regarding the

limitations of the learning by discovery. Thus we have made a beginning

in the long process of finding out, as Cronbach put it in 1966, of

generalisations with respect to the benefits and limitations of

inductive (discovery) learning approaches for:

 

¥ particular subject matter

¥ inductive experiences of particular type

¥ provided in particular amounts

¥ producing particular patterns of responses

¥ in pupils at a particular level of development (Cronbach, 1966, p. 77).

 

References

 

Bruner, J. S. (1966). Some elements of discovery. In L. S. Shulman & E.

R. Keislar (Eds.), Learning by discovery: A critical appraisal (pp.

101-113). Chicago: Rand McNally.

Bruner, J. S. (1971) Toward a disciplined intuition. In A. Gil (Ed.)

The relevance of education (pp.82-107). London: George Allen & Unwin.

Bruner, J. S. (1979) The quest for clarity. In J. S. Bruner (Ed.) On

knowing. Essays for the left hand (pp. 75-96). Cambridge, MA: Harvard

Univ. Press.

Carroll, W. (1994) Using worked examples as an instructional support in

the algebra classroom. Journal of Educational Psychology, 86(3),

360-367.

Chi, M. T. H., Glaser, R. & Rees, E. (1982) Expertise in problem

solving. In R. J. Sternberg (Ed.) Advances in the psychology of human

intelligence, (pp. 7-75) Hillsdale, New Jersey: Lawrence Erlbaum Assc.

Cooper, G., & Sweller, J. (1987). Effects of schema acquisition and

rule automation on mathematical problem-solving transfer. Journal of

Educational Psychology, 79(4), 347-362.

Cronbach, L. J. (1966). The logic of experiments on discovery. In L. S.

Shulman & E. R. Keislar (Eds.), Learning by discovery: A critical

appraisal (pp. 76-92). Chicago: Rand McNally.

Kotovsky, K., Hayes, J. R., & Simon, H. A. (1985) Why are some

problems hard? Evidence from the Tower of Hanoi. Cognitive Psychology,

17, 248-294.

Miller, G. A. (1956). The magical number, seven, plus or minus two:

some limits on our capacity for processing information. Psychological

Review, 63, 81-97.

Owen, E., & Sweller, J. (1985). What do students learn while solving

mathematics problems? Journal of Educational Psychology, 77(3),

272-284.

Paas, F. G. W. C. (1992). Training strategies for attaining transfer of

problem-solving skill in statistics: a cognitive-load approach. Journal

of Educational Psychology, 84(4), 429-434.

Paas, F. G. W. C., & Van Merri‘nboer, J. J. G. (1994). Variability of

worked examples and transfer of geometrical problem-solving skills: A

cognitive-load approach. Journal of Educational Psychology, 86(1),

122-133.

Schneider, W. & Shiffrin, R.(1977) Controlled and automatic information

processing: I. Detection, search and attention. Psychological Review,

84, 1-66.

Shiffrin, R. & Schneider, W. (1977) Controlled and automatic

information processing:II. Perceptual learning, automatic attending,

and a general theory. Psychological Review, 84, 127-190.

Simon, H. A. & Gilmartin, K.(1973) A simulation of memory chess

positions. Cognitive Psychology, 5, 29-46.

Sweller, J., & Cooper, G. A. (1985). The use of worked examples as a

substitute for problem solving in learning algebra. Cognition and

Instruction, 2(1), 59-89.

Sweller, J., & Levine, M. (1982). Effects of goal specificity on

means-ends analysis and learning. Journal of Experimental Psychology:

Learning, Memory and Cognition., 8, 463-474.

Sweller, J.(1988) Cognitive load during problem solving: effects on

learning. Cognitive Science, 12, 257-285.

Sweller, J. (1994) Cognitive load theory, learning difficulty, and

instructional design. Learning and Instruction, 4, 295-312.

Trafton, J. G., & Reiser, B. J. (1993). The contributions of studying

examples and solving problems to skill acquisition. In Proceedings of

the Fifteenth Annual Conference of the Cognitive Science Society, (pp.

1017-1022). Institute of Cognitive Science, University of

Colorado-Boulder: Lawrence Erlbaum Assoc., Pub.

Tuovinen, J. E. & Sweller, J. (unpublished) Cognitive load associated

with discovery learning techniques.

Zhu, X., & Simon, H. A. (1987). Learning mathematics from examples and

by doing. Cognition and Instruction, 4, 137-166.