PROPOSALS FOR RENEWAL IN ASSESSMENT FOR MALAYSIA TO FACILITATE THE REALISATION OF THE WIDER AIMS OF...

177
PROPOSALS FOR RENEWAL IN ASSESSMENT FOR MALAYSIA TO FACILITATE THE REALISATION OF THE WIDER AIMS OF PRIMARY SCIENCE EDUCATION NG KEE CHUAN Dissertation submitted in part fulfilment of the requirements of the MA (Science Education) Degree of the University of London. (Independent Study) September 1996 10

Transcript of PROPOSALS FOR RENEWAL IN ASSESSMENT FOR MALAYSIA TO FACILITATE THE REALISATION OF THE WIDER AIMS OF...

PROPOSALS FOR RENEWAL IN ASSESSMENTFOR MALAYSIA TO FACILITATE THE

REALISATION OF THE WIDER AIMS OFPRIMARY SCIENCE EDUCATION

NG KEE CHUAN

Dissertation submitted in part fulfilment of therequirements of the

MA (Science Education) Degree of the University ofLondon.

(Independent Study)

September 1996

10

Department of Science EducationInstitute of EducationUniversity of LondonACKNOWLEDGEMENTS

I would first of all like to thank God for His grace inhelping me to finish this work.

Special thanks to my loving wife, Kwai Yok, for herpatience, help and suggestions. You are great.

I would also like to thank Mr. Arthur Jennings for hismost able guidance and direction as he tutored me throughmany hours of discussions. I have gleaned much from youryears of experience in the field of science education.

My thanks and gratefulness is also acknowledged to theMinistry of Education of Malaysia for this opportunity tostudy and research in the Institute of Education,University of London. It has been an enriching experiencein many ways.

11

ABSTRACT

The aims of science education today have expanded

beyond the typical science domain to encompass a wider

scope of study. In this treatise, a two dimensional

matrix with the scientific-general domain as the

horizontal cline and the explicit-implicit aim of science

education as the vertical cline is proposed as an

operational classification system that helps to garner a

sense of order and perspective to these multiple aims.

This work also attempts to demonstrate that science

education is not merely a knowing of scientific knowledge,

but also has the desired purpose of competency in the

doing of scientific investigations. Thus, there is the

need to acquire a working knowledge of scientific process

skills, procedural understanding of the scientific

method, the practical manipulative skills and the methods

of problem-solving. In addition to this, science

education needs to foster a right feeling regarding

investigations and the role of science - thus, the need

to nurture proper scientific attitudes and values that

work concurrently with the aspects of knowing and doing

that have just been mentioned.

Consequently, the teaching of science can now employ

more creative methods that involves ‘bench work’, ‘word

work’, identifying and appreciating the relevance of

12

science and technology in society and life, browsing

through the annals of the development, struggles and

achievements of science and grappling with moral-belief

related issues.

Nevertheless, such wide and impressive aims of

science education cannot be realised if a complementary

method of assessment is not addressed. Typically, science

teachers teach to the assessment. Unfortunately,

traditional methods of assessments which focus mostly on

theoretical knowledge and are summative in nature are

unable to cope with this variety of aims. There is an

urgent need for new methods of assessment that are more

flexible and valid to measure progression in performance.

Furthermore, proposals for evaluation of the affective

dimension of the students are described. In this

treatise, alternative methods of assessment are explored

as a possible solution to the stated need. Ways to use

these assessments continuously and largely formatively

are suggested so that assessments will influence the

teaching-learning process.

A framework of assessment that integrates both

traditional and newer forms of assessments is formulated.

Criteria for the aspects of knowing, doing and feeling is

mapped out to help primary science teachers in Malaysia

track their students’ progress and thus apply alternative

assessment procedures. Finally, moderation procedures for

13

quality control and quality assurance are also considered

to enhance the reliability of teacher assessment.

14

CONTENTS

ACKNOWLEDGEMENTS 2

ABSTRACT 3

CONTENTS 5

INTRODUCTION 8

CHAPTER 1 : THE AIMS OF SCIENCE EDUCATION

1.1 One big happy family - the matter of integration10

1.2 Who’s in the family? - the matter of inclusion12

1.3 Family relationships - the matrix of aims in science education 14

1.3.1 Scientific knowledge (conceptual understanding)

1.3.1.1 Falsifiable yet reliable15

1.3.1.2 Constructing scientific knowledge17

1.3.2 Scientific methods (procedural understanding) 18

1.3.3 Process skills and manipulative skills23

Recapitulation29

1.3.4 Societal needs and relevance 30

15

1.3.5 History of science 34

1.3.6 Language and words35

Recapitulation37

1.3.7 Scientific attitudes, moral values and beliefs 38

1.3.8 Scientific and technological literacy43

1.3.9 Problem-solving methods and thinking skills 45

Recapitulation46

1.4 The family working together - the matter of practical application 47

1.5 The family up for adoption - the matter of implication for Malaysia

1.5.1 A brief overview of the history of science education

in Malaysia and its current scenario51

1.5.2 Proposals for application in the Malaysian context 54

1.5.3 Steps to bring about change in science education

for Malaysia56

16

CHAPTER 2 : ASSESSMENT IN SCIENCE EDUCATION

2.1 Renewal in approaches to assessment

2.1.1 The early reign of measurement (norm-referenced)

model and traditional assessment methods 59

2.1.2 The growing revolt against the traditional assessment

methods and the measurement model61

2.1.3 Coronation of the standard (criterion-referenced) model? 64

2.1.4 The spread of the standard model - performance assessment 65

2.2 Relevance of new approaches of assessment to science education

2.2.1 Clarifying the plethora of terminology67

2.2.2 The strengths and weaknesses of alternative assessments 73

CHAPTER 3 : PROPOSAL FOR FRAMEWORK OF ASSESSMENT OFPRIMARY SCIENCE FOR MALAYSIA

3.1 Pertinent factors in formulating the framework78

3.2 Overall structure of the framework for assessment 79

17

3.2.1 The aspect of knowing (10%)80

3.2.2 The aspect of doing (15%)81

3.2.3 The aspect of feeling (5%)83

3.3 Recording84

3.4 Moderation procedures87

3.4.1 Moderation for quality control88

3.4.2 Moderation for quality assurance94

CONCLUSION 97

APPENDICES

Appendix A Concepts of evidence and their defination99

Appendix B 9 Challenges of vision 2020 for Malaysia100

Appendix C Characteristics of authentic tests101

Appendix D Clusters of scientific process skills and procedural understanding

for each stage102

Appendix E Proposed recording form103

18

Appendix F Criteria for the aspect of knowing, feeling and doing 104

REFERENCES106

BIBLIOGRAPHY115

19

INTRODUCTION

After a demise of more than 10 years, primary

science was reinstated into the Malaysian primary school

curriculum as a subject of its own in 1994. Therefore,

we, the science educators in Malaysia, not only have a

lot of lost ground to make up but we also need to keep

abreast with current global changes in the world of

science education.

It is with this intention that I have sought to scan

the myriad sources of literature regarding the aims of

science education and attempt to structure these aims

into a working model to incorporate these ideas into the

science curricula in Malaysia.

Nevertheless, it has been a long running problem in

Malaysia that science teachers tend to teach science

using the expository method. Science is thus projected as

a “factual” subject with a plethora of scientific

terminology that needs to be memorised. Even though

practical work is carried out, they tend to be utilised

as sessions to illustrate scientific concepts and

theories.

With the reintroduction of science in the primary

school level in Malaysia, there is not much hope of the

situation improving. In fact, the situation could

20

deteriorate further since many primary science teachers

are not science specialists themselves. Thus, no matter

how appealing the developments in science education are,

they tend to falter when it reaches the implementation

stage in the classrooms and laboratories.

Nonetheless, it is not entirely fair to put the

blame squarely on the shoulders of science teachers.

After all, most of them will say that the expository

method is the most efficient and effective way to get

good results in the theory-based examinations.

Therefore, it is also my intention in this treatise

to address this matter regarding assessment. I hope to

formulate an assessment framework for primary science

that could be introduced to the Malaysian education

system. This framework will need to cover and give

credence to the range of cognitive, psychomotor and

affective dimensions that permeates the study of science

these days.

21

CHAPTER 1 : THE AIMS OF SCIENCE EDUCATION

1.1 ONE BIG HAPPY FAMILY - THE MATTER OF

INTEGRATION

Don’t “adulterate science syllabuses with non-

science” was the impassioned cry of a letter in School

Science Review (Siddons and Spurgin 1990). In these times

of constant change within the science education system

(Poole 1995:16), the question of whether “non-science”

elements should be included in its curriculum bears a

serious consequence to the composition of the aims for

science education.

It is irrefutable that the field of science itself

has undergone a ‘knowledge explosion’ (Whitfield 1971:8)

to an extent that its interface overlaps, intercepts and

interacts in increasing measure with the interfaces of

other fields (Newton 1988:82). Therefore, Martin

(1972:158-9) suggested that the content and structure of

the traditional science courses be reviewed and revised

from time to time so as to include new but related topics

from other fields.

Another factor which had considerable influence on

the composition of the aims of science education was the

emerging idea of liberal general education in the mid-

sixties and seventies. Imbedded in this approach was the

22

overriding ends of education for the all-round

development of a person morally, intellectually and

spiritually (Hirst 1970:24).

Phenix (1964:8) described such a “complete person”

as one who :

should be skilled in the use of speech, symbol and gesture,factually well informed, capable of creating and appreciatingobjects of aesthetic significance, endowed with a rich anddisciplined life in relation to self and others, able to makewise decisions and to judge between right and wrong, andpossessed of an integral outlook. These are the aims of generaleducation for the development of the whole persons.

Thus, liberal educationists like Hirst and Phenix

(1974, 1964) advocated an education which was concerned

with developing the mind in the various forms of knowing

and experience of the world. In line with this, Whitfield

(1971:20) forwarded the behaviourists approach of general

education where the development of “human abilities

through planned use of appropriate activities and

experience” became the main concern of education.

This ability-centred view where “generalised human

abilities” not originating from a particular subject

matter but with related applications inculcated to form

an “educated man” with an assortment of abilities thus

assumed the transfer of training across the logical

boundaries of knowledge (Phenix called these boundaries

of knowledge “realms of meaning” while Hirst called it

23

“forms of knowledge”) (Whitfield 1971:23). This

comprehensive development of the mind and abilities meant

that syllabuses and curricula of various subjects should

not be formed with isolationists tendencies but with

increasing interrelatedness among the forms of knowledge

(Hirst 1974:47).

Thus, science which was differentiated by both Hirst

and Phenix by its distinctive test for truth as an

empirical form of knowing by observation and experiment,

had to be considered in this broader perspective of

interrelatedness with other forms of knowledge and

generalised skills (Whitfield 1971:16).

The changes and propositions for science education

in the last decade reflected this relentless flux to meet

the challenge of renewal to stay relevant with times

(Whitfield 1971:8, Hurd 1971:1-12). It was precisely this

provisional endeavour that introduced and injected new

elements into science curricula propounding an integrated

approach in science education.

Thus, Woolnough (1994:12-14) advocates the aims of

education through science and education in science; the

former constituting the means to further general

educational aims such as interpersonal skills and self

confidence whereas the latter was related to the learning

of specific content and processes of science itself.

24

Newton (1988:33,42) alluded to a similar demarcation

by giving mention to ‘teaching about and through science’

in his argument for humanisation of science education.

This distinction was also delineated by Whitfield

(1971:20) in his call for an ability-centred view where

‘generalised human abilities’ not originating from a

particular subject matter but with related applications

is included to form an educated man with an assortment of

abilities.

In light of these viewpoints, I envisage this

conception as a continuum linking the extremes of science

domain elements and general domain elements for science

teaching aims (See Figure 1.1)

SCIENTIFIC GENERALDOMAIN DOMAIN

Figure 1.1

The cline suggests that at some point near the

middle of the continuum certain elements within that

range exudes both characteristics of scientific domain

and general domain, making the demarcation dubious. This

is where the discussion of what constitutes science and

what doesn’t intensifies leading to the next point of

contention in considering the aims of science education :

25

What should be included into the teaching aims of science

education?

1.2 WHO’S IN THE FAMILY? - THE MATTER OF

INCLUSION

Much has been written and said about what should

constitute science education. A sampling of science

curricula, science projects and the views of science

educationists divulges a broad spectrum of suggestions.

Clearly, this attests to the need for some kind of

categorisation and prioritisation of aims in science

education so as to achieve clarity in aspirations of

teaching and learning.

Meighan (1986:66) in his working definition of what

is known as the ‘hidden curriculum’, suggests that it

consists of the broad categorisation of all aspects of

learning in school not stipulated in the official

curriculum.

Synder (1970:4) writing on the same subject stated

that many tasks though explicitly expressed necessarily

involve a hosts of covert activities which are essential

for successful performance, encompassing the “how” of

learning. These aspects can be incorporated into my

26

earlier continuum of domains by introducing the

additional vertical axis representing the field of

explicit and implicit aims, thus formulating a working

model for categorising and prioritising the aims of

science education (See Figure 1.2).

EXPLICIT

A B

SCIENTIFIC GENERALDOMAIN DOMAIN

D C

IMPLICIT

Figure 1.2

In this context, I have sought to avoid the adoption

of the phrase ‘hidden curriculum’ because it is a

generally known connotation which includes many

sociological factors such as goals of school, the head,

the staff, the children, gender, race, community and

environment, teaching styles, achievement and assessment,

rewards and punishment, timetables, teacher expectations

and so on (Bottery 1990 : 98-100; Meighan 1986:66-163).

The inclusion of so many factors would certainly deluge

the working model and reduce it to a state of

impracticality.

27

Instead, I have restricted myself to the following

definitions :

Explicit aims are those that consist of the main

purpose and targets of science teaching and learning as

envisaged by a science teacher.

Implicit aims are incidental or an indirect consequence

of the explicit aims of science teaching and learning,

yet inclusive and desirable.

Hence, each element could be assigned to one of the

following segments in the matrix (Refer to Figure 1.2) :

A. Explicit aims from the scientific domain

B. Explicit aims from the general domain

C. Implicit aims from the general domain

D. Implicit aims from the scientific domain

When the myriad of elements have been clearly

designated, it would :

* make focusing on the main aims of the teaching-

learning experience easier

* enable more directed and purposeful planning of

lessons

* allow the consideration of the many facets of

learning science

* give versatility in planning lessons

28

1.3 FAMILY RELATIONSHIPS - THE MATRIX OF AIMS IN

SCIENCE EDUCATION

With this matrix as an operational classification

system, I will now embark on the task of attempting to

produce an integrated outlook of the composite aims of

science education. Undertaking this endeavour entails

making value judgements regarding the position of the

different elements within the matrix which could be a

contentious issue. Nevertheless, I hope that the exercise

may clarify at least some of the many aims of science

education. The complete matrix (Figure 1.3) will first be

given in its complete form and its elucidation and brief

considerations of each element ensues.

EXPLICIT

SCIENTIFIC KNOWLEDGE SOCIETAL NEEDS & (CONCEPTUAL UNDERSTANDING) RELEVANCE SCIENTIFIC METHODS HISTORY OF SCIENCE(PROCEDURAL UNDERSTANDING)

PROCESS SKILLS & LANGUAGE & WORDSMANIPULATIVE SKILLS

SCIENTIFIC GENERAL DOMAIN DOMAIN

SCIENTIFIC ATTITUDES MORAL VALUES &BELIEFS

29

SCIENTIFIC LITERACY PROBLEM-SOLVINGMETHODS

TECHNOLOGICAL THINKING SKILLSLITERACY

IMPLICIT

Figure 1.3TWO DIMENSIONAL MATRIX REPRESENTING

THE AIMS OF SCIENCE EDUCATION

1.3.1 SCIENTIFIC KNOWLEDGE (conceptual understanding)

1.3.1.1 FALSIFIABLE YET RELIABLE

It almost goes without saying that the acquisition

of scientific knowledge is one of the main venerable aims

of science education (Claxton 1991:110). Martin (1972:33)

identifies this as ‘propositional knowledge’- knowledge

that something is the case. However, Newton (1988) in

listing the products of science as concepts, theories,

generalisations and laws raises the issue of the

provisional nature of this type of knowledge because the

so-called scientific method which engenders it was

unclear.

Harlen (1988:4) traces this uncertainty to the

hypothetico-deductive view expounded by the philosopher

Popper that no hypothesis could be proved true but that

the testing should instead attempt to disprove a

hypothesis by falsification. This view which countered

30

the Baconian Model of verification by induction

(Woolnough 1985: 6) and ‘naive realism’ spawned the view

regarding the tentative nature of scientific theories

(Harlen 1993: 11-12).

Notwithstanding the provisional nature of scientific

knowledge, it is still the stuff with which the content

of science education and investigation is made up

providing the science teacher with the needed context to

manuevour (Nellist and Nicholl 1986:5). Ritchie

(1971:126) puts it succinctly and pragmatically : “A

criterion for an idea to be ‘scientific’ is, therefore,

not whether it is right, but whether it is useful in

solving existing problem, predicting new ones and

allowing communication between scientist.”

Thus, teaching of science that is commensurate with

the provisional nature of science should reflect this

dynamically evolving body of scientific knowledge. Yet,

at the same time, it should not damn science in the minds

of students by giving the impression that scientific

theories and facts are easily changed and highly

arbitrary (relativism).

Kuhn in his refutation of Popperian views, claims

that in ‘normal science’ the scientific community is more

committed to the further articulation of existing

paradigms than to replacing them with novelties of new

31

paradigms (Easlea 1973:11). This means that currently

accepted theories of science or what Ziman (1980:22,63)

calls ‘valid science’, that is science which is

recognised as valid by research scientists and found in

the ‘scientific archives’ (learned journals, books, maps,

computer tapes), have a great tenacity against change.

Polanyi (1962:138) supports this view, saying, “it

is normal practice of scientists to ignore evidence which

appears incompatible with the accepted system of

scientific knowledge, in the hope that it will eventually

prove false or irrelevant.”

Kuhn further suggests that only during periods of

‘scientific revolutions’ engendered by crisis situations

whereby there exists a growing sense that an existing

paradigm has ceased to function adequately to resolve

arising anomalies, does it force the scientific community

to replace the paradigm in whole or part by a new one

(Kuhn 1962 :92-98).

Such a view thus negates the notion of relativism

that scientific knowledge is in a hopeless state of flux

and uncertainty. Indeed, it asserts that scientific

knowledge is tentative yet rigorous and thus constitutes

a legitimate form and content for science education

(Ziman 1980:18-20).

32

In synthesising the various views as stated above,

one could conclude that students can be trained to

approach science with a healthy scepticism, that is

seeing theories which are new to them when presented are

potentially falsifiable (Popper’s view) but after

rigorous testing have sufficient confidence in proven

theories of normal science (Kuhn and Polanyi’s view).

1.3.1.2 CONSTRUCTING SCIENTIFIC KNOWLEDGE

Another matter which is very much in vogue regarding

scientific knowledge is the difference between

scientists’ science and what is known by an array of

terms such as “children’s science, misconceptions,

alternative frameworks/conceptions, personal constructs,

intuitive ideas or common-sense understanding of

science.” What the latter means is a personal knowledge

of the world which is constructed through personal

perceptions that is often antithetical to views held by

scientists (Osborne and Freyberg 1985, Shapiro 1994,

Driver 1985:1-9, Harlen 1993:14-53).

Thus, the teaching of scientific knowledge is

increasingly seen as a process whereby children exchange

or modify their existing ideas about phenomena in the

world for those of scientists. Gilbert et al. (1986:302-

305) suggests a more modest and manageable goal of making

children aware of an alternative viewpoint (that is, the

33

scientists’ viewpoint) and challenging them to construct

a more unified science view with more general uses. Nevertheless, Reiss (1993:89) warns against the

thinking that the goal of the science teacher is always

and merely replacing children’s prior ideas. Instead he

suggests that the teachers’ tasks should be to help

children see why their own thinking often works,

challenging them to develop this thinking by pursuing new

areas and not always attempting to falsify children’s

ideas. It is through this process that there is the

development of conceptual understanding (Gott and

Mashiter 1991, Harlen and Osborne 1985:142, Gott and

Duggan 1995:25-6).

Thus, Claxton (1991:112-7) proposes the aim of

helping children gain a more sensible working knowledge

of science theories through a practical outworking of

them in actual problem solving situations requiring

children to reflect on what they do. Qualter et al.

(1990:20-1) supports such a view purporting that

meaningful learning occurs in pupils in the context of

exploration where their existing framework of ideas are

tested against the real world and thus helping them to

modify them in light of their experience. Such a

development of personal knowledge of science is a

congenial aim for science teaching.

34

1.3.2 SCIENTIFIC METHODS (procedural understanding)

The problems surrounding the commonly accepted view

of scientific methods and thinking involving rigorous

testing by systematic observation, measurement,

deliberate experimentation by replication, deduction,

proposing hypotheses and rational theory and

communication (Hurd 1971:17, Pike 1964:82-84) has been

attested to in the previous section.

Millar (1994:166-7) even questions if there exists a

generally accepted view of scientific method. What is

clear is that the scientific method should not be viewed

merely as a uniform, precise and rigid set of procedures,

algorithms or rules to be followed to the letter (Hurd

1971:17, Millar 1994:167). Instead, the scientific method

which conceives new discoveries involves the utilisation

of two related factors, namely thinking and tacit

knowledge (Millar 1994:167,174, Hurd 1971:18) which bring

together formal technique and available theoretical

insight (Ziman 1980:60).

First of all, let us consider thinking in science.

Hurd (1971:18) asserts :

Science is not simply an abstraction from empirical data, but

an intellectual creation often suggested by the data. It is the

35

discovery of order among the data that makes the science, and

this process requires a constructive imagination, intuition,

and an intellectual command of relevant concepts.....science is

an intellectual activity which arises from personal experience

and takes place in the minds of men. It is simply a way of

using human intelligence to achieve a better understanding of

nature and nature’s laws.

Therefore, one of the aims of science teaching

should be that students are engaged actively in thinking

as they process data and ideas that accost them in the

learning discourse and situation. This also involves the

exercise of skills and decision making as they apply the

scientific method (Millar 1994:167). Thus the method and

thinking must go concurrently, hand in hand.

But, how does one make decisions when dealing with a

problem in science? The second factor of tacit knowledge

thus comes into play. Polanyi (1969) identified two types

of knowledge - explicit knowledge and tacit knowledge. As

Woolnough and Allsop (1985:33,34) explain, explicit

knowledge is knowledge “articulated and cognitively

assimilated into consciously formed theories” whereas

tacit knowledge “is never consciously articulated but

acquired directly through our senses and held in

readiness for more direct application. However, tacit

learning can occur as instructional teaching is given in

the initial stages but which ultimately will have to be

experienced to be permanently embedded. A good example is

36

learning to swim or ride a bicycle. One could be told the

techniques and shown how with guidance, but the learning

can only occur as the learner tries it out himself .

However, once learnt he can perform the tasks almost

without thinking and effort and would be hard pressed to

explain how he knows it! Furthermore, tacit learning is

permanent and retrievable when needed.

Tacit knowledge in science is thus learning that

occurs when a learner is continually exposed to phenomena

or materials resulting in him/her getting a ‘feel’ for

them (Woolnough and Allsop 1985:34). When the learner is

confronted with a decision-making situation, he/she can

draw upon his/her tacit knowing (from perception of

physiognomy, that is the outward appearance of something)

- a knowing that cannot be put into words. (Polanyi

1964:4). This certainly gives account to the value of

practical work and hands-on experiences in the learning

of science.

But, what of the ‘formal technique’ employed in the

doing of science which I have alluded to earlier?

Recently, there has been an increasing interest among

certain science educators in the need to articulate the

importance of what is called procedural understanding

regarding the teaching of science especially in the

context of practical work (Gott and Duggan 1995:23-39,

Duggan and Gott 1995, Gott and Mashiter 1991, Qualter et

37

al. 1990, Millar et al. 1994). What do they mean by

procedural understanding?

Gott and Duggan (1995:26) defines it this way :Procedural understanding is the understanding of a set of ideas

which is complementary to conceptual understanding but related

to the ‘knowing how’ of science and concerned with the

understanding needed to put science into practice. It is the

thinking behind the doing.

Qualter et al. (1990:glossary) and Gott and Mashiter

delineate this further by giving the procedures involved

namely:

identifying the important variables

deciding on the status of the variables - independent,

dependent or control.

controlling variables

designing an investigation so that the variables can be

manipulated

deciding on scale of quantities used

choosing the range, number and value of measurements,

the degree of accuracy and reliability of judgements

(and any corresponding instruments)

selecting appropriate tabulation, display and

interpretation

In order to avoid undue confusion later, I feel that

it is important to try and differentiate procedural

38

understanding from skills, may it be process skills or

manipulative skills. Gott and Duggan (1995:20) gives a

helpful distinction of skills : “skills...refer to activities

...which are necessary but not sufficient in themselves

to the carrying out of most practical work.” It is the

‘doing through thinking’ aspect of practical work in

science. On the other hand, procedural understanding is

the epistemological aspect of practical work, providing

the framework and basis for investigations in science. It

is, as it were, the essentials of the scientific method

employed in science education.

To further clarify this distinction, I refer to an

integral part of procedural understanding that Duggan and

Gott (1995) calls ‘concepts of evidence’ which are the

concepts underpinning the doing of science in relation to

the evidence as a whole (Gott and Duggan 1995:30). They

structured these concepts of evidence according to four

main stages :

Design of the task

Decisions for measurement

Methods of data handling

Evaluation of the complete task in terms of the

reliability and validity of the ensuing evidence

(See Appendix A for a detailed elucidation of each

understanding associated with each of the stated stages.)

39

Very often decisions based on procedural

understanding need to be made before or after the actual

‘doing’ of investigative work. It is the planning,

formulation and assessment of the framework for the

investigation. For example, at the design stage decisions

regarding which variable to be manipulated (independent

variable), which variable to be measured (dependent

variable) for each change in value of the independent

variable, which variables to be kept constant (control

variable), factors to ensure fair testing with sufficient

and appropriate sample size, have to be thought through

and made before actually designing the experiment.

At the measurement stage, decisions on the choice of

instruments which are suitable with the relative scale

and range / interval , how many readings to take

(repeatability) and suitable tables for recording

measurements have to be made before the actual setting up

of the experiment and taking of measurements. Then, after

the measurements and results have been taken, decisions

on whether the data is reasonable and makes sense

(reliability and validity) are made.

Therefore it is the procedural understanding with

the underlying concepts of evidence that essentially

provides the framework and basis for the investigative

work - the thinking behind the doing. Even though

40

procedural understanding are decisions often made before

or after the ‘doing’ of investigative work, in a way it

also encompasses the whole investigative process because

procedural understanding provides the framework for

investigations. Considerations on the basis of procedural

understanding will be revisited in the process of ‘doing’

as deemed necessary. Thus, procedural understanding is

complementary to the doing aspect of practical work

covered by process skills and manipulative skills.

Because of this basic role of procedural

understanding and the existence of content for concepts

of evidence, Gott and Duggan (1995) have espoused the

inclusion of teaching and assessment of procedural

understanding in science education, affording it the same

degree of importance as conceptual understanding.

Consequently, in the teaching of science, the

scientific method should be seen as involving the

thinking process and the utilisation of the tacit

dimension together with procedural understanding

interacting with prior and appropriate conceptual

knowledge.

1.3.3 PROCESS SKILLS AND MANIPULATIVE SKILLS

Emanating from the realisation of tacit knowledge,

is the approach of teaching or doing science as a craft,

41

that is the art of doing science (Millar 1994:174-5,

Woolnough and Allsop 1985:76). The learner is thus

envisaged as an apprentice to a craftsman, learning as it

were the ‘trade’ through the acquirement of personal

tacit knowledge (Ravetz 1971:73).

This encompasses the methods of scientific work

(procedural understanding), tacit knowledge and the

skills required to do the work. This leads to the third

aim of science education from the matrix in the implicit

aim from the science domain, commonly known as process

skills and manipulative skills.

Watts (1991:137) differentiates skills from

processes by saying that “a skill is an individual’s

contribution to an activity, or towards an objective”

while “a process is a more general means to an end.”

Process skills would encompass abilities such as

observing, classifying, inferring, predicting,

controlling variables, hypothesising, measuring and using

numbers, using the relationship between space and time,

defining operationally, interpreting information,

communicating and experimenting. (Pusat Perkembangan

Kurikulum 1993 :3,4).

It is important to note that process skills are the

cognitive processes, the thoughts that go through the

pupils’ minds as they perform practical science

42

activities - and not the practical manipulative skills

associated with science (Gott and Duggan 1995:19). To

reiterate a distinction which I have already made in the

previous section, process skills is the thinking that

goes on while ‘doing’ science activities while procedural

understanding is the thinking that is done before and

after the ‘doing’ providing the framework and basis for

the investigation.

Process skills have been given new prominence in the

teaching of science for the past decade. From the

Nuffield view of the ‘pupil as scientist’ (Millar 1991:

44-45) to process-led science projects like Warwick

Process Science, Science in Process, TAPS (Techniques for

Assessment of Practical Science) and Science - A Process

Approach, it has certainly enjoyed a very high profile

(Millar and Driver 1987:34-36).

Nevertheless, the process approach is not without

its detractors. Millar (with Driver:1987, 1991, 1994) in

particular has repeatedly questioned the rationale of

process-led curricula and the validity of inductive

overtones in the accentuation of the process skills. His

argument against the teaching of process-led curricula

trails along this line of thinking :

The ‘processes’ as commonly espoused by the process-led

approach are features of general cognition that

43

everyone uses routinely without need for formal

instruction. This means that even from childhood

children already independently start to observe,

classify, hypothesise and so on.

The process-led approach which views process skills as

actual discrete processes which can be taught in

isolation, fails to see that the successful utilisation

of such strategies are actually content/task/context

dependent. In other words, the constructs and prior

knowledge of the pupils bears very much on his success

or failure to perform the task. Furthermore, learners

do not organise knowledge around processes. Therefore

the process-led approach actually makes it more

difficult for pupils to make links to facilitate

understanding.

There is no evidence for the transference of such

processes across domains. Thus, the value of such

emphasis on processes is doubtful.

Nevertheless, Millar is not against the teaching of

more specific or particular kind of process skills. By

this he meant that, instead of taking process skills as

general cognitive thinking skills, the teaching of

scientific thinking should be characterised by the

constructs and purposes that are employed (Millar and

Driver 1987:44). The scientific context and content for

the teaching and utilisation of these skills should be

clear. Thus, for example, it is ‘scientific observation’

44

rather than ‘observation’ that the science teacher should

be concerned about.

In one of his writings, he has endeavoured to

demarcate and isolate general processes from tactics and

techniques (see Figure 1.4)

‘Practical skills’

General cognitive Practical techniques Inquirytactics processes-observe -measure temperature with -repeat measurements,-classify a thermometer to within 1º C -drawgraph to see trend -hypothesise, -separate a solid and a liquid

in data etc. by filtration -identify variables to

etc. alter,measure, control,

etc.

(Cannot be taught) (Can be taught and improved)

Figure 1.4 Sub-categories of ‘practical skills’(Millar 1991:51)

In cutting this fine line of distinction, and with

similar arguments as outlined above, he makes the

conclusion that general cognitive processes cannot be

45

taught but practical techniques and inquiry tactics can

be taught and improved. Thus, a student can be taught to

observe the temperature as recorded by a thermometer.

Hence, the student is not taught to observe but to

observe scientifically. To emphasise this distinction,

perhaps it would be appropriate to call this context-

dependent skills as scientific process skills.

This distinction also enabled him to develop the

area of tactics in his later work with the Procedural and

Conceptual Knowledge in Science (PACKS) project where he

and his associates researched into the area of procedural

understanding, which we have already considered in the

previous section. Their conclusion was that procedural

understanding must be explicitly taught, and not assumed

that it will be ‘picked-up’ (Millar et al.1994:245).

Nevertheless, the findings of the Cognitive

Acceleration through Science Education (CASE) project has

some interesting conclusions. Basing their work on

Piaget’s theory of cognitive development the researches

formulated 30 activities (published under the name

‘Thinking Science’ by Adey et al.:1989) which covers 10

schemata or reasoning patterns which Inhelder and Piaget

proposed were the complete cognitive structure of formal

operations (Adey and Shayer 1994:17), namely:

Control of variables

46

Proportionality

Compensation

Probability

Combinations

Correlation

Classification

Formal models

Compound variables

Equilibrium

(Gott and Duggan 1995:36)

It is interesting to note that the above list has

much in common with scientific process skills and

procedural understanding. For example, control of

variables, compound variables and classification are

terms frequently used for some process skills and

procedural understanding. Similarly, most of the other

cognitive skills also relates to making interpretations

about the relationship between variables like

proportionality, compensation and equilibrium,

correlation and probability. Their basic treatise was

that suitably employed intervention strategies could

accelerate the development of formal operational thinking

(Adey and Shayer 1994:38-59) and reap benefits in the

form of improvement in the pupils’ overall standards of

achievement.

47

According to Adey and Shayer (1994:60-75) the

features of a successful cognitive intervention programme

are as indicated in Table 1.1 below:

FEATURES OFGOOD

INTERVENTIONDESCRIPTION

Duration & density

Concrete Preparation

Cognitive conflict

Construction

Metacognition

Bridging

Intervention at a steady rate over a long period.

Establishing familiarity with the vocabulary, apparatus and framework in which a problem situation will be set.

Presenting an event or observation which the students find puzzling and discordant with previous experience or understanding.

Activity in which the conflict is at leastpartially resolved as students’ minds go beyond their previous thinking capability.

The sense of conscious reflection on the problem solving process and naming of reasoning patterns developed, for future use.

Application of these reasoning patterns tonew contexts in order to (a) generalise them (b) consolidate their use

Table 1.1 Features of good cognitive

intervention strategies

The two year programme was implemented in nine

schools with pupils of 11-12 years where once every two

48

weeks a ‘Thinking Science’ lesson substituted a regular

science lesson (usually a 70 minute period) (Adey and

Shayer 1994:85). The main test instruments to measure

cognitive development were Piagetian Reasoning Tasks

which were administered both before and after the

intervention.

The results obtained after the two year period trial

indicated that after the intervention the experimental

group of pupils showed significantly better levels of

cognitive development than the control group, but no

better performance in science in each school’s end of

year science tests (Adey and Shayer 1990). Nevertheless,

three years after the intervention, the experimental

group performed significantly better at GCSE, not only in

Science but in Mathematics and English as well (Shayer

and Adey 1992, Adey and Shayer 1994:98-103).

Adey and Shayer (1994:103) interpreted the results

this way : “On the face of it, the evidence presented of

long-term far-transfer of the effect of the intervention,

set in a science context, to enhanced achievement in English two

and three years later, supports the hypothesis of a

general cognitive processor which can be positively

influenced by appropriate intervention strategies set in the

context of the ordinary curriculum.”

49

These results seem to support the notion that

incorporating scientific process skills and procedural

understanding into the context and content of science

(conceptual understanding) with suitable intervention

strategies is a win-win situation. It not only improves

achievement in science and other subjects but it also

accelerates general cognitive thinking.

Thus, it seems reasonable not to reject process

skills per se but to be more precise in our thinking and

speech of it. It should be acknowledged that process

skills should be taught in the context of science

content. Scientific process skills and content should go

hand in hand. This is what Kirkham (1989:148) advocates

when he wrote about a balanced science which achieves

equilibrium between context, process and content. He

explains that “we need a content framework that will co-

ordinate with the process framework and over which we can

lay the context framework.”

Particular manipulative skills involving practical

skills like using a thermometer, measuring cylinder,

stop-watch and so on, should be skills that pupils are

taught and trained to do (Woolnough and Allsop 1985) so

that they can perform the scientific process skills

competently. In short, the teaching of scientific process

skills should be an endeavour to promote the tendency to

think in a manner that takes into consideration

50

scientific ideas, reasoning and skills to help solve

problems.

51

RECAPITULATION

The three explicit aims of science education from the scientific domain

are scientific knowledge (conceptual understanding), scientific methods

(procedural understanding) and process skills and manipulative skills.

Scientific knowledge, although provisional is still very reliable and stable and

therefore a worthwhile explicit aim. The goal is to help children develop

conceptual understanding utilising their innate curious nature and their prior

ideas regarding natural phenomena of the world around them so that they

get closer to that of the scientists’ science.

Children also need to understand how this scientific knowledge is

formulated, that is the scientific method involving procedural understanding,

intellectual thinking and the use of tacit knowledge in investigating and

solving problems in science. Together with process skills, they are all engaged

in the learning of the craft of science - in devising experiments and tests,

measuring, structuring investigations, analysing data and findings,

formulating hypotheses and theories. Manipulative skills are of course

necessary practical skills to carry out these investigations.

Without scientific knowledge and content, there can be no science.

Without the scientific method, there can be no determination of scientific

knowledge. Without the process skills and manipulative skills, there can be no

practical ways of working out the method of science. The interlocking of these

three elements thus blends naturally together to form the explicit aims of

science education from the science domain.

52

1.3.4 SOCIETAL NEEDS AND RELEVANCE

“It is surely time that curricular units be composed to serve the

ends we want. They are in this respect socially determined and need

constantly to be reconsidered.”

(Hirst

1974)

Science education surely cannot be excluded from

this social pressure that Hirst wrote about. The

Industrial Revolution of the 19th century witnessed the

form of science, then known as ‘natural philosophy’,

attempt to isolate itself from the “vulgarities” of

practical knowledge by creating ‘pure science’ which

valued scientific knowledge just for knowledge sake and

retreating into their refuge in universities (Aikenhead

1994:15).

This tension and schism between pure science and

applied science which the natural philosophers hoped to

isolate science from possessing a moral or social

responsibility was irrevocably blown apart during World

War II as epitomised by the production and deployment of

the atomic bomb by the team of scientists assembled under

the leadership of Robert Oppenheimer. In the aftermath of

the tumultuous event, Oppenheimer’s confession that

“physicists have known sin” (Solomon 1993:14; Aikenhead

1994:16) betrayed the emerging realisation that

53

scientists and inevitably science itself has a social

responsibility and a moral role.

In addition to this, Ziman (1980:57) referring to

the apparent and seemingly unending dissension between

Robert Merton, Thomas Kuhn, Karl Popper and Michael

Polanyi regarding the nature of science (see discussion

in 1.3.1) , suggests poignantly that it ironically

reveals a basic agreement - that science must be

represented by a social model. This is because the

formulation of scientific knowledge is not merely an

individual process but a communal act forming consensual

knowledge applicable for the community. Ziman (1980:54)

extends this line of thought by saying that ‘valid

science’, that is science as taught conventionally in

schools and universities (Ziman 1994:21), should be

viewed with societal links especially through its

technological applications. Hodson and Prophet (1994:34)

then delineate this by saying that school science should

be viewed as socially constructed, that is, “being the

product of particular sets of choices made by particular

groups of people at particular times”.

Such thinking served as a harbinger to what evolved

into the popularly known ‘science, technology and

society’ (STS) curriculum. In this context, Chapman

(1994) reiterates the need to not blur the distinction

between science and technology. He distinguishes the two

54

in this way: “sciences are about understanding nature;

technologies are about the application of reliable

knowledge, some, but not all from science.” Even though

the two are inevitably linked and related, yet in

practice people often use technologies without knowing or

understanding the sciences behind it. For example, a user

of computer need not know the functions of the hardware

components to use the unit successfully.

Aikenhead (1994:48-49) further describes the essence

of STS as student centred, that is the student linked to

his natural environment through the science component

(the traditional science discipline content), to his

artificially constructed environment through the

technology component and to his social environment

through the society component (see Figure 1.5).

SCIENCE

Natural Figure 1.5 : Environment The

essence of STS EducationAikenhead

(1994:48)

STUDENT

Artificially Constructed Social

55

Environment Environment

TECHNOLOGY SOCIETY

56

Aikenhead (1994:49) goes on to list an impressive

array of aims of STS in the area of social

responsibilities :

individual empowerment

intellectual capabilities such as critical thinking,

logical reasoning, creative problem-solving and

decision making

national and global citizenship, usually “democracy” or

“stewardship”

socially responsible action by individuals

an adroit work force for business and industry

Even though an elucidation of the above aims does

not fall within the purview of this dissertation, it

should be evident by now that there is indeed a dire need

to make science education relevant and not eschew

societal needs. The integrated perspective fits this aim

of science education nicely into the available matrix as

an explicit aim from the general domain. This is because

for science to meet societal needs and be relevant, it

will undoubtedly pull science into association with

economics, industry, global problems and issues that

affect the quality of life which are more of a general

nature.

There is another way of viewing relevance, that is

to see that science education should be relevant to the

57

students’ world. Newton (1988:35-6) comments that while

much of STS curriculum considers science, technology and

society, it tends to be weak on its relevance to the

needs of the individuals. This is especially important

with regards to younger children whose egocentricity

means that their world has not quite developed beyond

themselves, their families and friends.

58

Hence, Harlen (1993:37) writing regarding the

teaching of primary science emphasises the importance of

science content encompassing the children’s immediate

everyday experience of the world so that it matches their

conceptual ability, their closer affinity to relate with

it and thus possessing greater potential for retention.

Of course this aspect of relevance equally applies

to secondary science learning but with a wider girth

encompassing the local community, the nation and the

world community.

Newton (1988:40) envisaged this progression as in

Figure 1.6 below:

A B

Science and the individual

Figure 1.6

Science and society

A’ B’

Thus, he illustrates the increasing proportion of

wider societal elements as the children increase in age

from AA’ to BB’. However, his illustration does seem to

59

suggest that individual and societal elements are

mutually exclusive which in reality is not so. The

individual is part of society. Thus, I would like to

propose a slight variation to Newton’s illustration

(Figure 1.7).

60

Science and society C Figure 1.7

A B Science and the individual Increasing

Age

In the above illustration, it is alluded that the

content of science for the very young has more relevance

to his immediate individual world (Area A). As he grows

in age (moving to the right of the diagram), wider

societal elements begin to be introduced in the child’s

science content (Area B) with still direct relevance to

his individual world, thus emphasising that he is part of

society. Nevertheless, there will also be societal

elements that might not be related to him directly (Area

C). For example, the study of nuclear energy for children

in third world countries that do not have nuclear

reactors. Therefore, we maintain the individual as part

of society as they expand their perspective on the

applications of science and technology in society.

1.3.5 HISTORY OF SCIENCE

“The history of science has long been regarded by

scientists and other academics as the most natural medium

61

for humanising science education.” (Ziman 1994:26) This

is because in the history of science there exists

excellent case studies illustrating the fallible nature

of scientists, the process involved in discoveries,

inventions and innovations, the evolution of ideas and

cultures, and the influence of science and technology in

society - all of which are part of the explicit aims of

science education already referred to in the preceding

sections. (Claxton 1991:122, Ziman 1994:26, Solomon

1993:40-42).

Therefore, the history of science fits easily into

the matrix as an explicit aim from the general domain. In

utilising the history of science as lesson contents

whether in the form of introductions to related science

topics, enactments through drama of scientific events or

replicating experiments of past scientists (Newton

1988:42-55), it humanises science which could often be

seen as cold and unfeeling. It is in this process that

students get a feeling of the human endeavour in real

life stories. They also get a picture of how ideas,

understanding and models of thinking are conceived in the

dynamics of the scientific enterprise and begin to

understand that it is just as much a creative work as in

the field of arts.

Nevertheless, the warning of both Ziman (1994:27)

and Sherratt (1986:203-7) should be heeded in the usage

62

of the history of science. They lamented the tendency to

oversimplify the portrayal of the history of science so

as to give the narrow view that science progressed

because of the achievements of certain individuals.

Instead, care should be taken to show that progress in

science was and is a collective enterprise and a social

endeavour of the community - a perspective very much in

line with the societal needs and relevance of science

education as espoused in the previous section.

1.3.6 LANGUAGE AND WORDS

Sutton (1992:2,3) warns of the tendency to give so

much importance to practical work in the teaching of

science that it so dominates lessons leaving very little

time for students to actually reflect on ideas. Carre

(1981:6,7) also argues that the learning by ‘doing’

should be followed up with reflection upon what they have

done through thinking, talking and writing. Students

should be given time to construct meanings from their

experience. Sutton and Carre thus emphasise the

importance of language and words in the learning of

science.

Meighan (1986:144-5) poignantly articulates the role

of language in learning by saying:

...in acquiring a language...an individual is not simplylearning a neutral sign and symbol system for personal and

63

idiosyncratic use, but is also taking on ideas, meanings,conceptualisations, theories, attitudes and judgements whichare deposited in the language system when they were firstproduced. In a very real sense language represents a highrepository of interpretations and knowledge by which the ideasand recognitions of our ancestors are preserved and handed onto succeeding generations. Thus, in learning a language we areexposed not only to predetermined definitions of the world butalso predetermined explanations of it.

This certainly applies to the role of language and

words in the learning of science. Sutton (1994:62,63)

asserts that science lessons should be viewed as learning

the ideas already formed by the scientific community

(what Khun calls the established paradigms) and not the

actual study of nature itself. He further suggests that

practical work should not be the source from which ideas

come from but only to get a feel of the phenomena.

Therefore he proposes what he calls ‘Word Work’ as

the main activity of science teaching and learning and

not ‘Bench Work’ (practical work). By this he meant that

more time should be allocated for students to reflect,

extricate and construct for themselves the meanings

behind the language and words used to express ideas of

science rather than in actually doing practical work in

the laboratory situation (Sutton 1994:63). He sees a

parallel style of teaching with that commonly applied in

the teaching of literature, whereby students are first

exposed to the source and given sufficient time to

extract, construct and appreciate the meaning intended by

the original author. So, it is for the propositions

64

usually presented to students during the learning of

science (Sutton 1994:18).

Even though I agree with Sutton regarding the need

to have ‘Word Work’, I would think it more practical to

see it as one of the repertoire of styles that could be

employed by a science teacher. Therefore, certain science

lessons can be predominantly students engaging in ‘Word

Work’. At other times, practical work dominate but with

‘Word Work’ integrated into the lesson. This gives the

teacher the flexibility to choose what he feels is the

best way his students can construct ideas and learn the

meanings behind language and words. Nonetheless, Sutton’s

warning that science teachers should not be too caught up

and obsessed with practical work should be heeded and

taken very seriously.

Nevertheless, language and words are common entities

utilised in all fields of knowledge. In fact, commonly

used scientific terminology and nomenclature are also

sometimes used with layperson meanings, for example words

like pressure, react and mass. Because of this, I feel it

is more appropriate to designate language and words as an

explicit aim of science education from the general

domain. This completes the explanation on the group of

explicit aims from the working matrix.

65

RECAPITULATION

Scientific knowledge is a social construct by the scientific community.

Therefore, students, in the process of learning science, are extracting and

constructing meanings behind the language and words that encapsulates the

ideas of scientific knowledge as have been formulated by the scientific

community. The role of the science teacher is to help his students develop his

simple ideas of the world and phenomena closer to that of scientists’ science.

The history of science provides real case studies that illustrate the

evolutionary process of formulating the consensual scientific knowledge and

thus may help in humanising science lessons. Practical work where students

are involved in ‘hands on’ activities require the use of scientific method and

thinking and process skills, giving them a personal feeling of the process.

Science education should also reflect its relevance by helping students

see the applications of science especially through technology and its role in

society. The students gain an ever widening understanding of the world in

which they live and develop skills that they could apply in society.

Therefore, the six elements of explicit aims for science education are

very much interrelated and if presented in an integrated manner with

sufficient overt elucidation of this interconnection, students will hopefully

develop a balanced picture of what constitutes science. This suggests that the

stated elements will often feature concurrently in the teaching of particular

topics in science. They will form the main bulk of content in science lessons

and are foremost in the aspirations of science teachers.

66

1.3.7 SCIENTIFIC ATTITUDES, MORAL VALUES AND BELIEFS

Moving on to the implicit aims of science education,

we will first consider scientific attitudes, moral values

and beliefs. Harlen (1993:72) defines attitudes as “the

state of being prepared or predisposed to react in a

certain way to particular objects, persons or

situations.” It is a tendency to act or undertake some

course of action when faced with a range of similar

situations. Therefore, Harlen (1993:73) stresses that an

attitude can only be said to have been acquired if it is

manifested over and over again in a general pattern such

that it becomes predictable or typical of that person.

But what are scientific attitudes? Of course many

attitudes are applicable and appropriate for all kinds of

learning. But scientific attitudes are those that have

direct relevance to science (Harlen 1993:73) and affects

an individual so that he possesses a propensity to

approach life and its problems scientifically (Martin

1988:146). Scientific attitudes also affect learning

experiences in science in that they encourage the change

of personal ideas and influence the knowledge systems

held by individuals (Secondary Science Curriculum Review

1984:7). Therefore, scientific attitudes do in certain

ways determine the extent of learning that an individual

engages himself.

67

There are a variety of lists from various sources

that encompasses these scientific attitudes but they

generally have much more in common rather than

differences. The Secondary Science Curriculum Review

(1984:7,8) provides a typical list that was expanded from

that of the Assessment of Performance Unit : open-

mindedness, self-criticism, independence of thought,

responsibility, perseverance, co-operation, scepticism,

desire to be well informed, confidence, respect,

willingness to be involved, sensitivity, enthusiasm,

tolerance, persuasiveness, questioning and trust.

On surveying this list, the connotation that can be

derived is that some of the elements can also be regarded

as values especially if one adopts a broad definition for

values as that given by Graham that is values are “what

people think to be good” (Poole 1990:25). Nevertheless,

values are more than just attitudes for they also

engender moral choices and decisions. We can visualise

the relationship between moral values and scientific

attitudes as in Figure 1.8 below:

Moral Values

ScientificAttitudes Figure 1.8

68

The application and teaching of science is

unavoidably and intimately connected to moral choices

(Martin 1972:160) especially in the context of the role

of science in society (see Section 1.3.4). Indeed, Newton

(1988:16) espouses the inclusion of values and moral

decisions to humanise science. Poole (1995:11) lends

additional support for such a view by referring to recent

studies in history and philosophy of science that

revealed how beliefs and values have “thoroughly

permeated the scientific enterprise”, so much so that he

feels that the two cannot be separated in the theory and

practice of science education.

This brings us to the third and probably the most

controversial element as introduced by Poole, that of

beliefs. Belief is something accepted as true (Oxford

Paperback Dictionary 1988:70) and for many people this

includes religious beliefs. But are religious beliefs and

science related in any way?

Christianity, Judaism and Islam teaches that the

world is the creation of God and thus man is to manage

his environment responsibly (Poole 1990:255-6). Buddhism

and its teaching of reincarnation propagates a deep

69

respect for living things. Hinduism and most tribal

religions that leans towards pantheism sees perpetual

connection between God and His creation.

Woolnough (1989:133) referring to the relationship

between science and society raises the question of what

type of society that the teaching of science should be

related to - whether the secular, materialistic and

humanistic culture or one that recognises the existence

of God and its teachings and beliefs.

With such a large following of adherents to various

religions in the world and the fact that most religions

do not teach the compartmentalisation of life into

secular and sacred, it seems reasonable that the teaching

of science should, at least, provide some assistance for

students to consider where religious beliefs and moral

values can be available options of understanding the

world.

Furthermore, science and religion are indelibly

etched and linked in the pages of history. Not only were

many eminent scientists like Copernicus, Galileo, Boyle,

Newton, Faraday, Maxwell and Kelvin believers of God, but

many dedicated their life and work with a perspective of

unfolding the ‘hand and mind’ of God in His creation

(Woolnough 1989:133)

70

Einstein probably says it best (Woolnough 1989:134):

You will hardly find among the profounder sort of scientificminds without a religious feeling of his own...His religiousfeelings take the form of a rapturous amazement at the harmonyof the natural law, which reveals an intelligence of suchsuperiority that, compared with it, all the systematic thinkingand acting of human beings is an utterly insignificantreflection. This feeling is the guiding principle of his lifeand work. It is beyond question closely akin to that which haspossessed the religious geniuses of all ages.

Even in the celebrated and often cited case of the

trial of Galileo by the Roman Catholic Church as a point

for incongruity between science and religion, it should

be noted that Galileo, who himself was a believer of God,

was not opposing religion per se but was in his own mind

defending and protecting the church from erroneously

linking false ideas of man with Biblical texts which

could bring untold damage to the church (Poole 1990:30-

31, Poole 1995:106-113).

Hummel (1986:119-20, 122-23) suggests further that

the actual group that engineered the opposition and

eventual condemnation of Galileo was the scientific

establishment of his day - that of Aristotelianism. This

group of scientists had long been involved in a running

debate with Galileo over his support of Corpenicus

hypotheses of heliocentrism. After failing to refute

Galileo’s assertion and arguments scientifically, they

turned the tide by making it into a theological issue and

dragged the Roman Catholic church into the controversy.

71

Therefore, this incident should not be naively construed

as a clear-cut case of religion versus science as is

commonly projected.

In light of the above points, it seems inexplicable

to dogmatically promulgate the fallacy that all reputable

scientists regard religion as a waste of time and

irrelevant, even to the extent of opposing it. This is

why I support the call of Poole, Woolnough and Reiss

(1993,1992) to deal with the science/religion issue as it

inevitably crops up in the process of learning science.

Nevertheless, we should always bear in mind the

intrinsic differences between science and theology, the

study of religion. For example, Hummel (1986:204)

addressing the biblical and scientific view of nature

succinctly points out that the former reveals the who and

why of the universe, the Creator and His purposes for

nature and humanity while the latter is more concerned

about the how of natural events. To a certain extent,

they address the same matter but from different

perspectives and for different purposes. Therefore,

science and religion do not necessarily contradict each

other but rather complement each other as they address

different aspects of the same matter.

Furthermore, Hummel (1986) adds that the biblical

message employs the everyday language of sense perception

72

which is accessible to all and for all times while

science seeks to convey its message through the more

specialised and precise technical and mathematical

language. Therefore, this distinctive and contrasting use

of language is another significant difference that should

be clearly delineated in any discourse about science and

religion. It is not a matter of which type of language

being more accurate but which style is more appropriate

for their different purposes. In short, we should be

careful not to force science or religion say anything

that is not its basic intention, but should recognise

their different purposes, perspectives and use of

language. Any failure to do so would plunge one into a

mire of confusing debacle that reaps no benefits to

anyone. Therefore it is vitally important that we should

bear in mind that essentially science and religion seek

to answer different questions pertaining to life but

nevertheless are inevitably intertwined on many fronts.

Expanding from the previous illustration (Figure

1.8), we can view the relationship of the three elements

and scientific knowledge in the making of moral decisions

and choices as in Figure 1.9 :

Scientific Knowledge

Moral

73

Values

Scientific BeliefsAttitudes

MORAL DECISIONS ,CHOICES & JUDGEMENTS

Figure 1.9

The illustration thus makes it clear that the

interplay between scientific attitudes, moral values,

beliefs and scientific knowledge is the basis upon which

moral decisions, choices and judgements are made.

Another question that needs to be considered at this

point is : Why are these three elements considered

implicit aims of science education? Poole (1990:67-8)

suggests that beliefs and values implicitly come into

science education. In expanding this view, I suggest that

scientific attitudes, moral values and beliefs will

inevitably arise in the course of learning science. It is

unavoidable though it can be ignored!

74

Furthermore, I think that these three elements are

more caught than taught. Children pick it up from models

they observe and are extremely astute in detecting

anomalies between what is explicitly ‘preached’ by the

teacher and what is openly practised. Thus, Reiss

(1992:129) suggests that these related issues can be

openly dealt with as they arise naturally in the course

of science lessons. Sometimes, the science teacher could

even initiate the discourse that relates to the three

stated elements.

Therefore, in saying that the three elements are

implicit aims, I mean that they do not need to be the

main concern of science teachers as they plan their

lessons. These will arise naturally from the explicit

aims that we have already determined (1.3.1-1.3.6). The

science teacher can then decide when and how to deal with

these issues openly with his students.

1.3.8 SCIENTIFIC AND TECHNOLOGICAL LITERACY

Jacobson and Bergman (1980:74) describe scientific

literacy as “the ability to understand and discuss

developments in science and technology as they are

communicated via newspapers, magazines, television and

other public media”. This entails making personal

judgements and opinion upon issues arising from societal

concerns that affect the quality of life. Therefore,

75

Reiss (1993:33) suggests that science education that

seeks to produce scientifically literate students should

enable them to apply their knowledge and understanding of

science learnt in school to everyday issues.

Nevertheless, Jenkins (1990:48-49) points out that

though scientific literacy seems to be a noble cause for

science education, yet it suffers from too differing

interpretations by different advocates regarding what it

actually means. Furthermore, when it is delineated into

detailed aims it usually amounts to a large and

formidable list that seems to be depressingly

unattainable.

For example, Krugly-Smolska (1990:476) quotes the

Ontario lists that describes the scientifically literate

person:

understands scientific concepts, principles and conceptualnetworks;

puts scientific knowledge to practical use; realises that science is a human endeavour involving both

process and product; knows that all phenomena cannot be understood immediately

and that sometimes theories are used to provide tentativeexplanations;

recognises not only the usefulness of science but itslimitations as well; knows that scientific knowledge issubject to change; and distinguishes between scientific factand personal opinion;

uses reliable scientific and technological information inthe process of personal life management and societaldecision making;

communicates both his/her scientific knowledge and theprocess of inquiry;

76

applies science concepts, theories, processes, and values ininvestigating everyday problems and in making responsibledecisions;

respects the ethics of the scientific method; appreciates the importance of the scientific enterprise and

responds to the stimulus and enrichment afforded by thescientific community;

interrelates science and technology, realising that both ofthem influence and are influenced by society;

relates science to other fields of learning, such as socialsciences, humanities, health sciences, business andtechnological studies, and the arts;

has a rich and exciting view of the universe, the world, andthe environment as a result of a knowledge of science, itsapplications, and its implications;

develops an interest in science-related skills - cognitive,manipulative, and attitudinal - that may be applied tolifelong learning, to career awareness, and to leisurepursuits;

interacts with various components of his/her universe in amanner that is consistent with a sound environmental ethic;

values scientific and technological research and development(Ontario Ministry of Education 1987, pp.13-14)

For this reason, I have placed scientific and

technological literacy as an implicit aim of science

education from the science domain in the matrix. Like

other explicit aims considered in the preceding section,

they arise naturally from the main content derived from

the explicit aims of science education. Furthermore, much

of what constitutes scientific and technological literacy

overlaps considerably with the elements from the explicit

aims.

1.3.9 PROBLEM-SOLVING METHODS AND THINKING SKILLS

77

Lally (1994:219) defines problem-solving as “the set

of mental and physical strategies used to reach the goal

or aim to complete the project”. This implies that time

and effort is required by the ‘solver’ to reach the

‘solution’ which initially had no immediate answer or

steps leading to a resolution.

Watts (1991:131-3, also in Lally 1994:220)

differentiates two types of problem-solving in science -

he calls them PS1 and PS2. By PS1 he means problem-

solving as a purely intellectual process, usually solving

quantitative, well defined, ‘pencil and paper’ problems

commonly associated with traditional academic science

education.

On the other hand, PS2 is problem-solving as an

implementation tasks which has a practical and largely

qualitative solution involving real life contexts. It is

the open-ended, problem-solving investigations described

by Lock(1990), whose tasks are usually not so well

defined but suitable to help develop problem-solving

strategies in individuals.

Watts (1991:131) observed that from the late 1980s,

problem-solving activities were becoming more of the PS2

type - puzzles, design and make, extended project work

and ‘real-world’ problems that usually require skills,

processes, problem-solving methods or design process. He

78

adds that this is the result of the search for relevance

and application of science especially through technology.

A very much related development is that of the

emerging call to teach thinking skills - critical and

creative. Hirst and Peters (1970:31) assert that critical

thinking does not just happen or develop naturally but

has to be nurtured and built into individuals. Barnes,

Britten and Rosen who were English specialists (quoted by

Meighan 1980:109) advocate the teaching of science to be

more concerned with thought rather than just mere

information. Baker (1991:3) suggests even further that

the highest level of thinking that is metacognition be

taught to students so that they can begin to take

responsibility for their own learning and thus become

independent learners.

Problem-solving methods and thinking skills are very

much in line with current aspirations of science

education to train students so that they are able to use

their skills and knowledge in dealing with every day

problems. Nevertheless, I have opted to include them as

implicit aims of science education because they usually

arise naturally during lessons based on the explicit

aims, especially when students are engaged in the

scientific method and thinking during ‘bench-work’

(practical work) or as they grapple with meanings during

‘word-work’.

79

RECAPITULATION

The implicit aims of science education emanate naturally from the

explicit aims of science education. It does not need to be awkwardly contrived

and forced into science lessons. Scientific attitudes which are more caught

than taught is nurtured in the course of involvement in practical work as

students apply process skills and engage themselves in the scientific method

and thinking. In the process, students are also learning and practising

problem-solving methods and thinking skills. Moral values and beliefs arise

naturally when appropriate issues relating to societal needs and relevance

are contemplated. It can also arise as students delve in the annals of the

history of science.

Scientific and technological literacy arise as the application of science

in society forms the content of science lessons. It is hoped that in the process

students will become independent, self-directed learners who know how to

access information, extract and construct meaning for application and make

societal decisions.

Even though, the six elements have been designated as implicit aims,

yet they can be dealt with overtly in the lesson when they emanate naturally

from the explicit aims. They are implicit in the sense that they need not be the

main concern of the science teacher as they plan the content of their lessons.

They are as it were waiting at the peripheral of the main content of science,

waiting to emerge at appropriate junctures of the teaching-learning process.

Science teachers need to be cognisant and alert to these implicit aims so that

he or she can capitalise on the situation when it arises.

80

1.4 THE FAMILY WORKING TOGETHER - THE MATTER OF

PRACTICAL APPLICATION

I will now attempt to pull together the ideas and

conclusions from the preceding sections to form a working

model for practical application in the teaching-learning

endeavour of science education.

The teaching of science should be student-centred.

Therefore, the aims of science education which have been

considered are what science teachers hope their students

will acquire and be able to demonstrate as proof of

having learnt them. I summarise these desired

performances into three simple categories as the student

knowing, doing and feeling. (See Figure 1.10)

81

IMPLICIT AIMS

EXPLICIT AIMS

KNOWING

STUDENT

DOING FEELING

Figure 1.10 MODEL FOR STUDENT-CENTRED SCIENCE EDUCATION

Firstly, the student comes to be in a state of

‘knowing’ when he is involved in activities that helps

him extract and construct meanings locked behind the

concepts, ideas, words and language of science as

formulated by the past and present scientific community.

Secondly, the student comes to be in a state of ‘doing’

when he is involved in practical work and

investigations. Thirdly, the student comes to be in a

state of ‘feeling’ when he is constantly exposed to

natural phenomena and situations which elicit personal

responses.

82

‘Knowing’ is the objective category, ‘doing’ is the

semi-objective category and ‘feeling’ is the subjective

category. These categories encompass the explicit and

implicit aims of science education in that they are

inter-linked as seen in Figure 1.11 below:

83

STUDENT

extracting &constructingmeaning practical work

& investigations constant exposure

KNOWING DOING FEELING

(objective) (semi-objective) (subjective)

Scientific Knowledge Scientific Methods Tacit Knowing

EXPLICIT Societal Needs and Relevance Process SkillsAIMS History of Science Manipulative Skills

Language and Words

IMPLICIT Scientific Literacy Problem-solving Methods Scientific AttitudesAIMS Technological Literacy Thinking Skills

Moral Values & Beliefs

Figure 1.11 WORKING MODEL FOR SCIENCE EDUCATION

For a science teacher, the three important steps in

the teaching of science are planning, implementation and

evaluation. At the planning stage, he focuses on the

explicit aims to form his main content material while

bearing in mind and anticipating the emergence of

situations that would foster the implicit aims. He should

also always be aware of his students’ different

abilities, prior knowledge and inclinations as he plans

so that the lesson is suited to their needs.

84

At the implementation stage, all the elements can

come into play but his focus is still on the explicit

aims while being constantly aware and alert to the

implicit aims arising here and there.

Finally, at the evaluation stage, various questions

regarding assessment has to be answered first. Is it a

formative or summative assessment? Is it norm referenced

or criterion referenced? Perhaps an even more basic

question need to be asked -what does he want to assess

and can it be assessed? These are questions that will be

our primary concern in the next chapter.

Nevertheless, going back to our categories, probably

the easiest area to assess will be the objective

‘knowing’ category especially from the main aims of

science education. The semi-objective ‘doing’ category

will be more difficult and challenging, while logically

the subjective ‘feeling’ category will be even more

difficult.

85

1.5 FAMILY UP FOR ADOPTION - THE MATTER OF

IMPLICATION FOR MALAYSIA

1.5.1 A BRIEF OVERVIEW OF THE HISTORY OF SCIENCE

EDUCATION IN MALAYSIA AND ITS CURRENT SCENARIO

Science education first became a part of formal

education in Malaysia in 1930 when an expatriate science

teacher, F. Daniel, developed a General Science course

for all pupils at the secondary level (Molly 1992:252).

Rote learning of scientific facts was the norm then.

The Ministry of Education began science curriculum

reform with the inception of the ‘Special Project’

(Projek Khas) in 1968. Resource books for primary school

teachers of science and mathematics were produced to help

them teach those subjects by the enquiry method. This

reform was carried on to the secondary school level. In

1969, the Scottish Integrated Science syllabus was

adapted for the teaching of science at the lower

secondary level (Form 1-3 Ages 13-15). The different

disciplines of science which was called ‘Modern

Physics/Chemistry/Biology For Malaysian Schools’ was the

pursuit of students in the science stream for the upper

secondary level (Form 4-Upper Six, Ages 16-19) while the

arts stream had the integrated ‘Modern Science for

Malaysian Schools’ (Rampaian Sains). This was the time

when practical work and recipe-like worksheets were a

86

normal feature - the emphasis was ‘learning by doing and

discovery’ through guided investigations after the spirit

of the Nuffield Science projects (Allsop 1991:32; see

also Table 1.2).

Name of project Level Basis Year of introduction

Projek Khas (Special Project)

Primary I-VI 1968

Integrated Science for Malaysian Schools

Forms I-III Scottish Integrated Science Syllabus

1969

Modern Physics/Chemistry/ Biology for Malaysian Schools

Form IV-V (Science stream)

Nuffield Physics/ Chemistry/ Biology O level

1972

Modern Science for Malaysian Schools

Form IV-V (Artsstream)

Nuffield Secondary Science

1974

Table 1.2 School science projects in Malaysia (1960-1979) (Molly 1992:254)

Nevertheless, Lewin (1975:11-12) reported that in

spite of the introduction of the reforms, the teaching of

science had remained very much teacher-centred and

didactic - a situation he perceived due to the assessment

exams which tested the recall of facts, thus engendering

the prevalence of rote learning techniques.

The influential Cabinet Report on the Implementation

of Education Policies in the year 1979 engendered the

back to the basics movement bringing educational reform

which eventually resulted in the formulation and

implementation of the New Primary School Curriculum

87

(NPSC) at the primary school level in 1983 (Educational

Planning and Research Division, EPRD 1989:26,75). Primary

Education was demarcated into two phases : Phase I (Year

1-3, ages 6-9) and Phase II (Year 4-6, ages 10-12). At

Phase I, the 3 R’s of reading, ‘riting and ‘rithmetic

were emphasised. Malay Language, English Language and

Mathematics were the three main subjects. Science as a

subject was thus discarded.

Science contents only made its appearance at Phase

II, and even then not as a single subject of its own but

as the integrated, interdisciplinary general subject

called ‘Man and His Environment’ (Alam dan Manusia).

The science content was freely mingled with that of

civics, local studies, local history and geography.

Unfortunately, this did not turn out well as most of

the teachers who taught this subject came from the art

stream and were not confident of teaching the science

content. The science aspect was thus either badly taught

(using the rote learning and expository style of passing

on facts) or even worse completely ignored! Science

laboratories in primary schools were closed down and/or

converted to ordinary classrooms while the science

apparatus was locked away in obscurity.

In 1991, Malaysia’s Prime Minister, Datuk Seri Dr.

Mahathir Mohamad’s momentous paper “Malaysia : The Way

88

Forward” was the harbinger to the national ‘Vision 2020’

which seeks to transform Malaysia into an industrialised

and fully developed nation by the year 2020 (EPRD

1989:74-75). Dr. Mahathir outlined nine strategic

challenges for the nation (see Appendix B) of which the

sixth was “establishing a scientific and progressive

society, innovative and forward looking, not only as a

consumer to technology but also as a contributor to the

scientific and technological civilisation of the future”.

This certainly meant that more Malaysian students

need to be educated in the sciences. Reality was on the

contrary as after 10 years of NPSC, there was a

significant drop in students who pursued science based

subjects in the higher secondary level. The Ministry of

Education took immediate steps to check this decline and

science was reintroduced as a subject in 1994 beginning

from Phase II of the primary school level.

The New Primary Science Curriculum (1993:7,8)

covered five fields of science: Living Environment,

Physical Environment, Material Environment, the Earth and

the Universe, and the Technological World. Scientific

skills encompassing process skills and manipulative

skills was given prominence together with the science

content. Thinking skills, attitudes and values were also

clearly outlined and specified as part of the aims of the

new curriculum (Pusat Perkembangan Kurikulum 1993:5,6).

89

Meanwhile, at the secondary school level, the

Integrated Secondary School Curriculum (ISSC) was

introduced as a continuity of the NPSC. As is suggested

in its name, the new curriculum sought to present

education as an integrated whole with strategies like

language, values and skills cutting across the

curriculum.

Science was a core subject meaning that it was

compulsory for every student at the Form 1-3 (ages 13-15)

level. It was an integrated science syllabus. At Form 4-5

(ages 16-17), an integrated science syllabus was

maintained also as a core subject to be taken by all

students except those who took the three individual

science subjects of physics, chemistry and biology.

Additional Science was also introduced as an optional

subject for those students who had opted for more art-

based subjects to take so that they could still switch to

the science disciplines after their Form 5 if they chose

to do so and performed reasonably well in this subject

and mathematics. Clearly, the Ministry of Education was

making an effort to encourage more students to take up

science.

The secondary school science syllabus was a

continuity of the primary science syllabus with the same

emphasis on scientific knowledge, scientific skills and

90

values. (Introductions in ‘Huraian Sukatan Pelajaran

Sains Tingkatan I-III, Sains Tambahan Tingkatan IV and V,

Fizik, Kimia, Biologi Tingkatan IV and V).

1.5.2 PROPOSALS FOR APPLICATION IN THE MALAYSIAN CONTEXT

(a) MORE TIME TO EMPHASISE QUALITY AND DEPTH

The first application I would like to propose for

science education in Malaysia is more emphasis on quality

rather than quantity, more depth rather than breadth. It

is a common complaint of teachers in Malaysia that the

science syllabus covers so much ground that they have to

rush and push through each topic. I suggest that more

time is needed for science teachers to allow their

students to achieve knowing, doing and feeling.

Students need more time to slow down and extract and

construct meanings. Thinking, contemplation, discourse,

discussions, debates and investigations cannot be hurried

or compressed without losing their desired aims. The

flurried pace at which science teachers are forced to

teach science lessons because of the constraints imposed

by the overloaded science curriculum is not very

conducive for such reflective and active learning.

91

The role of practical work in science education also

needs to be reviewed and thought through. In Malaysia,

science lessons seem to be experiments after experiments,

each different from period to period. Students are

usually expected to finish their experiment within a

period of about one hour, write up their report and hand

it in to be marked and returned, so that the whole cycle

could be repeated but with a whole new topic.

Then, there is the matter that these experiments are

usually closed, convergent and carefully tailored to

illustrate or confirm theories and concepts (Qualter et

al. 1990:4). It is highly questionable whether the

enormous quantity of experiments that Malaysian students

undergo are actually meaningful or make sense to them. Is

it merely the process of students trying to discover what

is in the mind of their science teachers and producing

those answers that is desired by them? If that is the

case, what is the advantage of such an approach over that

of didactic expository approach? As Ausubel et al.

(1968:377) states emphatically : “Yet science courses at

all academic levels are traditionally organised so that

students waste many valuable hours collecting and

manipulating empirical data that, at very best, help

students to rediscover or exemplify principles that the

instructor could present verbally and demonstrate

visually in a matter of minutes.” The game of ‘students

92

trying to discover the mind of their teachers’ is not

what investigations is all about.

As we have seen in our earlier discussion, practical

work can be viewed as craft training. Therefore students

need to be given time to get a feel of the phenomena,

acquire tacit knowing and scientific attitudes, apply

scientific methods, process skills and manipulative

skills and engage in thinking, problem solving and

abstract theoretical construction. Such a view engenders

the need to have less experiments but in a more

comprehensive and in depth manner. Meanings have to be

deliberately extracted from the investigations. ‘Word

Work’ can replace some experiments and is certainly

essential in each investigation undertaken. All these

require science teachers to slow down and take time to go

for quality and depth in their investigative practical

work.

In practice then, I suggest that every time science

teachers conduct practical lessons on a particular topic,

they allocate three to four periods (each period being

between 30-40 minutes) for students to do experiments,

another one period for students to discuss in their

various groups and write up their methods, observations

and conclusions and following that another period to

present or discourse with the other groups and their

93

teacher their results. ‘Word Work’ will be the main focus

during the final period for discourse.

Some time should also be allocated in the school

time table to allow students to suggest or present

whatever topics related to science that interests them.

Thus, the science teacher encourages his students to keep

rough notes of matters that interest them. From time to

time the teachers can then ask them to present what they

have noted.

Another related reason why I propose quality over

quantity in science lessons is because we need to see

students in Malaysia taking responsibility for their own

learning. The spoon-feeding situation where the teacher

determines what the students learn is not adequate in

light of the explosion of knowledge. It is impossible to

teach everything that the student needs before they leave

school. Therefore, it is better to train students how to

access information and extract meaning for application.

The science teacher plays the role of trainer, motivator,

stimulator, guide, mediator and facilitator (Qualter et

al. 1990:xiv).

(b) MORE HUMANISED SCIENCE FOR RELEVANCE AND MEANING

In Malaysia, science tends to be taught in a cold

and unfeeling manner - hard facts and theories are dealt

94

with. This can be one of the reasons why more and more

students opt out of science. Science is also often taught

as if scientific knowledge is unchanging. Therefore, I

propose that science education be humanised.

The nature of science as provisional yet applicable

knowledge, socially constructed and communally verified

needs to be imbedded in science lessons. The history of

science, which is often neglected and often seen as

unnecessary in science lessons, could be utilised as

excellent case studies for the often turbulent and hard

road leading to scientific discoveries and revolutions.

The students will begin to appreciate the process of

constructing scientific knowledge and not have the often

simplistic and fairy-tale like view that scientific

discoveries are the easy work of a few brilliant

individuals. Controversies, competing theories, social

and political factors, moral values and beliefs should

not be swept under the rug in the consideration of the

history of science education.

Of course, science education cannot always be locked

in the past. The present and the future should also

feature prominently in science lessons. Thus, current

issues relevant to individuals and society may it be

local, national or global needs to be introduced more

rapidly into science education. The computer with the

rapidly developing world of CD-ROMS and Internet, needs

95

to be incorporated into science education. All these will

help humanise science education and develop interests and

appreciation of science.

1.5.3 STEPS TO BRING ABOUT CHANGE IN SCIENCE EDUCATION

FOR MALAYSIA

The proposals in the previous section requires some

radical paradigm shifts. I feel three steps need to be

taken before these changes can be realised, namely :

The review of science curricula

The re-education of science teachers

The rethinking about assessments

Science curricula and syllabus need to be trimmed

down so that teachers have more time to emphasise quality

and depth. It also needs to be more open ended so that

current issues relevant to Malaysia and its role in the

global world could be more easily incorporated as needed.

Nevertheless, any efforts to bring change is doomed

to failure if science teachers are not convinced of the

rationale behind the changes. Certainly, science teachers

needs to be re-educated to a certain extent. They need to

96

see that science education do not have to be so academic

but more practical and relevant. Science teachers needs

to gain confidence that they can keep in touch with

developments and current issues. They need to model the

dynamics of science in their lessons.

The third step is crucial. Undoubtedly, Malaysian

science teachers or even teachers in general teach with

the final assessment and examinations in mind. Their

teaching approach will be adapted to produce the optimum

results in examinations. Of course this is a valid and

reasonable reason. Thus, a rethinking about assessments

needs to be undertaken. It is this important factor that

I will now consider.

97

CHAPTER 2 : ASSESSMENT IN SCIENCE EDUCATION

2.1 RENEWAL IN APPROACHES TO ASSESSMENT

2.1.1 THE EARLY REIGN OF MEASUREMENT (NORM-REFERENCED)

MODEL AND TRADITIONAL ASSESSMENT METHODS

For many years, perhaps the greater part of this century, theassessment of academic learning proceeded with little changeand with little perceived need for change.

(Biggs 1995:1)

The institutionalisation of mass education in the

nineteenth century engendered the introduction and

proliferation of educational assessment. This coupled

with the pragmatic need for some kind of mechanism for

selection in the rapidly expanding industrial economies,

popularised the measurement (norm-referenced) model of

assessment because the high stakes environment compelled

a high premium being put on assessment methods that

appeared to be fair and objective with high levels of

reliability (Broadfoot 1996:35-6).

What is this measurement model? This model is based

on the trait theory or the theory of individual

differences (Taylor 1994:236). Guilford (1959:6) defines

a trait as “any distinguishable, relatively enduring way

in which one individual varies from others”. This would

98

include physiological and psychological variables like

weight, height and intelligence. It was Galton’s earlier

discovery that these variables could be represented by

the normal curve which led to the belief that measurement

of individual traits could be reported relative to a

distribution of measures of that variable for other

people (Taylor 1994:227). This became the basis of

psychometric procedures which were similarly employed for

the measurement model.

99

Taylor (1994:236) thus outlines three distinguishing

assumptions that undergirds the measurement model :

1. Individual differences : humans consistently differ from oneanother on various human traits.

2. Relative measurement : One individual’s measurement on atrait can be reported relative to the distribution of othersimilar individuals’ measurements on that trait.

3. Reliable measurement : We can develop instruments thatreliably measure these individual differences on trait.

The third assumption concerning reliability was a

great concern in test item production. Reliability is the

consistency with which a test or other device measures

whatever it measures (Hanna 1993:8). In the context of

the predominantly multiple-choice type of questions in

testing, reliability was inevitably linked to the

discriminating power of the test item which is the

ability to differentiate examinees with higher and lower

scores on tests (Taylor 1994:238,240). The value of each

test item was judged on this basis and also on its

independence from other items. Therefore, items where

almost all testees answered in a similar way, whether

correctly or otherwise, were duly discarded as it did not

function as a discriminator. This preoccupation with

reliability eventuated a concentration by test producers

on that which was readily measurable, such as content

knowledge.

In line with this, a feature that developed into the

distinguishing mark of what is known as the traditional

100

(or conventional) technique of assessment is the

summative purpose of assessment. It was designed to

provide information at the end of a unit’s instruction

about students’ terminal achievement. This type of

assessment, being in the form of multiple choice

questions, was quick and easy to administer and process.

It soon came to dominate the educational landscape in

many countries including Malaysia.

101

2.1.2 THE GROWING REVOLT AGAINST THE TRADITIONAL

ASSESSMENT METHODS AND THE MEASUREMENT MODEL

After more than a century of unabated reign,

rumblings of discontent about generally accepted

traditional methods of assessment based on the

measurement model began to slowly grow in intensity and

frequency. Let us consider five of the main reasons for

this growing revolt.

(a) It is independent of the instructional process

One of the main complaints about traditional methods

of assessment was that it was separated from the daily

practical activity of teaching and learning. Wiggins

(1989a:704) likens this practice to that of only taking

the pulse rate of a patient - it might reveal a problem

but it has no effect whatsoever on improving the

patient’s physical health! Similarly, summative

assessment which is carried out at the end of the

teaching unit, usually reveals pupils’ learning

deficiencies and difficulties but the results will have

almost no consequence to the pupils’ actual learning

process. Wiggins (1989a:704) asserts that assessment is

central to instruction saying, “Any tests and final exams

inevitably cast their shadows on all prior work. Thus

they not only monitor standards, but also set them.” In

other words, teachers tend to teach to the test. What and

102

how they teach will depend on what and how the assessment

is like.

Maeroff (1991:275) brings this line of thinking one

step further by saying that assessment should drive

instruction, and vice versa, just like the way the front

and rear axles impel one another in a four-wheel drive

vehicle. Thus, if assessment is sheared off from

instruction, the findings of the assessment may not tell

a great deal about what students have actually learnt

from their classroom experience. Furthermore information

from such assessment has very limited use or relevance to

actual teaching. Qualter et al. (1990:94) notes that at

most these assessments tell how well (or badly) pupils

have done at year end (whatever that means!), can be used

for placement or streaming purposes or for the next

year’s planning for the teacher. Nevertheless, they

bemoan the fact that it does not help teachers detect

pupils’ areas of difficulty early enough to correct or

take remedial steps to help them in their learning.

(b) It does not primarily measure educational objectives

As we have noted earlier, the emphasis on

reliability of the measurement model means that items for

a test are selected first and foremost for their

discriminating power. Therefore, even though

representation of learning objectives within a content

103

area is a consideration when selecting items for a test,

the tests are not primarily developed to provide solid

information about how well students are learning various

objectives (Taylor 1994:241). For example, if all pupils

answered a certain question correctly, that item is

quickly discarded because of its poor discriminating

power. On the other hand, it could also mean that the all

pupils have learnt that matter well!

Bloom et al. (1981:52-3) write succinctly :

Education is a purposeful activity and we seek to have thestudents learn what we teach. If we are effective in ourinstruction, the distribution of achievement should be verydifferent from the normal curve. In fact we may even insistthat our educational efforts have been unsuccessful to the extentthat the distribution of achievement approximates the normaldistribution ....

Thus, Haertel (1991) is of the opinion that such

approaches to assessment lacked a basis in educational

theory and the knowledge base of teaching.

(c) It is weak in validity

Validity tells us whether a test measures what it

claims to measure, how well it measures it and what can

be inferred from that measurement (Neill and Medina

1989:690). Wiggins (1993:202) claims that because of the

preoccupation with reliability, validity is often

sacrificed. This means that the test-makers of

traditional assessments who develop multiple choice

104

questions generally end up being more concerned with the

precision of scores than with the intellectual value of

the challenge. Therefore, the forms of testing and

scoring were usually indirect and generic, designed to

minimise the ambiguity of tasks and answers. The tests

make the complex simple by dividing it into isolated,

independent and simplistic chores that assess only

artificially isolated “outcomes”. Consequently, the

problems were usually contrived and the cues artificial.

Such assessment, according to Wiggins (1993:202) lacked

the power to reveal whether students have the capacity

and ability to use wisely what knowledge they have.

(d) It is inadequate to measure changing educational aims

A climate of rapid technological change has

revolutionised educational aims so that they reflected

the need to train people who have the appropriate range

of skills and attitudes to cope with and undertake a

variety of roles. (Broadfoot 1996:36). Therefore, higher

level thinking skills, problem solving ability, personal

and social competencies and attitudes, skills and

processes have become new educational aims. Traditional

assessment methods are evidently inadequate to measure

such educational aims.

For example, in science education, we have

considered the practical ‘doing’ of science. Scientific

methods (procedural understanding), process and

105

manipulative skills and scientific attitudes are just

some of the aims that will be very difficult to assess

summatively solely through paper and pencil tests. In any

case, it would only give a partial and perhaps even

misleading picture of pupils’ actual abilities.

Therefore, Wiggins (1989a:704) notes that assessment

is most accurate and equitable when it entails human

judgement and dialogue, so that the person tested can ask

for clarification of questions and explain his or her

answers. Of course, traditional assessments methods have

no provision for such practices.

(e) It is not in line with the new view that pupils learn by actively

constructing ideas.

A view of learners as passive absorbers of facts,

skills and algorithms while teaching was conceived as

transmitting knowledge from teacher to learner has been

sidelined by new research findings. As we have considered

in the previous chapter, learners are actually active

participants who build their own understanding of subject

matter by constructing their own interpretations and

relate new information to their existing knowledge and

understanding (Osborne and Wittrock 1983). This new view

of student learning prompts Wilson (1992:125) to advocate

the need to start measuring understanding and models that

individual students construct for themselves during the

106

learning process. Traditional methods of testing which

provide a ‘snap-shot’ of a single point in time however

cannot and do not assess nor demonstrate the growth in

knowledge and performance (Raizen and Kaser 1989).

2.1.3 CORONATION OF THE STANDARD (CRITERION-REFERENCED)

MODEL?

In the 1970’s, a different form of assessment came

into being - the standard (criterion-referenced) model.

Taylor (1994:243) gives the assumptions of this model as

follows:

1. Public educational standards can be set.

2. They can be reached by most students, albeit by

different kinds of performance.

3. Fair and consistent judgements are possible by

educators trained to determine whether the

standards have been met.

The marked difference and improvement in the

standard model was the elevation of clear educational

objectives, thus emphasising the link between assessment

and instruction. Students were aware of the tested

objectives though not of its specific content. This was

an important development as we have seen that teachers do

teach to the test. Thus an assessment based on course

objectives has the added potential to promote the

achievement of those educational objectives. Because of

107

this, Raizen and Kaiser (1989) are of the opinion that

assessment can be a powerful tool for reform where

changing the nature of assessment can lead to changing

the nature of instruction.

Nevertheless, the early proponents of the standard

model faltered when they adopted the traditional methods

of assessment, where tests were given in a single

sitting. Thus, in essence, they ended up with the same

set of problems as accounted in the previous section.

2.1.4 THE SPREAD OF THE STANDARD MODEL - PERFORMANCE

ASSESSMENT

As long as we hold simplistic monitoring tests to be adequatemodels and incentives for reaching our intellectual standards,student performance, teaching and our thinking and discussionabout assessment will remain flaccid and uninspired.

(Wiggins 1989a:713)

Educational reform and renewal in the eighties

entailed a plethora of ideas on assessment. The United

Kingdom (UK) led the way through the seminal work of the

Assessment of Performance Unit (APU) set up by the

Department of Education and Science in 1975. At that

time, the purpose was to monitor performance in several

of the major curricular areas on a national level

(Christofi 1988:38). The APU team devised, conducted and

refined through trial and consultation an innovative

approach to assessment by using extended tests on small

108

samples of pupils which sought to assess them in Science,

Maths and English through their performance (Gott and

Duggan 1995:41). The process observed in the pupils as

they performed their tasks was assessed as was their

final product. Even though it was not the expressed aim

of the APU team to formulate an assessment framework for

individual performance , yet their findings in this area

proved to be very influential and was incorporated into

the first National Curriculum for UK. Furthermore, the

techniques developed could easily be adapted to be used

by classroom teachers for their own purposes (Christofi

1988:38).

In England and Wales, the 1988 Education Reform Act

(ERA) heralded a National Curriculum which described the

matters, skills and processes to be taught as ‘programmes

of study’ and the knowledge, skills and understanding as

‘attainment targets’ (ATs) which all pupils were expected

to have reached at the end of each of the four ‘Key

Stages’, that is Key Stage 1(ages 5-7), 2 (ages 7-11), 3

(ages 11-14) and 4 (ages 14-16).

The interesting feature was the national assessment

programme as outlined in the report of the Task Group on

Assessment and Testing (TGAT, 1988) which stipulated that

regular assessment of students against these ATs was to

be by both continuous teacher assessment (TA) and

Standard Assessment Tasks (SATs), the latter administered

109

at the end of each Key Stage. Clearly, this was a

standard model of assessment but with a defining

difference from the earlier standard model described in

the preceding chapter - it had internal assessments

conducted by the teachers themselves (TA) which used

performance assessments over a period of time (Broadfoot

1996:41). At the ages 7, 11, and 14, the internal TA’s

results were combined with the external SATs and reported

as a holistic assessment of each pupil towards the end of

that school year (McCallum 1996:74-5).

Across the waters to the United States of America

(USA), similar reforms in assessment were sweeping across

the educational scenario. ‘Authentic assessment’,

‘alternative assessment’, ‘portfolio assessment’,

‘dynamic / responsive assessment’ and ‘performance

assessment’ were topics which dominated discussions on

assessment in the educational circle (Wiggins 1989a,

1991, 1993; Maeroff 1991; Taylor 1994; Darling-Hammond

1994). This plethora of terms essentially all meant the

same thing - it was an indication that qualitative

judgement by the practitioner(teacher) through the

assessment of performance over a period of time

(longitudinal) and which influenced teaching instruction

was gaining acceptance among educators. This was evident

when Pipho (Taylor 1994:234) reported in 1992 that 40

states the USA have begun legislating for or were

110

developing new assessments that progressed toward

standards.

Broadfoot (1996) reported similar changes in

Australia and France while Rogers (1991) noted the same

in Canada. What is distinctly clear is that performance

assessment has arrived posing a serious challenge to

traditional methods of assessment. Bateson (1994:234)

describes this as a paradigm shift on the part of many

educators, administrators and politicians which involved

a complete shift in basic beliefs and values in

measurement.

Broadfoot (1995:37) gives a perceptive summary of

this emerging new culture in educational assessment :

1. An increasing emphasis on formative, learning-integratedassessment throughout the process of education.

2. A commitment to raising the level of teacher understandingand of expertise in assessment procedures associated withthe devolution of responsibility for quality assurance inthe certification process.

3. An increasing emphasis on validity in the assessment processwhich allows the full range of curriculum objectivesincluding cognitive, psychomotor and even affective domainsof learning to be addressed by the use of a wider range ofmore ‘authentic’ techniques for gathering evidence oflearning outcomes.

4. An increasing emphasis on describing learning outcomes interms of particular standards achieved - often associatedwith the pre-specification of such outcomes in a way thatreflects the integration of curriculum and assessmentplanning.

5. An increasing emphasis on using the assessment ofindividual pupils’ learning outcomes as an indicator of thequality of educational provision, whether this be at thelevel of the individual classroom, the institution, thestate, the nation or for international comparisons.

111

2.2 RELEVANCE OF NEW APPROACHES OF ASSESSMENT TO

SCIENCE EDUCATION

2.2.1 CLARIFYING THE PLETHORA OF TERMINOLOGY

At this juncture, it will be advantageous to clarify

some of the terminology frequently employed in

discussions about assessments. I will begin with the more

generic terms first (See Table 2.1).

112

TERMINOLOGY DEFINITION AND CLARIFICATION QUESTION(S) TOBE ANSWERED FOREDUCATIONISTS

Test An instrument or systematic procedurefor measuring a sample of behaviour, frequently for selection or placement. What is important is the instrument’s predictive power rather than its content. It is usually administered in a one-shot time-constrained situation.

How well does the individual perform - either in comparison withothers or in comparison witha domain of performance task?

Measurement The systematic process of obtaining anumerical (quantitative) description of the degree to which an individual possesses a particular characteristicor attribute.

How much did the individual get?

Assessment The process of collecting, interpreting and synthesising information (either quantitative or qualitative or both). It is to determine the extent to which pupils are achieving instructional objectives - usually aimed at gaugingeducational outcomes.

What has the individual learnt?How good is theindividual in it?

Evaluation The application of judgement concerning results of assessment to planning teaching and learning - to aid in decision-making for future educational instruction.

What do I (the teacher) changeor implement inthe future?

Table 2.1 Generic terminology and their definitions(Collated and adapted from : Gronlund 1985:5, Glasser

1990 as quoted by Darling-Hammond, Hanna 1993:6-7, Foden1993:161)

Testing is just one type of measurement. Measurement

is limited to quantitative descriptions of pupils always

113

expressed in numbers (for example : Lim correctly

answered 25 of the 50 questions). It does not include

qualitative descriptions (example: Tina’s work is neat

and systematic). Assessment, on the other hand, pertains

to either or both of the following : quantitative

descriptions (measurement) or qualitative descriptions

(non-measurement) of pupils. It involves making value

judgements concerning the individual (example : Bala is

making good progress in science) and is usually an on-

going and continuous process.

Evaluation links assessment to future planning of

teaching and learning by being the conduit that channels

assessment findings to be applied in instructional

decisions. Foden (1993:153) shows this inter-linking in

the cyclic pattern as in Figure 2.1 below :

Planning

Reporting Implementing

Evaluating Assessing

Recording

114

Figure 2.1 The teaching cycle

We will now consider the many ‘new’ approaches to

assessments that have been gaining increasing acceptance

and popularity among educationists (See Table 2.2)

115

ASSESSMENT/METHOD

DESCRIPTION

Performance Assessment

Authentic Assessment

Portfolio Assessment

Teacher Assessment (Internal Assessment)

Dynamic / Responsive Assessment

Standardised Testing

Profiling

The pupil completes or demonstrates the same behaviour that the assessor desires to assess.

The pupil not only completes or demonstrates the desired behaviour, but also does it in a real-life context.

The pupil either independently or with the teacher’sassistance, judges and submits for assessment his/her best work completed over a period of time which includes repeated drafts of the same piece of work in order to demonstrate the development (improvement) the pupil has achieved over time.

The practitioner (that is the teacher teaching the pupils) assess his/her students’ actual performance as they carry out the given task over a period of time. It is an internal, school-based or classroom-based assessment.

Intervention strategies are dynamically employed by the instructor cum assessor to help pupils attain abetter level of understanding as they perform the tasks. Assessment is made on the pupils highest learning achieved after intervention.

Tests (often paper and pencil type) whose measurement is deliberate, planned, separate from other instructional activities, administered and scored under controlled or standard conditions. (It can be wide-scaled or nationally distributed tests. In the USA it is usually multiple-choice type of questions which are commercially produced)

The method of displaying the results of an assessment. It is a panoramic representation, numerical, graphical or verbal, of how a student appears to assessors across a range of qualities, orin respect of one quality seen through a range of qualities.

Table 2.2 Different types of assessmentand method

116

(Collated and adapted from Christofi (1988:5-6), Neilland Medina (1989:688-9, Meyer 1992:39-40, Hanna 1993:3-4,Bachor et al 1994:255, Henning-Stout 1994:29-51)

Let us consider two examples of planned assessment

in science that elucidates some of the assessment

described in Table 2.2.

CASE 1 : THE PAPER TOWEL INVESTIGATION

This is an assessment of hands-on science used by

Baxter et al (Baxter et al 1992, Shavelson and

Baxter1992). Year five pupils were given laboratory

equipment such as beakers, a scale, trays, eye dropper

and stop-watch. They were asked to determine which of

three different paper towels soaks up the most water.

They were also told that they could use all or some of

the equipment provided, as they deemed necessary. The

pupils were instructed to record in a notebook the steps

of the investigation they planned and implemented, the

equipment(s) used, the variable controlled and the basis

in which they arrived at their conclusion. They were to

record it in such a way that “a friend could replicate

the investigation”. The science teacher then observed

students conducting their planned investigation on an

individual basis using the scoring system as indicted in

Figure 2.2.

Student : ___________ Observer : ____________ Score : __________Script : _____________

117

1. Method : A. ContainerB. Drops C. Tray (surface) Pour water in/put towel intowel on tray / pour water on Put towel in/ pour water inpour water on tray / wipe up 1 pitcher or 3 beakers/glasses

2. Saturation A. Yes B. No C. Controlled

3. Determine Result A. Weigh towel B. Squeeze towel / measure water (weight or volume) C. Measure water in/out D. Time to soak up water E. No measurement F. Count number of drops until saturated G. See how far drops spread out H. Other __________________

4. Care in Measuring Yes No

5. Correct Result Yes No

Grade Method Saturate DetermineCare in Correct Result Measuring Answer A Yes YesYes Yes Yes B Yes YesYes No Yes/No C Yes Yes/ControlledError Yes/No D Yes NoMissing Yes/No F -------------------------------------NoAttempt-----------------------------------

Figure 2.2 Paper towels investigation - hands-on scoreform

The scoring system which graded a pupil’s

performance from A to F, was decided on the scientific

soundness of the method employed by the pupil at each of

the five categories - method, saturation, determine

118

result, care in measuring and correct result. This was

done through observation by the teacher as the pupil did

the investigation.

CASE 2 : INDEPENDENT REPORT ON A SCIENCE RELATED TOPIC

Pupils were given a choice to either conduct an

independent investigation and write a full report on it

or write an article on a specific topic which they were

interested in and had done a library research. The pupils

worked at their own pace through out the school year

making appointments to meet with their teacher to report

on their progress from time to time, when they felt ready

to do so. During those appointments the teacher would

question the students’ thinking and plans so as to help

them improve on their work. They then submitted a folder

with all their resources, drafts of their planning ideas,

diagrams and written work together with the final

report/article. The teacher kept an on-going assessment

record of each pupil which was incorporated into the

profile of the student.

DISCUSSION

Case 1 is an example of a performance assessment.

The pupils were asked to perform specific behaviours like

carrying out a fair test (as they designed and

implemented the method), controlling variables (type of

119

saturation), care in measuring and correct interpretation

of results. The assessor (in this case the teacher)

observed each pupil as they did the investigation and

assessed their performance according to pre-specified

behaviours stipulated on the score form. These are the

general characteristics of performance assessment. But is

it an authentic assessment? No! Because the context was

contrived - in real life, individuals seldom plan,

implement and make deductions on investigations in one

sitting under particular time limits.

What about Case 2? Is it a performance assessment or

an authentic assessment? The answer is both! The folder

acting as a portfolio contained the examples of actual

student performance, although much of the structure

associated with testing has been removed. It is also an

authentic assessment because the performance is assessed

in a context more like that encountered in real life. For

example, the students worked at their own pace, deciding

how long to spend on the various stages of the

investigation/library research, creating and writing as

many rough drafts as they saw necessary before deciding

on their final copies.

Therefore, performance assessment refers to pre-

specified student responses to be examined in process

while authentic assessment has that extra dimension of

the context in which that response is performed. As Meyer

120

(1992:40) wrote, “While not all performance assessments

are authentic, it is difficult to imagine an authentic

assessment that would not also be a performance

assessment.”

Case 1 and 2 are also examples of internal teacher

assessment where the teacher was the assessor. Case 2 is

also an example of a portfolio assessment (where a

collection of the students’ developing work were

assessed). Furthermore, dynamic/responsive assessment was

also employed in that the teacher intervened in the

pupils’ learning through questioning techniques and

supervision. Assessment was thus made considering the

overall progress made after intervention.

In the following section, we will evaluate the

strengths and weaknesses of these ‘new’ forms of

assessments and consider how they can be employed in the

overall framework of assessment in science education to

match with the aims of science education which we have

considered in Chapter 1.

2.2.2 THE STRENGTHS AND WEAKNESSES OF ALTERNATIVE

ASSESSMENTS

The root of the word assessment reminds us that as assessorshould “sit with” a learner in some sense to be sure that thestudent’s answer really means what it seems to mean.

(Wiggins1989a:708)

121

In the preceding section, I have underscored the

fact that the many ‘new’ approaches to assessments have

distinctive features which are quite different from

traditional assessments. For the sake of practicality for

discussion, I will use the term ‘alternative assessment’ to

encompass all these different assessments. Let’s recap

some of the general features of alternative assessments

which we have considered hitherto.

It assesses pupils performance over a period of time.

It includes quantitative and qualitative descriptions.

It is an internal, school or classroom-based assessment

conducted by the teachers themselves.

It is an on-going assessment which continuously

influences further instruction.

It is essentially based on the standard model - the

standards usually commensurate with either teaching or

curriculum objectives.

It emphasises validity, that is the accuracy with which

it measures those pre-specified standards.

A comparison of the above features with the

objections against the traditional measurement model of

assessment (see Section 2.1.2) seems to suggest that

alternative assessment can be a viable solution for the

ills of traditional assessment. In addition to this,

educationists have alluded to the following strengths of

alternative assessment.

122

Wiggins (1989a) in his paper advocating authentic

assessments, suggests these reasons :

It assesses the pupils’ ‘habits of mind’, that is their

thoughtful understanding which has developed and is

developing. It is not simply testing the pupils’ short

term recall of facts. It also reduces the possibility

of an accident or thoughtless (if correct) response.

Instead, the patterns of success and failure and the

reasons behind them are observed.

What is to be assessed is usually known from the start

and repeatedly taken so that learning is assessed.

Thus, it emphasises students’ progress toward mastery.

It usually does not rely on unrealistic and arbitrary

time constraints.

It usually involves problems which are less structured

and hence more closely resembles real-life problems.

(For a more comprehensive list, see Appendix C).

Viechnicki (1993) writing on portfolio assessment

discerns three positive effects on the quality of

learning :

* It capitalises and adds value to the actual work in

classroom.

* It enhances teacher and student involvement in

assessment.

* It promotes pupil reflection and self assessment.

123

Athanases’ (1994) research on teacher assessing

portfolios discovered the added-value in changing the

practices of the teacher :

It convinced teachers of the need to keep more detailed

and varied records.

It helped teachers show progression of the pupils’ work

for reporting to pupils, parents, administrators and

other teachers.

And perhaps the most encouraging element was that it

promoted an increased focus in the individual students,

enabling the teachers to assess and adjust instruction

more frequently.

All this paints a very positive picture about

alternative assessments. Nevertheless, alternative

assessments has it’s own set of difficulties and problems

which we will now consider.

Firstly, there is the problem of reliability.

Alternative assessment does require teachers to make

human judgement through observations and scrutiny of

often very varied portfolios. This raises the matter of

the subjectivity of human judgement which engenders

questions of the reliability of teacher-assessment

(Wiggins 1989a:710). Furthermore, the open-ended nature

of most performance-based tasks entail very different

124

routes undertaken by pupils as they endeavour to find

solutions resulting in complexity in assessment. In

response to this problem, Wiggins (1989a:709) suggests

that reliability for alternative assessments could be

achieved through a process called “moderation”, in which

teachers of the same subjects gathered to compare results

and to set common criteria for grading.

This leads to the next problem : time and money,

which both Wiggins (1989a:710) and Maeroff (1991:281)

identified as the real obstacle to alternative

assessments. It cannot be denied that alternative

assessments are very demanding both on the teachers’ time

and energy (Maeroff 1991:279). Furthermore, if

moderation is required to ensure reliability, it would

compound the problems of the already overloaded teachers

- not to mention the financial strains that it posits.

These are major problems which education authorities

and experts are currently grappling with. Bateson’s

(1994:237-8) suggestion of blending older tried-and-true

methods and techniques and the more recently developed

methods is wisdom that should be given due consideration.

In spite of the above stated problems, alternative

assessments is still a viable option especially for

science education whose diverse aims which we have

considered in Chapter 1 requires an equally multifaceted

125

assessment tool. In Chapter 1, I have proposed that the

myriad explicit and implicit aims of science education

could be visualised as categorised in the objective

category of ‘knowing’, the semi-objective category of

‘doing’ and the subjective category of ‘feeling’. In the

objective category of ‘knowing’, where pupils are

actively involved in extracting and constructing meanings

of science concepts and phenomena with an emphasis on

application, alternative assessments provides the added

dimension of gauging this process of learning which

traditional assessment is impotent to do.

Then, in the category of semi-objective category of

‘doing’, there are many aspects which only through the

observation of students’ performance can there be any

reasonable and acceptable assessment undertaken. Such is

the case for assessing pupils’ procedural understanding

(scientific methods), scientific process skills and

manipulative skills.

As for the subjective category of ‘feeling’ like

scientific attitudes, moral values and beliefs which

certainly require human judgement, alternative

assessments provide the most appropriate and acceptable

method of collecting data for assessment.

It is with this perspective, in mind that I will now

try to formulate a suitable framework that will combine

126

both traditional and alternative assessments to be

utilised for primary science education in Malaysia.

127

CHAPTER 3: PROPOSAL FOR FRAMEWORK OF ASSESSMENTOF PRIMARY SCIENCE FOR MALAYSIA

3.1 PERTINENT FACTORS IN FORMULATING THEFRAMEWORK

In the endeavour to formulate the framework of

assessment for primary science in the Malaysian context,

the following factors were considered:

Since non-science specialists teachers do teach primary

science in Malaysia, the framework should be easy to

pick-up, master and implement. Therefore, a simple list

of criteria is preferred to an elaborate one.

Since excessive paper work is detrimental to the

already overloaded primary teachers, flexibility in

recording the various aspects to be assessed should be

a common feature. Even though a recording form will be

proposed, it can still be improvised and adapted by

individual teachers according to their preferred styles

and inclinations.

Since alternative assessment is very new to Malaysian

primary science teachers, a blending of the traditional

and alternative forms of assessment is congenial.

Since the new Malaysian Primary School Science Syllabus

(1993) espouses the importance of scientific knowledge,

scientific skills and scientific attitude, all three

128

areas will be featured prominently in the assessment

framework.

Since it is desirable that assessment should influence

the teaching and learning process, assessment will be

both formative and summative. Formative assessment is

an on-going process that is carried out in conjunction

with the learning process. Teachers will be involved in

intervention strategies to help their students improve

their learning of science and continuous feed-back

given to students so that they know of their progress.

Since it is propitious for both teachers and students

to reflect on the teaching and learning process in the

classroom, the assessment should be so structured as to

involve both the teachers and students in the process

of assessment.

3.2 THE OVERALL STRUCTURE OF THE FRAMEWORK FORASSESSMENT

According to the conclusion derived from section 1

(see 1.4), I will be using the three aspects of knowing,

doing and feeling to structure the framework. These

aspects with its respective components will be assessed

through the national examination (conducted at the end of

year six and is of a summative nature) and a school-

based, internal assessment (carried out annually from

129

year 4-6 and the average of the three marks calculated at

the end of year six; it is of a formative nature but

whose grades are translated to the overall summative

assessment). The overall marks are determined from the

total of the national assessment marks added to the total

of the three translated internal assessment marks.

Allocation of percentage marks are as indicated in Figure

3.1

ASSESSMENT OF PRIMARY SCIENCE

National AssessmentSchool-based Assessment

(Summative)(Formative and Summative)

70% 30%

Figure 3.1

The national summative assessment will cover

scientific knowledge, societal needs and relevance,

history of science, language and words from the knowing

aspect and scientific process skills and procedural

knowledge from the doing aspect. Except for the doing

aspect this type of assessment has been a regular feature

in primary school in Malaysia. Therefore, I will not be

elaborating on this assessment.

130

The school-based, internal assessment which will be

utilising the more versatile methods of alternative

assessment will cover all three aspects of knowing, doing

and feeling (see Figure 3.2). Assessment will focus

considerably on the process of learning and not merely on

the products of work.

DOING KNOWING

Practical Assessment of Portfolio -Assessment of process skills and contentlearning procedural knowledge progression

Activities : Activities: Laboratory work Book/informationsearch Design and make Project reports Field work Survey Paper and pencil tests

15 %10 %

FEELING

Holistic impressions

5 %

Figure 3.2 Aspects of school-based, internal assessment

3.2.1 THE ASPECT OF KNOWING (10%)

131

The focus of assessment in this aspect is the

student’s learning process of content matter. Portfolios

where students themselves choose the work that they would

like to be assessed in collaboration with their teachers

is a suitable method of assessing this aspect of knowing.

Primary science teachers will be able to track their

student’s construction of knowledge and understanding in

a more authentic manner. Suitable activities that could

be employed in the building of portfolios are

book/information search and project report.

Table 3.1 maps out the five criteria which will be

assessed in this aspect. It is set out in the format

employed by Glauert (1996) for tracking student’s

progress in primary science.

Criteria Descriptors forGrade 1

Descriptors for Grade 2

Range Expressing ideas that relate to one situation.

Making links between different situations, using the same idea to explain a range of experiences.

Degree ofgeneralisation

Describing observations made.

Generalising about relationshipsusing models or concepts.

Application

Merely expressing ideas without using it.

Applying scientific knowledge todifferent situations to solve problems or relating their significance to societal concerns.

Scientific terminology

Using everyday / common language to describe events, phenomena or ideas.

Using correct scientific terminology when giving descriptions.

Construction of understanding

Repeating memorised ideas.

Independent and scientifically correct expressions of ideas with explanations that reveals understanding.

132

Table 3.1 (Expanded from Glauert 1996:41)

The above descriptors describe the progress that is

hoped as students perform a certain task. It is the

process of developing the ideas and not the end product

that is assessed. Teachers allocate the grade according

to the descriptor of best fit for each criteria. The

qualities of the higher grade subsumes the qualities of

the lower grade. A grade of 0 is given for a particular

criteria if a student does not manifest the qualities of

that criteria as described by the descriptor for grade 1.

3.2.2. THE ASPECT OF DOING (15%)

The focus of this aspect is the student’s scientific

process skills and procedural knowledge. Since the list

of scientific process skills and procedural knowledge can

be very long, I have simplified them into three main

clusters, that is the planning cluster (hypothesising and

controlling variables), the implementing cluster

(observing or/and measuring) and the concluding cluster

(interpreting and communicating). Appendix D gives a more

complete list of the various process skills and

procedural knowledge that is related in each cluster.

Assessment can be elicited from laboratory-based

activities, design and make activities, field work and

surveys. Paper and pencil tests could be used to

133

supplement the need to assess the practical thinking

skills of each cluster. This can reduce the workload of

the teacher who need not have to carry out time-consuming

activities to verify their students progress.

Allocation of grades are made according to the

descriptor of best fit for each cluster. Table 3.2

provides the descriptors for the planning cluster, table

3.3 provides the descriptors for the implementing cluster

and table 3.4 provides the descriptors for the concluding

clusters.

Grade DESCRIPTORS1 Able to mention relevant features in attempting an

explanation based on everyday experience. Plans as he/she goes along.

2 Able to formulate at least one question or statement that can be investigated.

3 Able to identify and describe relevant variables.4 Adopts a systematic approach. Beginning to recognise the

need for a fair test. Able to identify at least one variablewhich should be kept the same for a fair test and identify an appropriate variable to measure or compare.

5 Able to offer explanation based on scientific knowledge and theories showing an awareness of their tentative nature. Able to translate their own or suggested ideas of explanation into a question or statement that can be investigated. Able to plan a fair test by changing one variable (independent variable) and observing or measuring the effect (dependent variable) while keeping all other relevant variables (control variables) the same.

Table 3.2 : Planning cluster : Hypothesising and controllingvariables

Grade DESCRIPTORS1 Able to observe and record gross or obvious features of

phenomena or object.2 Able to observe and record qualitative evidence by noticing

details of phenomena or object.

134

3 Able to collect qualitative or quantitative data in at leasttwo dimensions by focusing on observations relevant to the problem in hand.

4 Able to select and use appropriate instruments to collect quantitative data in the form of standard unitsorAble to notice differences between similar objects or events.

5 Able to select and use suitable measuring instruments in standard units considering range and accuracy of measurements. Able to notice differences between similar objects or events and/or similarities between different objects or events. Able to check or repeat observations and measurements to improve accuracy. Able to record evidence ordata clearly and appropriately as they carry out the work.

Table 3.3 : Implementing cluster : observing and/or measuring

135

Grade DESCRIPTORS1 Able to make interpretations related to some part of the

data (rather than preconceived ideas) even if only by loose associations. Able to give oral descriptions of the investigation and findings.

2 Able to make interpretations based on all available data. Able to give written descriptions in the form of notes and drawings of the investigation and findings.

3 Able to identify patterns in a set of data. Able to use tables or other standard framework for recording findings ofinvestigation.

4 Able to make generalised statement(s) based on evidence of data.

5 Able to evaluate data in relation to original problem. Able to justify predictions in terms of observed relationship(s) or pattern(s) showing how evidence has been used. Able to give oral or written description of the investigation using notes, drawings, tables and other standard framework appropriate for audience and purpose.

Table 3.4 : Concluding cluster : interpreting and communicating

(Collated from Russell and Harlen 1990:16-23,

Science in the National Curriculum 1995:3,8, Glauert

1996:39-40, Davis 1989:50-53)

3.2.3 THE ASPECT OF FEELING (5%)

The focus of this aspect of feeling are the

scientific attitudes that are desirable for scientific

investigations. Since this is a highly subjective aspect

for assessment, I suggest that holistic impressions that

the teacher develops of each student’s inclinations

through their interaction with them throughout the school

year be the judgement for the attainment of the five

criteria of scientific attitudes.

136

As Harlen (1993:72-3) has pointed out, an attitude

can only be said to exist if it becomes typical of that

individual and is displayed in a range of similar

situations. The science teacher is in the best position

to notice these inclinations in the daily outworking of

learning and doing science.

Table 3.5 provides the five criteria of scientific

attitudes and their descriptors. The list of criteria

could have been longer but only these five have been

chosen as they are of particular relevance to science

(Harlen 1993:73) and are suitable building blocks for

primary science.

137

Criteria Descriptors forGrade 1

Descriptors for Grade 2

Curiosity Being slow to noticenew things and needsto be prompted.

Noticing details, asking enquiringquestions; seeking for explanations and shows an eagerness to know.

Respect for evidence

Showing a tendency to report results that are in line with their original ideas while ignoringconflicting evidence.

Demonstrating open-mindedness in making conclusions from the available evidence with a willingness to consider conflicting evidence.

Flexibility

Showing a tendency to stick to preconceived ideas.

Showing a willingness to reconsider ideas and other points of view; also recognises the tentative nature of ideas with a readiness to change the,

Critical reflection

Needing encouragement and help to review methods, findings and ideas.

Showing a willingness to review and evaluate independently; also suggests ways to change for improvement.

Sensitivity to living things and the environment

Requiring supervision and rules to show appropriate concern and responsibility.

Demonstrating an awareness of the needs of living things and the importance of respecting the environment.

Table 3.5 Scientific attitudes (Collated form Glauert 1996:38-39 andHarlen 1993:72-79)

Grades are given in a similar manner as that

described in the aspect of knowing. Nevertheless, in the

final analysis, the total of the grades of all the

criteria for feeling should be divided by 2 so that it

could be used directly for totalling to the overall

marks.

3.3 RECORDING

138

Now that we have identified the criteria to be

assessed in alternative assessment, it is imperative that

some kind of recording system be formulated to help

primary science teachers keep a tracking record of their

students’ individual progress. A flexible format for

continuous recording is required so that the proposed

formative assessment does not turn out in reality and

practice as a summative assessment where teachers fill in

the necessary papers or forms at the end of the year just

because they are expected to hand in something. Of

course, some kind of monitoring system to ensure that

such a situation does not happen is desirable - and I

propose that the school inspectors be assigned this

important task.

139

The recording system should also avoid the following

pitfalls that Lock and Wheatley (1989:108) have

identified:

Bulky recording system and mark books which are difficult to

use in the classroom and laboratory.

Time consuming filling in of scores, grades and

achievements.

Inaccessibility (and sometimes incomprehension) of the data.

This recording system should also serve as a

reflective tool for teachers so that their planning of

instruction is continually being changed as a result of

the feedback that he/she is getting through the

assessment process. The primary science teacher should

also be involved in implementing intervention strategies

to help his/her students improve in their learning of

science. This is of course in line with the formative

approach to assessment and the application of dynamic

assessment.

There are basically two approaches to formulating

recording systems:

(a) A task-centred system.

(b) A skill / criterion -centred system.

According to Lock and Wheatley (1989:108), a task-

centred system is one where students are given a

practical tasks and their performance on all (or most)

140

skills or processes are assessed and recorded from the

beginning to the end. The aim is to grade all students in

the group on the same criteria in the same context on the

same occasion. The disadvantage of this system is that it

encourages mini-practical examinations within the class

context and does not really foster the integration of

assessment with learning.

A skill/criterion-centred system, allows for either

whole classes to be assessed on one or more skills in one

context or for a few students to be assessed on a range

of criteria on different occasions and contexts (Lock and

Wheatley 1989:108). Because of the flexibility of this

system and also due to the fact that it causes least

disruption to normal school lessons, I propose that we

adopt this approach.

Appendix E gives a simple recording form that could

be utilised by primary science teachers to keep track of

their students work, performance, written work and

progress throughout the year. The column for ‘prose

comments’ allows the teacher to record indicators that

exemplify the fulfilment of a particular criteria.

141

3.4 MODERATION PROCEDURES

In the discussion of the strengths and weaknesses of

alternative assessments (see Section 2.2.2), I have

already alluded to the problem of reliability. Harlen

(1994:16) identifies the possible sources of error that

affect the reliability of alternative assessments:

Variation in the demand or opportunity provided by the tasks

undertaken by students.

Differences in interpretation of performance criteria or

marking schemes by assessors.

The intrusion of irrelevant contextual information in making

judgements assessors.

In light of the above problems, the process of

moderation has become a prominent feature whenever

alternative assessments are employed to enhance

reliability (Gipps 1994:78). But what exactly is

moderation and how does it work?

Harlen (1994:16-18) has suggested that moderation

procedures could be categorised into two main groups

according to their purpose, that is:

Moderation for quality control - This type of moderation is

carried out after assessment has been made with the

primary purpose of adjusting the outcome of assessment

142

to improve fairness for groups and individuals. The

focus is the product of assessment.

Moderation for quality assurance - This type of moderation

takes place before the assessment is completed and its

intention is to improve the quality of the assessment

process. Thus, the focus is on the process of assessment.

143

3.4.1 MODERATION FOR QUALITY CONTROL

Examples of moderation for quality control are:

Inspection of samples sent in by post.

Inspection of samples by visiting moderators.

The use of reference or scaling tests for

statistical moderation.

The first two examples require teachers to submit to

a group of ‘experts’, who make up the examining board,

samples of their students’ work which exemplify various

levels of grading and their justification for awarding

those grades. The weakness of this procedure is that

students’ final products do not always represent a valid

or complete measure of their practical abilities or the

process involved (Kempa 1986:87). Furthermore, only a

small sample could be evaluated due to time, cost and

management constraints (Harlen 1994:19).

The third example, statistical moderation, is a

procedure for adjusting students’ internal grades using

the results of an externally marked test taken by all

students (Harlen 1994:18). Wilson (1992:134) proposes

that such moderating procedures are employed on two

underlying assumptions:

Practising teachers know best the relativities among their

own students but know much less about the relativities

between classes and schools.

144

External raters, on the other hand, are assumed to be good

at comparing classes, but not so good at measuring within

the classes (or perhaps are too expensive for this task).

In the framework that I have proposed in Section

3.2, the reference standard to which internal teacher-

given grades are adjusted is the external national

examination (which is a theory, paper-and-pencil exam)

that all students have to sit at the end of year six. The

choice of using theory examination as a reference

standard have been criticised by Gipps (1994:80-5) and

Satterly (1994:67-8). Basically they question whether the

high reliability and trust normally accorded to such

external examination is justified. Furthermore, the

reasonability of such an application depends on whether

there is a significant correlation between students’

practical ability (which is the focus of internal

assessment) and their theoretical knowledge (which is the

focus of most external, paper-and-pencil examinations).

It calls to question the validity of using theory marks

to adjust what is essentially practical marks. It is

because of such objections that in the United Kingdom,

the application of public examinations to adjust

internal, teachers’ grades have been dropped (Harlen

1994:21). On the other hand, Australia still continues

this practice (Broadfoot 1994:46).

Nevertheless, I would still like to propose using

statistical moderation in this manner because of the

145

following reasons. Firstly, alternative assessment is

very new to Malaysian educators and public. As Wilson

(1992:136) points out a gradual transition would make

acceptance of the newer forms of assessments more likely.

Secondly, a ‘running-in’ period is required since primary

science teachers need to be trained and given time to

develop skills necessary for assessing their students’

performance. The public examination that is still the

accepted norm in Malaysia at the present moment could

provide the stabilising factor during this period of

transition. This is also important to ensure public

confidence in the assessment framework.

146

External Assessment for Internal Assessment for all students. all students. (Theory marks) (PracticalGrades)

Compute: Mean and standard Compute: Mean and standard

deviation deviation

Adjust practical grade factor for each

teaching set, j.

Compute: standard deviation Adjust standard deviation of of theory marks for each practical grade for teaching teaching set, j. set, j.

Rescale individual grade:

147

Figure 3.3: PROCEDURE FOR STATISTICAL MODERATION OFINTERNAL ASSESSMENT

The procedure employed for statistical moderation of

teacher’s grades derived from internal school-based

assessment are usually based on the notion of ‘linear

scaling’. Figure 3.3 shows the procedure for statistical

moderation of internal assessment. The following

equations are used to calculate the adjustment factor for

a teaching set, j (which could comprise of students in a

class or a school who are being assessed).

148

where

is the mean practical grade of teaching set j,

after moderation.

is the average of mean practical grades for all

teaching sets (the whole population

of students being assessed).

is the mean theory mark of teaching set j, derived

from the external theory-based

exam (national examination).

is the average of mean theory marks, for all teaching

sets.

is the adjusted standard deviation of practical

grades for teaching set j, after

moderation.

ST is the standard deviation of theory marks for

teaching set j.

is the mean of standard deviations of practical

grades within sets, for all teaching

sets.

is the mean of standard deviations of theory marks

within sets, for all teaching

sets.

149

I will illustrate this from the following example

derived from a study by Slimming (1971) which is partly

related by Kempa (1986:90-1). In this example, the

teacher assessment of practical abilities was compared

with students’ performances on a paper-and-pencil theory

test. The teacher assessment was graded on a five-point

scale for some 30 teaching sets. The following

derivations were calculated:

The mean practical grade for all teaching sets, =

3.0

The mean standard deviation of practical grades for

all teaching sets, = 0.7

Correspondingly, the theory test for all teaching sets

produced, the following:

The mean set theory marks, = 49.0%

The mean standard deviation for theory marks, =

11.0%

The following correspondence could then be calculated to

be used for comparing with the teachers’ given practical

grade. (Table 3.6)

Mean theory mark forteaching set, . (%)

Mean practical grade forteaching set, .

27.0 1.638.0 2.349.0 3.060.0 3.7

150

71.0 4.4Table 3.6

For example, when the theory mark is 27.0, the

corresponding practical grade can be calculated as

follows:

= 3 + (27 - 49)0.7/11

= 1.6

Table 3.7 shows the mean practical grade arrived at byteachers in School A and B.

School Mean teacher-derived

practical grade

Theory score(mean)

Moderatedpractical grade

(mean)

A 3.56 44.4 2.7B 2.71 59.2 3.7

Table 3.7

When the teacher-derived practical grade from school

A is compared with the figures in Table 3.6, it is

obvious that this is a case of over-assessment.

Therefore, moderation of this teaching set is carried out

as shown below:

= 44.4

= = 2.7

151

Hence, moderation reduces the mean grade to 2.7 from

3.56 (downward-scaling) (As indicated in the fourth

column in table 3.7)

On the other hand, the teacher-derived practical

grade in school B is too low compared to the set’s theory

performance in Table 3.6 (a case of under-assessment). A

similar moderation procedure when applied results in an

upward scaling of the mean practical grade for this

teaching set from 2.71 to 3.7. (See the fourth column in

Table 3.7.)

How does this affect the individual practical grade

marks of each student? The moderated practical grade of

an individual, i, in a teaching set j, , is

obtained from his or her raw grade, , by applying the

following formula :

From the above formula, when the external ratings

(theory) and the teacher’s ratings are aligned, that is

= and = then =

or, in words, when the teacher and the external

raters are using the same overall scoring framework, the

teacher’s ratings will be the final ratings. This does

not posit that the teacher’s ratings equal the external

152

ratings for all students, but only that the teacher’s

ratings have the same average and the same spread as

those of an external rater. Therefore, the statistical

moderation procedure adjusts the mean and the spread of

the practical grades awarded by a teacher for the

particular teaching set, j.

I will illustrate this by returning to the example

of school A and B. Let’s say that it was determined that

school A had a standard deviation of 7.7 for the theory

marks while school B had a standard deviation of 20.9. If

a student in school A who was assessed internally and

received an original practical grade of 4, statistical

moderation could be worked out for his particular grade

as follows:

153

= 7.7 = 0.49

= 2.7 + = 3.4

Therefore, this student’s original practical grade

of 4 would be reduced through the moderation procedure to

3.4, which matches with the downward scaling we had

earlier determined.

On the other hand, a student in school B with a

similar grade of 4 would have it enhanced to 4.6, which

matches with the upward scaling through the adjustment

factor earlier determined.

This statistical procedure (as mapped out in Figure

3.3) could be programmed on a computer database and the

adjustment made quite rapidly. Nevertheless, it should be

noted that this statistical moderating procedure cannot

be applied reasonably to small teaching sets of less than

four students. In such cases, direct visual inspection is

required.

3.4.2 MODERATION FOR QUALITY ASSURANCE

154

Wilson (1992:129-132) stresses that for the success

of introducing alternative assessments, it is essential

to establish a network of information and support for

teachers which will consist of (a) a staff-development

network, (b) a collegial-support component and (c) a

feed-back component. Group moderation (or sometimes

called group, consensus or social moderation, agreement

panels or agreement trials) provides excellent

opportunities for all these to occur (Gipps 1994:78).

These are meetings where teachers discuss examples

of work done by their students so that shared

understandings of the criteria in operation are agreed

upon (Harlen 1994:23). Both the processes and the

products of the assessment are considered. The provision

of exemplars, that is samples of marked or graded work,

sharing and discussion of indicators for performance

levels of each criteria and an atmosphere where opinions

are freely expressed and deliberated, clarifies the

outworking of the assessment procedure (Gipps 1994:79).

Group moderation not only improves inter-marker

reliability and supports the process of assessment, but

is also an excellent vehicle for teacher development. As

Gipp reports, “It seems that coming together to discuss

performance or scoring is less personally and

professionally threatening than discussing, for example,

pedagogy.” Thus, group moderation has the added value of

155

affecting teachers' ideas and teaching practices.

Furthermore, the sessions are invaluable for providing

feedback to the developers and central administrators of

the assessment framework. As a result, exemplification of

the criteria’s standards and descriptors could be refined

and improved.

The disadvantages of group moderation are that it is

time-consuming and costly. Nevertheless, Harlen’s

(1994:24) comment helps to put this in perspective:

“Quality in moderation is the process which optimises the

reliability of an assessment at a cost which is balanced

by the benefits in terms of the purposes of the

assessment and contributions to professional

developments.”

In Malaysia, most districts and states have what is

called Teacher’s Activity Centre (Pusat Kegiatan Guru,

PKG). These centres are usually centrally located within

the districts. Therefore, I propose that these centres be

used as group moderation centres. Selected primary

science teachers, lecturers from teachers training

colleges, assessment officers from the education

department, school inspectors and school heads/assistant

school heads could meet to carry out group moderation

exercises. Eventually, these moderation groups could

build up a data bank of exemplars for assessment

procedures. These key personnel could then be consulting

156

‘experts’ to train up other primary science teachers to

carry out school-based assessment.

It is hoped that within a few years of its

implementation, group moderation procedures will be

sufficiently good to ensure quality so that statistical

moderation could be done away with. Ultimately, it is

professionally desirable that the teachers’ judgement in

assessment be trusted and respected.

157

CONCLUSION

The varied explicit and implicit aims of science

education nowadays necessitate the need for a more

versatile method of assessment commensurate with the

importance accorded to the aspects of knowing, doing and

feeling in the learning of science. Traditional methods

of assessment that only assess theoretical knowledge

betrays the importunate problem of advocating scientific

investigations during the learning of science but

ignoring it altogether when it comes to the important

summative assessment.

Therefore, new forms of assessment, poignantly

called alternative assessments, that require school-based

teacher assessment that are conducted concurrently with

the teaching-learning process are proposed to remedy this

state of impropriety in science education for Malaysian

primary schools. This continuous and more authentic

assessment of students process of learning and their

performance over a period of time, is formative in nature

and thus is expected to enhance the prospect of learning

science in primary school. This teacher assessment is

also used summatively and criteria for the aspects of

knowing, doing and feeling with their corresponding

descriptors are proposed.

158

Nevertheless, it is acknowledged that introducing

such radical reforms to the assessment structure of

Malaysia is not going to be an easy endeavour. The

assessment framework which I have proposed in this

treatise needs to be tested on a small sample of primary

science teachers and their students. It will probably

need to be revised and fine tuned. Exemplars of the

criteria for knowing, doing and feeling need to be

collected for training purposes.

Articles and papers need to be written, published

and presented to inform and convince educators and the

general public in Malaysia that alternative assessment is

an equally equitable and valid form of assessment that

complements traditional assessment methods.

The staff of the Assessment Unit of the Ministry of

Education of Malaysia needs to trained to cope with the

more demanding tasks of analysing results and undertake

statistical moderation. Ultimately, it is preferable that

group moderation will take over the task of ensuring

reliability, thus diminishing the role of statistical

moderation. Therefore, the network for group moderation

needs to be set up and supported.

The task of training thousands of primary science

teachers to carry out school-based assessment confidently

and appropriately is challenging. Monitoring the

159

implementation of alternative assessment to avoid short

fall and abuses is another important factor to consider.

Nevertheless, I am convinced that in the long run,

the benefits of introducing alternative assessments far

outweighs the initial difficulties. I envision that

students will begin to receive a more balanced education

in science. They will be motivated in their daily classes

as they realise that their work and performance will be

valued in the final analysis. Primary science teachers

will be more professional in their work as they are

trusted with the tasks of assessing their own students

whom they interact with and know personally. Teacher

development will be a consistent feature that will

enhance performance.

With such optimistic potentials and more, I am

convinced that this is the way forward for assessment in

primary science in Malaysia and alternative assessment

will facilitate the realisation of the wider aims of

science education.

160

APPENDIX A

CONCEPTS OF EVIDENCE AND THEIR DEFINITION(Gott and Duggan 1995:31)

CONCEPTSOF EVIDENCE

DEFINITION

ASSOCIATED WITHDESIGN

Variableidentification

Fair test

Sample size

Variable types

Understanding the idea of a variable and identifyingthe relevant variable to change (the independentvariable) and to measure, or assess if qualitative(the dependent variable).

Understanding the structure of the fair test interms of controlling the necessary variables and itsimportance in relation to the validity of anyresulting evidence.

Understanding the significance of an appropriatesample size to allow, for instance, for probabilityor biological variation.

Understanding the distinction between categoric,discrete, continuous and derived variables and howthey link to different graph types.

ASSOCIATEDWITHMEASUREMENT

Relative scale

Range and interval

Choice ofinstrument

Repeatability

Accuracy

Understanding the need to choose sensible values forquantities so that resulting measurements will bemeaningful. For instance, a large quantity ofchemical in a small quantity of water causingsaturation, will lead to difficulty indifferentiating the dissolving times of differentchemicals.

Understanding the need to select a sensible range ofvalues of the variables within the task so that theresulting line graph consists of values which arespread sufficiently widely and reasonably spaced outso that the ‘whole’ pattern can be seen. A suitablenumber of readings is also subsumed in this concept.

Understanding the relationship between the choice ofinstrument and the required scale, range of readingsrequired, and their interval (spread) and accuracy.

Understanding that the inherent variability in anyphysical measurement requires consideration of theneed for repeats.

Understanding the appropriate degree of accuracythat is required to provide reliable data which willallow a meaningful interpretation.

ASSOCIATED WITH DATA HANDLING

Tables

Graph type

Understanding that tables are more than ways ofpresenting data after it has been collected. Theycan be used as ways of organising the design andsubsequent data collection and analysis in advanceof the whole experiment.

Understanding that there is a close link betweengraphical representations and the type of variablethey are to represent. For example, a categoric

161

Patterns

Multivariatedata

independent variable such as type of surface, cannotbe displayed sensibly in a line graph. The behaviourof a continuous variable on the other hand is bestshown in a line graph.

Understanding that patterns represent the behaviourof variables and that they can be seen in patternsin tables and graphs.

Understanding the nature of multivariate data andhow particular variables within those data can beheld constant to discover the effect of one variableon another.

ASSOCIATED WITH THE EVALUATION OF THE COMPLETE TASK

Reliability

Validity

Understanding the implications of the measurementstrategy for the reliability of the resulting data;can the data be believed?

Understanding the implications of the design for thevalidity of the resulting data; an overall view ofthe task to check that it can answer the question.

APPENDIXB

9 CHALLENGES OF VISION 2020 FOR MALAYSIA(Education Planning and Research and Division 1990:75)

Establishing a united Malaysian nation with a sense ofcommon and shared destiny;

Creating a psychologically liberated, secure, anddeveloped Malaysian society with faith and confidence;

Fostering and developing a mature democratic society;

Establishing a fully moral and ethical society;

Establishing a matured, liberal and tolerant society;

162

Establishing a scientific and progressive society,innovative and forward looking, not only as a consumerto technology but also as a contributor to thescientific and technological civilization of thefuture;

Establishing a fully caring society and a caringculture;

Ensuring an economically just society; and

Establishing a prosperous society, with an economy thatis fully competitive, dynamic, robust and resilient.

APPENDIXC

CHARACTERISTICS OF AUTHENTIC TESTS[Wiggins (b) 1989:45]

A. STRUCTURE AND LOGISTICS

1. Are more appropriately public; involve an audience, a panel, andso on.

2. Do not rely on unrealistic and arbitrary time constraints.

163

3. Offer known, not secret, questions and tasks.4. Are more like portfolios or a season of games (not one-shot).5. Require some collaboration with others.6. Recur - and are worth practising for, rehearsing, and retaking.7. Make assessment and feedback to students so central that school

schedules, structures, and policies are modified to support them.

B. INTELLECTUAL DESIGN FEATURES

1. Are “essential” - not needlessly intrusive, arbitrary, orcontrived to “shake out” a grade.

2. Are “enabling” - constructed to point the student toward moresophisticated use of the skills or knowledge.

3. Are contextualized, complex intellectual challenges, not“atomized” tasks, corresponding to isolated “outcomes”.

4. Involve the student’s own research and use of knowledge, for which“content” is a means.

5. Assess students’ habits and repertoires, not mere recall or plug-in skills.

6. Are representative challenges - designed to emphasize depth more thanbreadth.

7. Are engaging and educational.8. Involve somewhat ambiguous (“ill-structured”) tasks or problems.

C. GRADING AND SCORING STANDARDS

1. Involve criteria that assess essentials, not easily counted (butrelatively unimportant) errors.

2. Are not graded on a “curve” but in reference to performancestandards (criterion-referenced, not norm-referenced).

3. Involve demystified criteria for success that appear to students asinherent in successful activity.

4. Make self-assessment a part of the assessment.5. Use a multifaceted scoring system instead of one aggregate grade.6. Exhibit harmony with shared schoolwide aims - a standard.

D. FAIRNESS AND EQUITY

1. Ferret out and identify (perhaps hidden) strengths.2. Strike a constantly examined balance between honoring achievement and

native skill or fortunate prior training.3. Minimize needless, unfair, and demoralizing comparisons.4. Allow appropriate room for student learning styles, aptitudes, and

interests.5. Can be - should be - attempted by all students, with the test

“scaffolded up”, not “dumbed down”, as necessary.6. Reverse typical test-design procedures : they make

“accountability” serve student learning (Attention is primarilypaid to “face” and “ecological” validity of tests.)

164

APPENDIX D

CLUSTERS OF SCIENTIFIC PROCESS SKILLS AND PROCEDURALUNDERSTANDING FOR EACH STAGE

SCIENTIFIC METHOD SCIENTIFIC THINKING(PROCEDURAL UNDERSTANDING) (PROCESS SKILLS)

Design : PLANNING CLUSTER : Classifying Variable identification HYPOTHESISING AND RaisingQuestionsFair test CONTROLLING Sample size VARIABLESVariable types

Relative scale IMPLEMENTING CLUSTER : Usingthe relationshipRange and interval OBSERVINGbetween space and Choice of instrument AND MEASURING time

RecordingClassifying

Tables CONCLUDING CLUSTER : InferringGraph Types INTERPRETING AND

Predicting

165

Patterns COMMUNICATING DefiningoperationallyMultivariate data InterpretinginformationReliability Critically reflectingValidity

APPENDIX E

STUDENT’S NAME : YEAR : CLASS :SCHOOL :

CRITERIA DATE PROSE COMMENTS GRADEKNOWING

1. Range

2. Degree of generalisation

3. Application

4. Scientific terminology

5. Construction of understanding

DOING

1. Planning : Hypothesising and controlling variables2. Implementing : Observing and/or measuring

166

3. Concluding : Interpreting and communicatingFEELING

1. Curiosity

2. Respect for evidence

3. Flexibility

4. Critical reflection

5. Sensitivity to living things and the environment

Total :

Signature : ________________________________

Teacher’s Name : ___________________________

APPENDIX F

THE ASPECT OF KNOWING (10%)

Criteria Descriptors forGrade 1

Descriptors for Grade 2

Range Expressing ideas that relate to one situation.

Making links between different situations, using the same idea to explain a range of experiences.

Degree ofgeneralisation

Describing observations made.

Generalising about relationshipsusing models or concepts.

Application

Merely expressing ideas without using it.

Applying scientific knowledge todifferent situations to solve problems or relating their significance to societal concerns.

Scientific terminology

Using everyday / common language to describe events, phenomena or ideas.

Using correct scientific terminology when giving descriptions.

167

Construction of understanding

Repeating memorised ideas.

Independent and scientifically correct expressions of ideas with explanations that reveals understanding.

THE ASPECT OF FEELING (5%)

Criteria Descriptors forGrade 1

Descriptors for Grade 2

Curiosity Being slow to noticenew things and needsto be prompted.

Noticing details, asking enquiringquestions; seeking for explanations and shows an eagerness to know.

Respect for evidence

Showing a tendency to report results that are in line with their original ideas while ignoringconflicting evidence.

Demonstrating open-mindedness in making conclusions from the available evidence with a willingness to consider conflicting evidence.

Flexibility

Showing a tendency to stick to preconceived ideas.

Showing a willingness to reconsider ideas and other points of view; also recognises the tentative nature of ideas with a readiness to change the,

Critical reflection

Needing encouragement and help to review methods, findings and ideas.

Showing a willingness to review and evaluate independently; also suggests ways to change for improvement.

Sensitivity to living things and the environment

Requiring supervision and rules to show appropriate concern and responsibility.

Demonstrating an awareness of the needs of living things and the importance of respecting the environment.

168

THE ASPECT OF DOING (15%)

Grade DESCRIPTORS1 Able to mention relevant features in attempting an

explanation based on everyday experience. Plans as he/she goes along.

2 Able to formulate at least one question or statement that can be investigated.

3 Able to identify and describe relevant variables.4 Adopts a systematic approach. Beginning to recognise the

need for a fair test. Able to identify at least one variablewhich should be kept the same for a fair test and identify an appropriate variable to measure or compare.

5 Able to offer explanation based on scientific knowledge and theories showing an awareness of their tentative nature. Able to translate their own or suggested ideas of explanation into a question or statement that can be investigated. Able to plan a fair test by changing one variable (independent variable) and observing or measuring the effect (dependent variable) while keeping all other relevant variables (control variables) the same.

Planning cluster : Hypothesising and controlling variables

Grade DESCRIPTORS1 Able to observe and record gross or obvious features of a

phenomena or object.2 Able to observe and record qualitative evidence by noticing

details of a phenomena or object.3 Able to collect qualitative or quantitative data in at least

two dimensions by focusing on observations relevant to the problem in hand.

4 Able to select and use appropriate instruments to collect quantitative data in the form of standard unitsorAble to notice differences between similar objects or events.

5 Able to select and use suitable measuring instruments in standard units considering range and accuracy of measurements. Able to notice differences between similar objects or events and/or similarities between different objects or events. Able to check or repeat observations and measurements to improve accuracy. Able to record evidence ordata clearly and appropriately as they carry out the work.

Implementing cluster : observing and/or measuring

169

Grade DESCRIPTORS1 Able to make interpretations related to some part of the

data (rather than preconceived ideas) even if only by loose associations. Able to give oral descriptions of the investigation and findings.

2 Able to make interpretations based on all available data. Able to give written descriptions in the form of notes and drawings of the investigation and findings.

3 Able to identify patterns in a set of data. Able to use tables or other standard framework for recording findings ofinvestigation.

4 Able to make generalised statement(s) based on evidence of data.

5 Able to evaluate data in relation to original problem. Able to justify predictions in terms of observed relationship(s) or pattern(s) showing how evidence has been used. Able to give oral or written description of the investigation using notes, drawings, tables and other standard framework appropriate for audience and purpose.

Concluding cluster : interpreting and communicating

170

REFERENCES

ADEY, P. and SHAYER, M. (1990) ‘Accelerating thedevelopment of formal thinking in the middle and highschool students.’, Journal of Research in Science Teaching, 27 (3),267-85.

ADEY, P. and SHAYER, M. (1994) Really Raising Standards : CognitiveIntervention and Academic Achievement. (London : Routledge).

ADEY, P.S. and SHAYER, M. (1989) Thinking Science : TheCurriculum Materials of the CASE Project. (London : Thomas Nelsonand Sons).

AIKENHEAD, G. (1994) ‘The Social Contract of Science : Implications forTeaching Science.’ in SOLOMON, J. and AIKENHEAD, G. (eds)(1994). 11-20.

AIKENHEAD, G. (1994) ‘What is STS Science Education?’ inSOLOMON, J and AIKENHEAD, G. (eds) (1994) 47-59.

ALLSOP, T. (1991) ‘Practical Science in Low-incomeCountries.’ in WOOLNOUGH (ed) (1991) 31-40.

ASSESSMENT OF PERFORMANCE UNIT (1984). Science in Schools Age 11 :Report No.3 (United Kingdom : Department of Education andScience, Department of Education for Northern Ireland,Welsh Office).

ASSESSMENT OF PERFORMANCE UNIT, SCIENCE WORKING GROUP(1977) Asssessment of Scientific Development : Consultative Paper on Plansfor the Assessment of Pupil’s Scientific Development - October 1977 (UnitedKingdom : APU)

ATHANASES, S.Z. (1994) ‘Teachers’ reports of the effectsof preparing portfolios of literacy instruction.’, TheElementary School Journal, 94 (4), 421-39.

171

AUSUBLE, D.P., NOVAK, J.D. and HANESIAN, H. (1978)Educational Psychology : A Cognitive View. (New York : Holt,Rinehart and Winston). Second Edition.

BACHOR, D.G., ANDERSON, J.O., WALSH, J. and MUIR, W.(1994) ‘Classroom assessment and the relationship torepresentativeness, accuracy and consistency.’, The AlbertaJournal of Educational Research, 40 (2), 247-62.

BAKER, L. (1991) ‘Metacognition, Reading and ScienceEducation’ in SANTA, C.M. and ALVERMANN, D.E. (eds)(1991).

BATESON, D. (1993) ‘Psychometric and philosophic problemsin “authentic” assessment : Performance tasks andportfolios.’, The Alberta Journal of Educational Research, 40 (2),233-45.BAXTER, G.P., SHAVELSON, R.J., GOLDMAN, S.R. and PINE, J.(1992), ‘Evaluation of procedure-based scoring for hands-on science assessment.’, Journal of Educational Measurement, 29(1), 1-17.

BIGGS, J. (1995) ‘Assessing for learning : Somedimensions underlying new approaches to educationalassessment.’, The Alberta Journal of Educational Research, 41 (1), 1-17.

BLOOM, B.S., MADAUS,G.F. and HASTINGS, J.T. (1981)Evaluation to Improve Learning. (New York : McGraw Hill).

BOTTERY , M. (1990) The Morality of the School : The Theory and Practiceof Values in Education. (London : Cassell Educational Limited).

BRIERLEY, J. (1964) Science in its Context : A Symposium with SpecialReference to Sixth Form Studies. (London : Heinemann EducationalBooks).

BROADFOOT, P. (1994) ‘Approaches to Quality Assurance andControl in Six Countries.’, in HARLEN, W. (Ed) (1994),26-52.

172

BROADFOOT, P. (1996) ‘Performance Assessment inPerspective : International Trends and Current EnglishExperience.’ in CRAFT, A. (Ed) (1996), 35-65.

BROWN, J., COOPER, A., HORTON,T., TOATES, F. and ZELDIN,D. (Eds) (1986) Science in Schools. (England : Open University Press).

CARRE, C. (1981) Language Teaching & Learning - 4. Science.(London : Word Lock Educational).

CAVENDISH, S., GULTON, M., HARGREAVES, L. and HARLEN, W.(1990) Observing Activities: Assessing Science in the Primary Classroom.(London : Paul Chapman Publishing Ltd.).

CHAPMAN, B. (1994) ‘The Overselling of Science Educationin the 1980s’ in LEVINSON, R. (ed) (1994). 190-205.

CHRISTOFI, C. (1988) Assessment and Profiling : A Practical Guide.(London : Cassell Educational Limited).

CRAFT, A. (Ed) (1996) Primary Education : Assessing and PlanningLearning. (London : Routledge).

DARLING-HAMMOND, L. (1994) ‘Performance-based assessmentand educational equity.’, Harvard Educational Review, 64 (1), 5-30.

DAVIS, B.C. (Ed) (1989) GASP (Graded Assessment in Science Project):Teacher’s Handbook. (London : Hutchinson Education).

DRIVER, R., GUESNE, E. and TIHERGHIEN, A. (Eds) (1985)Children’s Ideas in Science. (England : Open University Press).

DUGGAN, S. and GOTT, R. (1995) ‘The place ofinvestigations in practical work in the UK curriculum.’,International Journal of Science Education, 17 (2), 137-47.

EASLEA, B. (1973) Liberation and the Aims of Science : An Essay onObstacles to the Building of a Beautiful World. (London : Chatto &Windus Ltd.).

173

EDUCATIONAL PLANNING AND RESEARCH DIVISION (1990) Educationin Malaysia 1989. (Kuala Lumpur : Ministry of EducationMalaysia ).

FODEN, A. (1993) ‘Integrating Assessment in the Curriculum.’ inSHERRINGTON, R. (ed) (1993) 152-62.

GILBERT, J.K., OSBORNE, R.J. and FENSHEM, P.J. (1986)‘Children’s Science and its Consequences for Teaching’ in BROWN, J. etal (eds)(1986). 302-315.

GIPPS, C. (1994) ‘Quality in Teacher Assessment’, inHARLEN, W. (ed) (1994), 71-86.

GLAUERT, E. (1996) Tracking Significant Achievement in PrimaryScience.(London: Hodder and Stoughton).

GOTT, R. and DUGGAN, S. (1995) Investigative Work in the ScienceCurriculum. (Buckingham : Open University Press)

GOTT, R. and MASHITER, J. (1991) ‘Practical Work inScience - A Task-based Approach?’ in WOOLNOUGH, B. (ed)(1991).

GRONLUND, N.E. (1985) Measurement and Evaluation in Teaching. (NewYork : Macmillan Publishing Company). Fifth Edition.

GUILFORD, J.P. (1959) Personality. (New York : McGraw-Hill).

HAERTEL, E.H. (1991) ‘New forms of teacher assessment.’,Review of Research in Education, 17, 3-29.

HANNA, G.S. (1993) Better Teaching through Better Measurement.(Fort Worth, USA : Harcourt Brace Jovanovich CollegePublishers).

HARLEN , W. (1993) Teaching and Learning Primary Science. (London: Paul Chapman Publishing Ltd.).Second Edition.

HARLEN, W. (1988) The Teaching of Science. (London : DavidFulton Publishers).

174

HARLEN, W. (Ed) (1994) Enhancing Quality in Assessment. (London :Paul Chapman Publishing Ltd.).

HARLEN, W. and OSBORNE, R. (1985) ‘A model for learningand teaching applied to primary science.’, Journal ofCurriculum Studies, 17 (2), 133-46.

HENNING-STOUT, M. (1994) Responsive Assessment : A New Way ofThinking about Learning. (San Francisco : Jossey-BassPublishers).

HIRST, P.H. (1974) Knowledge and the Curriculum : A Collection ofPhilosophical Papers. (London : Routledge & Kegan Paul).

HIRST, P.H. and PETERS, R.S. (1970) The Logic of Education.(London : Routledge & Kegan Paul).

HODSON, D. and PROPHET, R.B. (1994) ‘Why the ScienceCurriculum Changes : Evolution or Social Control?’ inLEVINSON, R. (ed) (1994) 22-38.

HUMMEL, C.E. (1986) The Galileo Connection : Resolving Conflictsbetween Science & the Bible. (Illinois, USA : InterVarsityPress).

HURD, P.D. (1971) New Directions in Teaching Secondary SchoolScience. (Chicago: Rand McNally & Company).

HURD, P.D. (1971). New Directions in Teaching Primary School Science.(Chicago : Rand Mcnally & Company).

JACOBSON, W.J. and BERGMAN, A.B. (1980) Science For Children : ABook for Teachers. (New Jersey : Prentice Hall, Inc.). 2ndEdition.

JENKINS, E. (1990) ‘Scientific literacy and schoolscience education’, School Science Review, 71 (256) 43-51.

KEMPA, R. (1986). Assessment in Science. (Cambridge : CambridgeUniversity Press).

175

KIRKHAM, J. (1989) ‘Balanced science : Equilibriumbetween Context, Process and Context.’ in WELLINGTON, J.(ed) 135-50.

KRUGLEY-SMOLSKA, E.T. (1990) ‘Scientific literacy indeveloped and developing countries’, International Journal ofScience Education, 12 (5), 73-480.

LALLY, V. (1994) ‘Problem-solving in School Science’ inWELLINGTON, J. (ed) (1994) 218-39.

LEVINSON, R. (Ed) (1994) Teaching Science. (London :Routledge).

LEWIN, L. (1975) ‘Science Education in Malaysia and SriLanka’, IDS Discussion Paper, No.74, University of Sussex.

LOCK, R. (1990) ‘Open-ended, problem-solvinginvestigations : What do we mean and how can we usethem?’, School Science Review, 71 (256) 63-72.LOCK, R. and WHEATLEY, T. (1989) ‘Recording process,skill and criterion assessments.’, School Science Review, 70(253), 108-113.

MAEROFF, G.I. (1991) ‘Assessing alternative assessment.’,Phi Delta Kappan, 73 (4), 272-81.

MARTIN , M. (1972) Concepts of Science Education : A PhilosophicalAnalysis . (Illinois : Scott, Foresman and Company).

McCALLUM, B., GIPPS, C., McALISTER, S. and BROWN, M.(1996). ‘Teachers’ Own Assessment.’, in CRAFT, A. (ed)(1996) 74-99.

MEYER, C.A. (1992) ‘What’s the difference betweenauthentic and performance assessment?’, EducationalLeadership, 49 (8), 39-40.

MILLAR, R. (1991) ‘A means to an end : the role ofprocesses in science education’ in WOOLNOUGH , B. (ed)(1991). 43-52.

176

MILLAR, R. (1994) ‘What is Scientific Method’ inLEVINSON, R. (ed) (1994) 164-177.

MILLAR, R. and DRIVER, R. (1987) ‘Beyond Processes.’,Studies in Science Education,14, 33-62. (Yorkshire : Studies inEducation Limited).

MILLAR, R., LUBBEN, F., GOTT, R. and DUGGAN, S. (1994)‘Investigating in the school science laboratory :Conceptual and procedural knowledge and their influenceon performance.’, Research Papers in Education, 9 (2), 207-48.

MOLLY, L.N.N. (1992) ‘School curriculum reforms inMalaysia : World influences and national context.’ ,International Journal of Science Education, 14 (3), 249-263.

NEILL, D.M. and MEDINA, N.J. (1989) ‘Standardized testing: Harmful to educational health.’, Phi Delta Kappan, 70 (9),688-97.

NELLIST, J. and NICHOLL, B. (1986) ASE Science Teachers’Handbook. (London : Hutchinson & Co. Publishers Ltd.).

NEWTON , D.P. (1988) Making Science Education Relevant.(London : Kogan Page).

OSBORNE, R. and FREYBERG, P. (1985) Learning in Science : TheImplications of Children’s Science. (Auckland : Heinemann).

OSBORNE, R.J. and WITTROCK, M.C. (1983) ‘Learning science: A generative process.’, Science Education, 67, 489-508.

OXFORD PAPERBACK DICTIONARY (1988) (Oxford : OxfordUniversity Press).

PENNELL , A. and ALEXANDER , D. (1990) The Management ofChange in the Primary School : Implementing the National Curriculum inScience and Design Technology. (New York : The Falmer Press)

PHENIX, P.H. (1964) Realms of Meaning : A Philosophy of theCurriculum for General Education. (New York : McGraw Hill BookCompany).

177

PIKE, D.F.B. (1964) ‘Scientific Method in Peace and War’in BRIERLEY, J. (ed) (1964) 82-93

POLANYI, M. (1962) Personal Knowledge : Towards a Post-CriticalPhilosophy. (London : Routledge & Kegan Paul).

POLANYI, M. (1968) The Tacit Dimension. (London : Routledge &Kegan Paul).

POLANYI, M. (1969) Knowing and Being. (London : Routledge &Kegan Paul).

POOLE, M. (1990) ‘Beliefs and values in science education- A Christian perspective (Part 1)’, School Science Review, 71(256) 25-32.

POOLE, M. (1990) ‘Beliefs and values in science education- A Christian perspective (part 2)’, School Science Review, 71(257) 67-73.

POOLE, M. (1995) Beliefs and Values in Science Education.(Buckingham : Open University Press).

PUSAT PERKEMBANGAN KURIKULUM (1989) Huraian Sukatan PelajaranSains Tambahan : Tingkatan IV. (Kuala Lumpur : KementerianPendidikan Malaysia).

PUSAT PERKEMBANGAN KURIKULUM (1989) Huraian Sukatan PelajaranSains : Tingkatan III. (Kuala Lumpur : Kementerian PendidikanMalaysia).

PUSAT PERKEMBANGAN KURIKULUM (1990) Huraian Sukatan PelajaranSains : Tingkatan I. (Kuala Lumpur : Kementerian PendidikanMalaysia)

PUSAT PERKEMBANGAN KURIKULUM (1990) Huraian Sukatan PelajaranSains : Tingkatan II. (Kuala Lumpur : Kementerian PendidikanMalaysia).

178

PUSAT PERKEMBANGAN KURIKULUM (1991) Huraian Sukatan PelajaranFizik : Tingkatan IV. (Kuala Lumpur : Kementerian PendidikanMalaysia).

PUSAT PERKEMBANGAN KURIKULUM (1991) Huraian Sukatan PelajaranBiologi : Tingkatan IV. (Kuala Lumpur : Kementerian PendidikanMalaysia).

PUSAT PERKEMBANGAN KURIKULUM (1991) Huraian Sukatan PelajaranKimia : Tingkatan IV. (Kuala Lumpur : Kementerian PendidikanMalaysia).

PUSAT PERKEMBANGAN KURIKULUM (1992) Huraian Sukatan PelajaranFizik : Tingkatan V. (Kuala Lumpur : Kementerian PendidikanMalaysia).

PUSAT PERKEMBANGAN KURIKULUM (1992) Huraian Sukatan PelajaranKimia : Tingkatan V. (Kuala Lumpur : Kementerian PendidikanMalaysia).

PUSAT PERKEMBANGAN KURIKULUM (1992) Hurian Sukatan PelajaranBiologi : Tingkatan V. (Kuala Lumpur : Kementerian PendidikanMalaysia).

PUSAT PERKEMBANGAN KURIKULUM (1992) Hurian Sukatan PelajaranSains Tambahan : Tingkatan V. (Kuala Lumpur : KementerianPendidikan Malaysia).

PUSAT PERKEMBANGAN KURIKULUM (1993) Huraian Sukatan PelajaranSains Sekolah Rendah (Malaysian Primary Science Syllabus). (KualaLumpur : Kementerian Pendidikan Malaysia).

QUALTER, A., STRANG, J., SWOTTON, P. and TAYLOR, R.(1990) Exploration : A Way of Learning Science. (Oxford : BlackwellEducation).

RAIZEN, S.A. and KASER, J.S. (1989) ‘Assessing sciencelearning in elementary school : Why, what and how?’, PhiDelta Kappan, 70 (9), 718-22.

179

RAVETZ, J.R. (1971) Scientific Knowledge and its Social Problems.(Oxford : Clarendon Press).

REISS , M.J. (1993) Science Education For a Pluralist Society.(Buckingham : Open University Press).

REISS, M.J. (1992) ‘How should science teachers teach therelationship between science and religion?’, School ScienceReview, 74 (267) 126-30.

RITCHIE, B. (1971) ‘Physics’ in WHITFIELD, R. (ed) (1971)126-136.

ROGERS, W.T. (1991) ‘Educational assessment in Canada :Evolution or extinction?’, The Alberta Journal of EducationalResearch, 37 (2), 179-92.

RUSSEL, T. and HARLEN, W. (1990) Practical Task: Assessing Sciencein the Primary Classroom. (London : Paul Chapman PublishingLtd.).

SANTA, C.M. and ALVERMANN, D.E. (Eds) (1991) Science Learning: Processes and Applications. (Newark, Delaware: InternationalReading Association, Inc.)

SATTERLY, D. (1994) ‘Quality in External Assessment.’, inHARLEN, W. (ed) (1994), 53-70.

SECONDARY SCIENCE CURRICULUM REVIEW (1984) Towards theSpecification of Minimum Entitlement : Brenda and Friends. (London :Schools Council Publications).

SHAPIRO, B. (1994) What Children Bring to Light : A ConstructivistPerspective on Children’s Learning in Science. (New York : TeachersCollege Press).

SHAVELSON, R.J. and BAXTER, G.P. (1992) ‘What we’velearned about assessing hands-on science.’, EducationalLeadership, 49 (8), 20-5.

SHAYER, M. and ADEY, P.S. (1992) ‘Accelerating thedevelopment of formal thinking in the middle and high

180

school students II.’, Journal of Research in Science Teaching, 29(1), 81-92.

SHERRATT, W.J. (1986) ‘History of Science in the Science Curriculum : anHistorical Perspective’ in BROWN, J. et al.(eds) (1986) 198-212.

SHERRINGTON, R. (Ed) (1993) The ASE Primary Science Teachers’Handbook. (Cheltenham, England : Stanley Thornes).

SIDDONS , C. and SPURGIN , B. (1990) ‘Beliefs and valuesin science education’, Forum, School Science Review, 72 (259),145-6.

SLIMMING, D. (1971) ‘Investigation into some aspects ofteacher-based assessment of sixth-form chemistrystudents’ interpretational abilities.’ MSc. thesis.(University of East Anglia) partly refered to in KEMPA,R. (1986).

SOLOMON, J. (1993). Teaching Science, Technology and Society.(Buckingham : Open University Press).

SOLOMON, J. and AIKENHEAD, G. (Eds) (1994) STS Education :International Perspectives on Reform.(New York : Teachers CollegePress).

SUTTON, C. (1992) Words, Science and Learning. (Buckingham :Open University Press).

SUTTON, C. (1994) ‘Well, Mary, what are they saying here?’ inLEVINSON, R. (ed) (1994) 61-75.

TAYLOR, C. (1994) ‘Assessment for measurement orstandards : The peril and promise of large-scaleassessment reform.’, American Educational Research Journal, 31(2), 231-62.

VIECHNICKI, K.J., BARBOUR, N., ROHRER, J. and AMBROSE, R.(1993) ‘The impact of portfolios assessment on teacherclassroom activities.’, Journal of Teacher Education, 44 (5),371-7.

181

WATTS, M. (1991) The Science of Problem Solving : A Practical Guide forScience Teachers. (London : Cassell Educational Ltd.).

WELLINGTON, J. (1994) Secondary Science : Contemporary Issues andPractical Approaches. (London : Routledge).

WELLINGTON, J. (Ed) (1989) Skills and Processes in ScienceEducation : A Critical Analysis. (London : Routledge).WHITE, R.T. (1988). Learning Science. (Oxford : BasilBlackwell).

WHITFIELD , R. (1971) Disciplines of the Curriculum. (London :the MIT Press)

WIGGINS, G. (1993) ‘Assessment : Authenticity, contextand validity.’, Phi Delta Kappan, 75 (3), 200-14.

WIGGINS, G. (a) (1989) ‘A true test : Toward moreauthentic and equitable assessment.’, Phi Delta Kappan, 70(9), 703-13.

WIGGINS, G. (b) (1989) ‘Teaching to the (authentic)test’, Educational Leadership, 46 (7), 41-7.

WILSON, M. (1992) ‘Educational leverage from a politicalnecessity : Implications of new perspectives on studentassessment for Chapter 1 evaluation.’, Educational Evaluationand Policy Analysis, 14 (2), 123-44.

WOOLNOUGH , B. (1994) Effective Science Teaching. (Buckingham :Open University Press).

WOOLNOUGH, B. (Ed) (1991) Practical Science : The Role and Reality ofPractical Work in School Science. (Buckingham : Open UniversityPress).

WOOLNOUGH, B. and ALLSOP, T. (1985) Practical Work in Science.(Cambridge : Cambridge University Press) .

WOOLNOUGH, B.E. (1989) ‘Faith in science?’, School ScienceReview, 70 (252) 133-7.

182

ZIMAN, J. (1976) The Force of Knowledge : The Scientific Dimension ofSociety. (Cambridge : Cambridge University Press).

ZIMAN, J. (1980) Teaching and Learning about Science and Society.(Cambridge: Cambridge University Press)

ZIMAN, J. (1994) ‘The Rational of STS Education is in the Approach’ inSOLOMON, J and AIKENHEAD, G. (eds) (1994) 21-31.

183

BIBLIOGRAPHY

ALBERTS, R.V.J., VAN BEUZEKOM and CITO, I.D.R. (1986)‘The assessment of practical work : A choice ofoptions.’, European Journal of Science Education, 8 (4), 361-369.

APU (1977) Consultative Paper on Plans for the Assessment of Pupils’Scientific Development - October 1977.

APU (1977) Sample Questions in Science at Age 11. (London :Department of Education and Science).

ARCHBALD, D.A. and NEWMAN, F.M. (1988) Beyond StandardizedTesting : Assessing Authentic Academic Achievement in the Secondary School.(Virginia : National Association of Secondary SchoolPrincipals).

ARCHENHOLD, F. et al. (1991) Assessment Matters No. 5 - Profilesand Progression in Science : A Booklet for Teachers. (London : SchoolExaminations and Assessment Council).

ASSESSMENT OF PERFORMANCE UNIT (1984) Science in Schools, Age11 : Report No.3. (London : Department of Education andScience, Department of Education for Northern Ireland,Welsh Office).

AUSTIN, R. (1991) Assessment Matters No. 7 - Patterns andRelationships in School Science : A Booklet for Teachers. (London : SchoolExaminations and Assessment Council).

BRYCE, T., McCALL, J., MacGREGOR, J., ROBERTSON, I. andWESTON, R. (1983) TAPS : Techniques for Assessment of Practical Skills inFoundational Science (Teacher’s Guide). (London : HeinemannEducational Books Ltd.).

BRYCE, T., McCALL, J., MacGREGOR, J., ROBERTSON, I. andWESTON, R. (1988). TAPS 2 : Techniques for Assessing Process Skills inPractical Science (Resource Pack). (London : Heinemann EducationalBooks).

184

BUCHAN, A.S. and JENKINS, E.W. (1992) ‘The internalassessment of practical skills in science in England andWales, 1960-1991: some issues in historicalperspective.’, International Journal of Science Education, 14 (4),367-380.

DONELLY, J.F. and GOTT, R. (1985) ‘An assessment-ledapproach to processes in the science curriculum.’,European Journal of Science Education, 7 (3), 237-251.

DWIGHT, K. (1989) ‘Building in assessment.’, School ScienceReview, 70 (252), 119-125.

LOCK, R. (1987) ‘OCEA-the development of gradedassessment scheme in science (1982-4).’, School Science Review,68 (244), 570-575.

LOCK, R. and FERRIMAN, B. (1989) ‘OCEA-the development ofa graded assessment scheme in science - Part III Schooltrials 1986.’, School Science Review, 70 (252), 103-112.

LOCK, R. and WHEATLEY, T. (1989) ‘Recording process,skills and criterion assessments - student systems.’,School Science Review, 71 (255), 145-150.

RIPPIN, T. and WILSON, G. (1989) ‘Assessing processacross the primary-secondary interface.’, School ScienceReview, 70 (253), 95-102.

RUSSEL, A. et al. (1991) Assessment Matter : No. 8 - Observation inSchool Science - A Booklet for Teachers (London : SchoolExaminations and Assessment Council).

SCHILLING, M., HARGREAVES, L., HARLEN, W. and RUSSEL, T.(1990) Written Tasks : Assessing Science in the Primary Classroom.(London : Paul Chapman Publishing Ltd.).

STRANG, J. (1990) Assessment Matters : No. 2 - Measurement in SchoolScience : A Booklet for Teachers. (London : School Examinations andAssessment Council).

185

SWAIN, J.R.L. (1985) ‘Towards a framework for assessmentin science.’, School Science Review, 67 (238), 145-151.

SWAIN, J.R.L. (1989) ‘The development of a framework forthe assessment of process skills in a Graded Assessmentin a Science Project.’, International Journal of Science Education,11 (3), 251-259.

SWAIN, J.R.L. (1991) ‘The nature of assessment ofscientific explorations in the classroom.’, School ScienceReview, 72 260), 63-77.

TAYLOR, R.M. and SWATTON, P. (1990) Assessment Matters : No. 1 -Graph Work in School Science : A Booklet for Teachers. (London : SchoolExaminations and Assessment Council).

WILSON, G. (1989) ‘Assessing process - an assessmentscheme for a year 1 science course.’, School Science Review, 70(253), 113-121.

WOOLNOUGH, B.E. and TOH, K.A. (1990) ‘Alternativeapproaches to assessment of practical work in science.’,School Science Review, 71 (256), 127-131.

186