The Role of Assessment and Evaluation in the Teaching-Learning Environment
Evaluation is a process, not a single event. As such it involves several steps leading to important decisions impacting on students and classroom processes. A model showing this relationship is shown below. As it shows, evaluation is a three-step process. It involves the systematic collection of relevant data, the professional interpretation of what that information means and the act of informed decision-making on the basis of that interpretation. A discussion of each of these stages follows.
1. COLLECTION OF INFORMATION
The systematic collection of relevant data is an important first step in the process of student evaluation. It requires the appropriate planning and design of procedures and instruments for collection of information on student outcomes, either at the formative or summative stage of the process. A discussion of implications for data collection at different levels of the system is next.
» At the Classroom Level
In the case of the classroom teacher, this stage of the process utilizes a variety of measures such as observational techniques, projects, presentations, quizzes and teacher-made tests. Each of these should correspond to what is being taught in the classroom, be “weighted” in some way to reflect importance, and measure the goals of the intended curriculum as prescribed at each respective provincial level in Canada.
» Program Evaluations
For program level evaluation, this stage usually involves the development and administration of a variety of instruments such as achievement tests, surveys of student attitudes and perceptions, performance assessment procedures, and surveys or observations of classroom practices. Each of these instruments is linked to the prescribed curricula of respective jurisdictions and provides information on outcomes and practices relevant to achievement and instruction. Often achievement measures focus on student outcomes at the end of a range of grade levels in each subject, rather than on those limited to a single course such as Grade 6 mathematics.
» Standardized Tests
Information collected on standardized tests reflects the knowledge and skills considered important from across a number of jurisdictions. These often relate to important life skills including applications of consumer needs, knowledge considered important for an educated citizen, and problem solving skills. This information doesnêt cover the entire breadth of a curriculum but focuses on areas considered important, consisting of subsets of common skills and areas of knowledge.
» Credentialing Examinations
Provincial credentialing tests such as subject-specific Grade 11 or 12 examinations collect information corresponding to important curriculum outcomes prescribed at the course level. They are designed to reflect outcomes from the curriculum considered important to examine and are usually articulated to schools through a detailed description of the content and cognitive levels to be examined.
2. INTERPRETATION OF RESULTS
The interpretation of results is a key component in the process of evaluation. It is essential, however, that it take place within an appropriate context. To better understand what results tell us and what they do not, it is important to keep in mind the nature of the audience examined, the design and purpose of the test, and the limitations of the data. A discussion of interpretation at different levels of the system follows.
» Teacher Measures
A major advantage that teachers have in interpreting results from their own measurements is first-hand knowledge of students in their classrooms. Given this contextual understanding they are in a good position to assess how well, for example, Billy or Daljit understands the concepts being examined. However, there are some limitations in using only their own measures for this purpose. For example, one teacherês perception of “excellence” may be quite different than anotherês. This limitation is best addressed when results from teacher-made tests are combined from time to time with external measures linked to standards and norms.
» Program Assessments
The interpretation of results from program level assessments usually occurs at each of the following reporting levels: school, district, and province. Information provided for this purpose often includes norms and standards associated with each level. The process usually involves a subjective procedure where a committee or panel reaches consensus on how well the jurisdiction under review has met expected and desired levels of achievement, while taking into account the context for learning. A school in a high socioeconomic area, for example, would likely expect higher scores than one in a disadvantaged area. Provinces usually provide schools and districts with procedures and guidelines to help in undertaking this process.
» Standardized Tests
The interpretation of results from standardized tests should include references to both normative comparisons and expectations relative to the nature of the population under review. Through a combined analysis of these, a better understanding of results can be gained. At each level of reporting (the student, classroom, school and district), profiles of strengths and weaknesses can be developed, which can lead to a plan for improvement.
» Credentialing Examinations
Data from credentialing examinations focus at the student level and are then aggregated to school, district and provincial levels. In addition to student scores, summaries usually include results by topic and cognitive level. When results are interpreted at this level it is common, not only to base them on expectations relative to the nature of the student population but also relative to how other schools and districts performed. It is important to utilize both references in the process. For example, if only expectations for the population under review were used, then it would be possible to lose sight of objective comparisons. On the other hand, if only normative comparisons were used, important factors impacting on performance, such as socio-economic status, would not be taken into account.
3. EFFECTIVE DECISION-MAKING
Once results have been interpreted within an appropriate context, areas of strength and weakness should be identified and a growth plan developed. It is incumbent on those making the interpretation to develop a plan for the maintenance of strengths and the improvement of areas of weakness based on findings.
At the student level, decisions may involve a plan for remediation at the formative stage of evaluation or the reporting of a standing at the summative stage. Identification of strengths and weaknesses based on results aggregated to the classroom, school, district or provincial level areas can provide direction for a number of follow-up activities. Among these may be a change in methodology, the development of resource materials, the allocation of resources, and revision of curriculum content and design.
4. SUMMARIZING THE COMPONENTS OF A BALANCED MODEL
The balanced model proposed includes a variety of measures intended for a variety of purposes. Its components include teacher-made measures, program assessments, credentialing examinations, and standardized tests. A summary of these components is shown in Table 1-1 with a list of the advantages and disadvantages of each.
The above presented the concept of evaluation as a process involving the systematic collection of information, the appropriate interpretation of results, and the act of making informed decisions on the basis of findings. It is contended that there is a role for the components of a balanced evaluation model to play in each step of th