1 Knowing what students know Judging the worth of what students knows 2 Test : เครื่องมือวัดการเรียนรูของผูเรียน Assessment : กระบวนการหาขอมูลจากหลากหลายวิธี /แหลง มักเปนตัวเลข เพื่อสะทอนการเรียนรูของผูเรียน มาก-นอย “การวัดผล” Evaluation : กระบวนการวิเคราะห แปลผล ตัดสิน ขอมูลจากการวัดผล ผาน-ตก “การประเมินผล” 3 Nominal Ordinal Interval Ratio Attributes are only named Attributes can be ordered Distance is meaningful Absolute zero 4
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
How do you feel today ? 1 - Very unhappy2 – Unhappy3 – OK4 – Happy5 – Very happy
5
What is the temperature difference between these 2 glasses of water ?
30oC 40oC
0oC still have temperature 6
What is the difference in height between A and B ?
A = 6 feet tall B = 6 feet tall
The difference is zero (Absolute zero)
7
Ratio scaleInterval scaleOrdinal scaleNominal scale
8
Indirect measurementIncomplete : cannot measure every objectives The test scores : interval scale (no absolute zero) Relative measurement: 70 marks mean nothing, only….The test(tool) always has error
9 10
11
Why should students be assessed ?Safeguarding the public / Certification
Monitoring the programmeFeedback to students
12
� Final grade of record
� Feedback to improve
13
Formative evaluation is typically conducted during the development or improvement of a program or product (or person, and so on) and it is conducted, often more than once
The purpose of formative evaluation is to validate or ensure that the goals of the instruction are being achieved and to improve the instruction
14
"When the cook tastes the soup, that’s formative; when the guests taste the soup, that’s summative."
Formative evaluation is conducted to provide program staff evaluative information useful in improving the program.
15
Brown Frederick (1976)Formative evaluation : Test, report, observation, asking-answering questions during classroom teaching
Exam content should match course objectives.Important topics should be weighted more heavily than less important topics.The testing time devoted to each topic should reflect the relative importance of the topic.The sample of items should be representative of the instructional goal.
18
Under-/ Post-graduate studentsWhich year ?
Who teachesWho does not teach (colleague)Certification body
19
Beginning in endCourse
20
Evaluation tools / Test format should be
Students have to something to reflect their learningwriting (TEST / paper-pencil exam)acting, doing, speaking (NON-TEST)
21
Students are asked to choose the correct response to the question as constructed/provided by the teachers
Selected response question(item) SRQ
Constructed response question(item) CRQStudents construct their own answer to the question
� higher level� recall…comprehension� content coverage is high � low� no practice in writing skill � improve writing skill� easy to score � hard to score &
time consuming� vague feedback � clear feedback� easier condition for cheating � preventing cheating
27
Content validity = adequate sampling of contentCriterion-related validity
“Show how it is possible to determine the height of a tall buildingwith the aid of a barometer”
"The Barometer Story" by Professor Alexander Calandra(1911–2006)
33
Explain how to determine the height of the tall building by using barometer in terms of
“ How does the barometer work? ”
34
rtt > 0.7
-1……………0……………+1
35
Difficulty index (p) = 0.2-0.8 / 0.5-0.6
36
Discrimination power ( r ) > 0.2 / > 0.35
Knowledge, Skills, attitudeRecall, Data interpretation skill, Problem solving skill
37
Best practice guideline in evaluation :Stage 1 : Clarify purpose of the assessmentStage 2 : Define what is to be testedStage 3 : Select appropriate test methodsStage 4 : Address practical& technical issues of
administration and scoringStage 5 : Set standard for performance
Cambridge Conference on Medical Education 199138
39 40
Multiple choice questions were introduced into medical examinations in the 1950s and have been shown to be more
reliable in testing knowledge than the traditional essay questions.
It is generally, agreed that MCQs should not be used as a sole assessment method in summative examinations,
but alongside other test forms.
41
Multiple Choice QuestionsPros Cons
� reliable test scores� high content coverage� reduced guessing, compared to T/F question� testing various cognitive ability levels
� take time & effort to write well
� favor the simple recall of facts� are highly dependent on students’ reading and instructors’ writing skills� offer cues so that correct response may be guessed through elimination
42
Schuwirth et al, found that context-rich questions lead to thinking processes which represent problem solving ability better than those elicited by context-free questions.
Higher Order Thinking Skill
43
1 = Correct answer(Key)4 = Distractors
A………………………B…………………… C………………………D……………………E……………………
A………………………B…………………… C………………………D……………………E……………………
44
Lead-in statementLead-in statement
1 = Correct answer(Key)4 = Distractors
A………………………B…………………… C………………………D……………………E……………………
A………………………B…………………… C………………………D……………………E……………………
45 46
MCQ เดาไดเสมอ
สรางไมดี ยิ่งเดาไดงาย
47 48
Mitochondria evolved from free-living bacteria that could carry our oxidative phosphorylation. For this reason, they have circular genomes that reproduce independently of the nuclear genome.
A. ContentB. OrganizationC. Size
What characteristic is relatively constant in mitochondrial genomes across species?
49
GnRH agonists have been developed that increases binding affinity to GnRH receptors and resist enzymatic degradation within the hypothalamus and pituitary gland. They have been used for the following clinical conditions. Except
A. endometriosisB. precocious pubertyC. assisted reproductive technologies, such as IVFD. menopauseE. treatment of fibroids
50
Which of the following conditions manifests as chronic airspacedisease on this chest radiograph?
A 30-year-old man presented with a 4-month history of dyspnea, low grade fever, cough, and fatigue.Given the following chest radiograph, what is the most likely diagnosis?
51
What is the hematopoietic substance synthesized by kidney?
A. RenninB. AngiotensinC. ErythropoietinD. AldosteroneE. Cortisol
52
A 55-year-old man with a history of chronic kidney diseasefor 5 years, visits the hospital because of fatigue. Physical examination : Pale conjunctivae, no jaundice, others within normal limit. Complete blood count (CBC) examination : hemoglobin: 8.7 gm/dL, hematocrit: 26%, MCV: 92 fL (80-100), MCH: 33 pg (27-31), MCHC: 33 gm/dL.
The deficit of which of the following substances is the cause of the abnormal findings from CBC examination in this patient?
The deficit of which of the following substances is the cause of anemia in this patient?
Of which of the following substances is abnormally synthesizedaccording to this CBC finding?
53
Stem should be in simple, understood languageDon’t overlap response alternativesMake all distractors plausible/homogeneousPresent alternatives in logical / numerical orderMake each item independent of others on testAvoid “all of the above” ; “none of the above”Avoid negative itemMore items increase reliability, to a point.
54
Five distracters MCQ are more reliable but harder to write.Don’t repeat wording from the stem in the correct option.Trick questions have no place in education.Place correct answer at randomWay to judge a good stemStudent who know the content should be able to answer
before reading the alternatives.55 56
57
Item analysis is a process of examining class-wide performance
on individual test items.
58
• Qualitative analysis
• Quantitative analysis
Consider whether the items are content related and well written
Look at statistical properties
Aim = to increase the validity and reliability59
This task should be done by the teachers to evaluate the test to be used in term of :