Analysis of Students' Critical Thinking Ability Instruments in Thermodynamics Material Phase F SMA/MA Reviewed from Validity, Reliability, Level of Difficulty and Differentiation

Authors

  • Nurjannah Batubara UNIVERSITAS NEGERI PADANG
  • Emiliannur UNIVERSITAS NEGERI PADANG

Keywords:

Critical Thinking Ability, validity, reliability, difficulty, and differentiation

Abstract

This study aims to determine the quality of critical thinking ability test instruments on the material Phase F Thermodynamics of SMA/MA in terms of validity, reliability, level of difficulty, and differentiating power. The research method used is a descriptive technique. The data consists of 31 sheets of student answers in grade XI of the 2025/2026 school year. The data were analyzed using several formulas of validity, reliability, difficulty, and differentiation. The method used to collect data is the test method. The results showed that 20 questions were valid and 1 question was invalid. The level of reliability is good where the value of the reliability coefficient r_11>  is 0.842 so that the question instrument is declared reliable. Meanwhile, the difficulty level of the 21 questions was stated to be moderate, the difficulty index was in the range of 0.30 < kindergarten ≤ 0.70. The difference in the good category is 2 questions, in the enough category there are about 12 questions, which is said to be bad there are about 7 questions.

Downloads

Download data is not yet available.

References

Whitby, G. B. (2007). Introduction: Why new pedagogies? Strands of relevance. ACEL 2007 International Conference Sydney, Australia, 1–11.

Istiyono, E. (2014). Developing Higher Order Thinking Skill Test Of Physics (Physthots) For Senior High School Students. Educational Research and Evaluation, 18(1), 1–12.

Mahardini, T., Khaerunisa, F., Wijayanti, I. W., & Salimi, M. (2019). Research Based Learning (Rbl) To Improve Critical Thinking Skills. Social, Humanities, and Educational Studies (SHES): Conference Series, 1(2), 466. https://doi.org/10.20961/shes.v1i2.26816

Ennis, R. H. (2011). The Nature of Critical Thinking. Informal Logic, 6(2), 1–8.

https://doi.org/10.22329/il.v6i2.2729

Indahwati, S. D., Rachmadiarti, F., & Hariyono, E. (2023). Integration of PJBL, STEAM, and Learning Tool Development in Improving Students' Critical Thinking Skills. IJORER: International Journal of Recent Educational Research, 4(6), 808–818. https://doi.org/10.46245/ijorer.v4i6.434

Sudijono. (2012). Introduction to Educational Evaluation. Jakarta: Pt. Raja Grafindo Persada.

Arikunto, S. (2013). Basics of Educational Evaluation. Jakarta: Bumi Aksara.

Fitriani, R., & Anshori, I. (2019). Analysis of Multiple-Choice Questions for the Final Semester Exam of Fiqih Subjects at MTsN 1 Lamongan Academic Year 2018/2019. Journal of Education and Learning, 8(1), 45–54.

Naga, D. S. (2013). Score Theory on Education Measurement. Jakarta: Gunadarma.

Mardapi, D. (2017). Measurement, Assessment, and Evaluation of Education. Yogyakarta: Parama Publishing.

Azwar, S. (2012). Reliability and Validity. Yogyakarta: Student Library.

Taherdoost, H. (2016). Validity and Reliability of the Research Instrument; How to Test the Validation of a Questionnaire/Survey in a Research. International Journal of Academic Research in Management (IJARM), 5(3), 28–36.

Sullivan, G. M. (2011). A primer on the validity of assessment instruments. Journal of Graduate Medical Education, 3(2), 119–120.

https://doi.org/10.4300/JGME-D-11-00075.1

Bajpai, S., & Bajpai, R. (2014). Goodness of measurement: reliability and validity. International Journal of Medical Science and Public Health, 3(2), 112-116.

Sukiman. (2012). Development of Learning Evaluation System. Yogyakarta: And then there is Madani.

Boopathiraj, C., & Chellamani, K. (2013). Analysis of test items on difficulty level anddiscrimination index in the test for research in education. International journal of social science & interdisciplinary research, 2(2), 189-193.

Kocdar, S., Karadag, N., & Sahin, M. D. (2016). Analysis of the Difficulty and DiscriminationIndices of multiple-choice questions according to cognitive levels in an open and

Nunnally, J. C., & Bernstein, I. H. (1994). Psychometric Theory (3rd ed.). New York: McGraw-Hill.

Sundayana, R. (2020). Educational Research Statistics: Alphabet.

Sugiyono. (2013). Quantitative, Qualitative, and R&D Research Methods (Issue January).

Guilford, J. P. (2005). Fundamental Statistics in Psychology and Education (4th ed.). New York: McGraw-Hill Book Company.

Anas Sudijono. (2011). Introduction to Educational Evaluation. Jakarta: Raja Granfindo Persada.

Kuseiri and Suprananto. (2012). Measurement and Valuation Education.Yogyakarta: Graha Ilmu.

Novianty, Amalia. 2022. Analysis of Critical Thinking Skills of High School Students on Static Fluid Materials

Sumarna Supranata. (2005). Analysis, Validity, Reliability and Interpretation of Results

2004 Curriculum Implementation Test. Bandung: Remaja Rosdakarya.

Zainal Arifin. (2013). Learning Evaluation. Bandung: Remaja Rosdakarya.Supriatna, D. (2018). Analysis of Multiple Choice Questions for Science Subject Class VIII. Journal of Education and Evaluation, 6(2), 45–53

Downloads

Published

2025-12-17

How to Cite

Batubara, N., & Emiliannur. (2025). Analysis of Students’ Critical Thinking Ability Instruments in Thermodynamics Material Phase F SMA/MA Reviewed from Validity, Reliability, Level of Difficulty and Differentiation. Physics Learning and Education, 3(4). Retrieved from https://ple.ppj.unp.ac.id/index.php/ple/article/view/297