Docimology is a specialized field of pedagogy and psychology that focuses on the systematic study, analysis, and improvement of evaluation and testing processes in education. As a scientific discipline, it seeks to ensure that assessment methods are not only accurate and fair but also appropriate for measuring students' performance, knowledge, and skills. [1] [2]
The term "docimology" derives from the Greek words dokimos ("tested, proven") and logos ("study"), [3] signifying "the study of testing." Henri Piéron (1881–1964), a distinguished French psychologist and educator, is widely regarded as the founder of docimology. [4] [5] He was one of the first to systematically analyze educational evaluation and its psychological and social impacts on students and teachers. Piéron highlighted the critical role of objectivity and reliability in assessments, laying the groundwork for subsequent research in this area. [6]
Docimology is intrinsically connected to the concept of assessment for learning, emphasizing the use of evaluation not merely as a tool for measuring knowledge but as a means to enhance and support student learning. By focusing on formative assessments, it transforms evaluation into an instrument for fostering personal growth and educational improvement. [7]
In contemporary practice, docimology has evolved significantly, shaped by technological innovations and advancements in educational science. Beyond traditional education, it now plays a vital role in professional certification, employee recruitment, and psychological testing, promoting fairness and effectiveness across diverse domains.
Docimology is grounded in several core principles:
The origins of docimology can be traced back to the early 20th century, with the rise of standardized testing in education. Notable contributions include Alfred Binet’s work on intelligence testing and the subsequent development of psychometric theories by scholars such as Charles Spearman and L.L. Thurstone [8] . Over time, docimology evolved to critique and improve evaluation practices, addressing biases and systemic inequities inherent in many testing systems.
Key milestones include:
The integration of digital technologies has revolutionized docimology. Online testing platforms and Artificial intelligence (AI) tools enable more precise and transparent assessments by reducing human error and bias. For instance, AI-driven analytics can identify patterns in student responses, offering insights that help educators tailor their teaching strategies. These innovations have expanded the scope of docimology, making it a cornerstone of contemporary evaluation practices across educational and professional settings.
Modern developments include:
Despite these advances, challenges remain, particularly in ensuring these technologies are free from algorithmic biases and accessible to all individuals, regardless of their socioeconomic background. Continuous research and development are critical to addressing these concerns and maximizing the potential of digital innovations in docimology. [9]
Docimology encompasses several critical dimensions of assessment:
In educational settings, docimology informs the design and implementation of assessments to ensure they are fair, reliable, and valid. Key applications include:
In professional contexts, docimology is applied to develop fair and effective evaluation methods for hiring and certification processes:
The integration of AI and machine learning has expanded docimology's applications, particularly in automating and enhancing assessment processes:
Standardized tests have been criticized for reflecting the norms and values of dominant cultures, which can disadvantage individuals from minority backgrounds. For instance, the National Education Association highlights that such tests often fail students from communities of color due to inherent biases. [14]
Access to resources like test preparation services is often unequal, leading to systemic barriers for students from lower socioeconomic backgrounds. This disparity can result in lower test scores, which may not accurately reflect a student's true abilities or potential. [15]
The increasing use of AI in assessments has introduced concerns about algorithmic bias. AI systems can inadvertently perpetuate existing biases present in their training data, leading to discriminatory outcomes. For example, AI algorithms in healthcare have shown biases that affect patient care across different racial groups. To mitigate these issues, it's essential to develop culturally responsive assessments and incorporate equity-focused metrics into evaluation processes. Ongoing research and policy interventions aim to create more inclusive and fair assessment systems. [16]
The following outline is provided as an overview of and topical guide to education:
Psychometrics is a field of study within psychology concerned with the theory and technique of measurement. Psychometrics generally covers specialized fields within psychology and education devoted to testing, measurement, assessment, and related activities. Psychometrics is concerned with the objective measurement of latent constructs that cannot be directly observed. Examples of latent constructs include intelligence, introversion, mental disorders, and educational achievement. The levels of individuals on nonobservable latent variables are inferred through mathematical modeling based on what is observed from individuals' responses to items on tests and scales.
A standardized test is a test that is administered and scored in a consistent, or "standard", manner. Standardized tests are designed in such a way that the questions and interpretations are consistent and are administered and scored in a predetermined, standard manner.
The Stanford–Binet Intelligence Scales is an individually administered intelligence test that was revised from the original Binet–Simon Scale by Alfred Binet and Théodore Simon. It is in its fifth edition (SB5), which was released in 2003.
Educational software is a term used for any computer software that is made for an educational purpose. It encompasses different ranges from language learning software to classroom management software to reference software. The purpose of all this software is to make some part of education more effective and efficient.
Educational assessment or educational evaluation is the systematic process of documenting and using empirical data on the knowledge, skill, attitudes, aptitude and beliefs to refine programs and improve student learning. Assessment data can be obtained by examining student work directly to assess the achievement of learning outcomes or it is based on data from which one can make inferences about learning. Assessment is often used interchangeably with test but is not limited to tests. Assessment can focus on the individual learner, the learning community, a course, an academic program, the institution, or the educational system as a whole. The word "assessment" came into use in an educational context after the Second World War.
Electronic assessment, also known as digital assessment, e-assessment, online assessment or computer-based assessment, is the use of information technology in assessment such as educational assessment, health assessment, psychiatric assessment, and psychological assessment. This covers a wide range of activities ranging from the use of a word processor for assignments to on-screen testing. Specific types of e-assessment include multiple choice, online/electronic submission, computerized adaptive testing such as the Frankfurt Adaptive Concentration Test, and computerized classification testing.
Cognitive tests are assessments of the cognitive capabilities of humans and other animals. Tests administered to humans include various forms of IQ tests; those administered to animals include the mirror test and the T maze test. Such testing is used in psychology and psychometrics, as well as other fields studying human and animal intelligence.
Imaging informatics, also known as radiology informatics or medical imaging informatics, is a subspecialty of biomedical informatics that aims to improve the efficiency, accuracy, usability and reliability of medical imaging services within the healthcare enterprise. It is devoted to the study of how information about and contained within medical images is retrieved, analyzed, enhanced, and exchanged throughout the medical enterprise.
In the United States education system, School Psychological Examiners assess the needs of students in schools for special education services or other interventions. The post requires a relevant postgraduate qualification and specialist training. This role is distinct within school psychology from that of the psychiatrist, clinical psychologist and psychometrist.
Adaptive learning, also known as adaptive teaching, is an educational method which uses computer algorithms as well as artificial intelligence to orchestrate the interaction with the learner and deliver customized resources and learning activities to address the unique needs of each learner. In professional learning contexts, individuals may "test out" of some training to ensure they engage with novel instruction. Computers adapt the presentation of educational material according to students' learning needs, as indicated by their responses to questions, tasks and experiences. The technology encompasses aspects derived from various fields of study including computer science, AI, psychometrics, education, psychology, and brain science.
Artificial intelligence in healthcare is the application of artificial intelligence (AI) to analyze and understand complex medical and healthcare data. In some cases, it can exceed or augment human capabilities by providing better or faster ways to diagnose, treat, or prevent disease.
CodeSignal is a technology company that provides a technical assessment and learning platform for software developers. Founded in 2015 and headquartered in San Francisco, California, CodeSignal offers coding tests, assessments, and learning platforms designed to measure and improve coding skills.
Algorithmic bias describes systematic and repeatable errors in a computer system that create "unfair" outcomes, such as "privileging" one category over another in ways different from the intended function of the algorithm.
Joy Adowaa Buolamwini is a Canadian-American computer scientist and digital activist formerly based at the MIT Media Lab. She founded the Algorithmic Justice League (AJL), an organization that works to challenge bias in decision-making software, using art, advocacy, and research to highlight the social implications and harms of artificial intelligence (AI).
Computational psychometrics is an interdisciplinary field fusing theory-based psychometrics, learning and cognitive sciences, and data-driven AI-based computational models as applied to large-scale/high-dimensional learning, assessment, biometric, or psychological data. Computational psychometrics is frequently concerned with providing actionable and meaningful feedback to individuals based on measurement and analysis of individual differences as they pertain to specific areas of enquiry.
Automated decision-making (ADM) involves the use of data, machines and algorithms to make decisions in a range of contexts, including public administration, business, health, education, law, employment, transport, media and entertainment, with varying degrees of human oversight or intervention. ADM involves large-scale data from a range of sources, such as databases, text, social media, sensors, images or speech, that is processed using various technologies including computer software, algorithms, machine learning, natural language processing, artificial intelligence, augmented intelligence and robotics. The increasing use of automated decision-making systems (ADMS) across a range of contexts presents many benefits and challenges to human society requiring consideration of the technical, legal, ethical, societal, educational, economic and health consequences.
AI literacy or artificial intelligence literacy, is the ability to understand, use, monitor, and critically reflect on AI applications. The term usually refers to teaching skills and knowledge to the general public, particularly those who are not adept in AI.
Matthias von Davier is a psychometrician, academic, inventor, and author. He is the executive director of the TIMSS & PIRLS International Study Center at the Lynch School of Education and Human Development and the J. Donald Monan, S.J., University Professor in Education at Boston College.
Lawrence M. Rudner is a research statistician and consultant whose work spans domains, including, statistical analysis, computer programming, web development, and oyster farming. He is the owner and president of Oyster Girl Oysters, and is an instructor at the Chesapeake Forum and the Chesapeake Bay Maritime Museum. He is the founder and former editor of the Practical Assessment, Research, and Evaluation journal.
This article needs additional or more specific categories .(November 2024) |