The Journal of the American Dental Association
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


J Am Dent Assoc, Vol 135, No 2, 173-184.
© 2004 American Dental Association

This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by CHAMBERS, D. W.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by CHAMBERS, D. W.
Related Collections
Right arrow Practice Management

CLINICAL PRACTICE

Portfolios for determining initial licensure competency



DAVID W. CHAMBERS, Ed.M., M.B.A., Ph.D.


   ABSTRACT
 TOP
 ABSTRACT
 THE ISSUE OF VALIDITY
 PORTFOLIOS AND COMPETENCY
 THE PORTFOLIO APPROACH
 PORTFOLIO EVALUATION AND...
 PORTFOLIO EVALUATION AND THE...
 INITIAL LICENSURE IS POLITICAL...
 CONCLUSIONS
 REFERENCES
 
Background. Because attempts to improve initial licensure examinations have not been grounded in measurement theory, partial and inadequate remedies have led to a cycle of refutations, defenses and political polarization.

Types of Studies Reviewed. The author reviewed the psychometric literature, focusing on high-stakes professional decisions. Editorials in the dental literature and position papers of involved organizations often use words from this literature without incorporating its fundamental concepts.

Results. The reliability of one-shot initial licensure examinations is estimated to be approximately r = .40, which is a value well under the standard for such tests in other professions. Validity has not been investigated rigorously, but the one-shot format and proposals to remove live patients certainly would reduce validity. The use of portfolios—a small number of evaluations in several realistic task domains—is a viable means of achieving psychometric standards for initial licensure decisions.

Clinical Implications. Boards are charged with making valid and reliable licensure decisions, not with conducting examinations. At a minimum, they must define the competencies of beginning practitioners and establish the psychometric criteria for their decisions (neither of which are done currently). Gathering data then can be delegated to whoever is best qualified to meet these standards.

Current practice in initial licensure decisions for dentists involves the use of four pieces of evidence: graduation from a qualifying educational program, passage of parts I and II of the national board examinations, a criminal background check and a one-shot practical examination that includes a combination of live patient treatment, simulations and possibly additional multiple-choice questions. The practical examination is causing the most concern today. There are questions about the ethics of using live patients,15 the representativeness of the tasks performed610 and the lack of reliability and validity.1116 Frustration has been expressed by the ADA17,18; the American Dental Education Association, or ADEA19; various state legislatures and citizens’ groups20; and graduates.21,22 This article describes how boards are ill-advised to defend the one-shot practical examination in its current form or any minor modification of it. The communities voicing discontent with this testing method are equally ill-advised to demand reform of the approach. The wrong issue is being addressed. It is not the test that matters; it is the decision that must be reliable, valid, fair and cost-effective, not the data on which the decision is based.2327

It is not the test that matters; it is the decision that must be reliable, valid, fair and cost-effective.

The following assumptions have been advocated as being appropriate to the licensure examination process:

– Students should be granted an initial license to practice dentistry only after demonstrating competency in treating live patients.
– Students should not be allowed to substitute an advanced general dentistry experience for a patient-based evaluation. The amount of teaching is not an effective proxy for the quality of learning.
– Licensure by legislation undermines educational qualification and board testing equally. California Assembly Bill 1045 and similar legislation granting temporary licenses to graduates of dental schools not accredited by the Joint Commission on Dental Accreditation and who were not examined at any point for competency by standards applied to U.S.-trained dentists create a two-tier system.
Simulations and standardized tests are steps away from valid testing of competency. Simulations are appropriate learning and evaluation tools when students are beginning dental school clinical experiences but not when they are concluding them.
– One-shot tests of professional competence have very low validity (regardless of their reliability) because of the large variance introduced by patients and the magnitude of subject-by-trial variation. Evaluations that fail to sample the full range of competencies required of dentists have unknown validity and thus expose the public to an unnecessary level of risk.
Graduates of accredited dental education programs should not be granted licenses automatically. Licensure authority rests with boards, not schools. Schools should have a say in the process because they know many things that boards do not know about candidates. Boards can delegate information gathering to others, but they must retain the decision-making responsibility.
False-positive licensure decisions (granting licenses to candidates who are incompetent to practice) are more damaging to the public and the profession than are false-negative decisions (denying licenses to qualified candidates). Mechanisms such as reexamination are in place to correct false-negative decisions but not false-positive decisions.
Portfolios use more data, more diverse data and data of a higher quality than what are used now.

All of these assumptions have been expressed in discussions regarding initial licensure. Generally, however, different constituencies advance different subsets because it is assumed that these seven assumptions are internally inconsistent. In this article, I describe how portfolios can be used in the licensure examination process without displacing or affecting any of the stated assumptions. Portfolios use more data, more diverse data and data of a higher quality than what are used now. The portfolio approach has the extra advantages of increasing fairness and being less costly than the current method.


   THE ISSUE OF VALIDITY
 TOP
 ABSTRACT
 THE ISSUE OF VALIDITY
 PORTFOLIOS AND COMPETENCY
 THE PORTFOLIO APPROACH
 PORTFOLIO EVALUATION AND...
 PORTFOLIO EVALUATION AND THE...
 INITIAL LICENSURE IS POLITICAL...
 CONCLUSIONS
 REFERENCES
 
Validity is not a stable and unitary concept, even among psychometricians.2834 Two definitions of the term are solid enough to support this discussion of testing for initial licensure. Predictive validity refers to those situations in which judgments about future behavior (based on observing present behavior) tend to be correct. For example, are the people whom we think will practice consistently with standards of care (based on test data or other evidence) more likely to practice that way than those about whom we have reservations? Predictive validity data regarding initial licensure evaluation have never been published. Guidelines for Valid and Reliable Dental Licensure Clinical Examinations35 states that "job competency" for a practicing dentist is "too complex" to make predictive validity possible. This is an unfortunate statement to put into writing, since some licensed dentists have been disciplined because they failed to meet the job competency for practicing dentists.

Development of initial licensure testing in dentistry tends to rely on a second definition of validity. Content validity involves the sampling relationship between the content of an evaluation process and the domain of performance expected. Content validity is desirable, but it is no guarantee of predictive validity. An evaluation with strong content validity can be spoiled by an inappropriate decision (cut-score) rule or a bad distribution of candidates.36 The driver’s license test is a notorious example.

Initial licensure examinations have been validated by inquiring whether tasks evaluated on the test are representative of tasks critical to the safe and effective practice of dentistry. The guidelines describe a validation process undertaken by the ADA and the American Association of Dental Examiners, or AADE, licensure committee in 1992.35 The examination blueprint (what will be tested) was validated, in part, on opinions about reliability instead of measured reliability, and the examination was characterized as subjective.

Validating the examination, however, is not exactly the right challenge. What we really want to know is whether the tasks critical to the safe and effective practice of dentistry are evaluated on the test. That could be another matter entirely from whether the tasks on the test are part of dental practice. Figure 1Go shows this situation diagrammatically. Oval A represents the domain of skills, understanding and supporting values (the competencies) necessary to begin independent dental practice. Oval B represents the domain of skills and understanding sampled in a typical initial licensure examination. The content validity of the test (the proportion of the test that covers tasks used in practice) is high (the percentage of B that overlaps with A). However, the true content validity (the sampling of practice competencies tested) is not very defensible (percentage of A that overlaps with B).



View larger version (18K):
[in this window]
[in a new window]
 
Figure 1. Schematic representation of the overlap in content between competencies required of a dentist beginning practice (A) and the skills and knowledge tested on initial licensure examinations (B).

 
Many practitioner characteristics that are important for safe and efficacious oral health care are not evaluated on licensure examinations. Prevention and diagnosis were the skills identified in the ADA/AADE practice survey as being performed most frequently. Selecting cases, understanding the science and the alternatives to certain dental procedures, managing the care of patients over time, and making ethical choices are examples of skills that are critical to practice that are not tested on licensure examinations. It often is reported that the most common reasons for disciplining licensed practicing dentists have to do with moral weakness, substance abuse, patient mistreatment, fraud and other factors not tested on board examinations. Kalkwarf9 and Hasegawa4 have argued that one-shot technical procedures are becoming less representative of the standard of care for managing carious lesions.

Validity—preferably predictive validity or content validity as the proportion of the target domain sampled in the test—should be the focus of any discussion about protecting the public. Improving the administration of the test or moving it to before graduation do not address these concerns. Improving examination reliability will not serve as an effective substitute. Figure 2Go shows the relationship between predictive validity and reliability. The often quoted rule that reliability determines validity24,37 is graphed here. The area under the curve depicts theoretically possible combinations of reliability and validity. When reliability is low, predictive validity also is low. When reliability is high, validity might be high or low. In cases in which tests are chosen to be objective or defensible by making them arbitrary or by using simulations, the drive for reliability is most likely to result in combinations of reliability and validity in the lower right-hand portion of the graph.



View larger version (25K):
[in this window]
[in a new window]
 
Figure 2. Relationship between reliability and predictive validity. All combinations of measured reliability and validity below the curve are theoretically possible.

 
One-shot measures of ability are especially susceptible to overstated reliability. In their analysis of the math portfolios of grade school-aged children in Vermont, researchers at the RAND Corporation (Santa Monica, Calif.)38 demonstrated that the student-by-work sample source of variance was the largest of any identified. It was about the same size as student and rater differences combined. Shavelson and colleagues39 reported that this is a common pattern in generalizability studies. Examiner variance is not the only source of error in performance tests, and it almost never is the largest. The greatest error is likely to come from circumstances idiosyncratic to the testing situation, such as patient variability or testing location. Multiple examiners often are used to correct the relatively small amount of errors caused by examiners. Comparable multiple measures are required to address the much larger task and situational sources of error variance. These major sources of error remain invisible in one-shot licensure testing.

In a simulation of state board examinations40 using three live patients with two examiners each for crown preparations, reliability of any single test was estimated to be between r = .10 and r = .15. Increasing interrater consistency from the actual r = .68 to r = .99 would have increased decision consistency by about 1 percent; using two test cases instead of one would increase the consistency by more than 75 percent; three test cases would more than double the consistency.


   PORTFOLIOS AND COMPETENCY
 TOP
 ABSTRACT
 THE ISSUE OF VALIDITY
 PORTFOLIOS AND COMPETENCY
 THE PORTFOLIO APPROACH
 PORTFOLIO EVALUATION AND...
 PORTFOLIO EVALUATION AND THE...
 INITIAL LICENSURE IS POLITICAL...
 CONCLUSIONS
 REFERENCES
 
Oftentimes, the term "portfolio" is used in the sense of a trace or work sample—a notebook with one’s work. Such items often are part of a portfolio, but the term has a larger meaning.4143 For example, consider your own investments. A good portfolio has diversity and balance that is proportional to risk. A portfolio is the best collection of evidence, with multiple replicates from diverse sources, chosen to reduce the risk of decision error. (The technical and operational definition is the evaluation design that maximizes the coefficient of generaliz-ability for a specific decision.) The investing analogy emphasizes multiple sources and often relies on the opinions of knowledgeable advisers, while the decision maker retains ultimate responsibility.

A portfolio is the best collection of evidence, with multiple replicates from diverse sources, chosen to reduce the risk of decision error.

In this section, I explore the connection between evaluation and level of learning. Initial licensure is a decision about competency.44,45 Competency cannot be measured directly; it must be inferred.46 The accuracy of that inference by state boards is the operational definition of validity.35

We should avoid borrowing evaluation methods that work in one context (say, for beginning students) and assuming that these methods will work equally well in other contexts (for example, so-called "ethics tests," which actually measure knowledge of practice acts rather than propensity to practice ethically).749 The tableGo highlights the five stages that professionals go through in their 10- to 15-year growth from novice to expert (earlier versions of the table have appeared in other publications).46,50,51 Learners’ performances at these various stages change, though under ideal circumstances, work products may show small differences. The best way to teach novices is through highly formalized lectures, but that is not the best way to teach experts. Similarly, the best way to evaluate professionals depends on the position of the candidate along the novice-expert continuum.


View this table:
[in this window]
[in a new window]
 
TABLE STAGES IN BECOMING A PROFESSIONAL: CHARACTERISTICS, OPTIMAL LEARNING CONDITIONS AND BEST ASSESSMENT APPROACHES.

 
Novices should be tested. "Test" is a technical term in measurement theory meaning a standardized opportunity to demonstrate one’s ability.24,52 In the first years of dental school, faculty members assume responsibility for what students should do and how they should be evaluated. It makes sense that standardized evaluation should be used because the situations are standardized. Beginners—dental students in the middle of their educations—are experimenting with structuring their work under the guidance of faculty members, and simulations make sense as a way of both teaching and evaluating these students. This is the stage at which constructed examples that exhibit the essential features of realistic situations are presented, and students show some degree of responsibility for managing the context. Examples of simulations include cases and typodont exercises. Students who pass simulation evaluations should be allowed to enter the clinic to begin treating patients while under supervision.

Competency can only be taught in realistic situations and appraised with authentic evaluation. As Messick27 noted, "The evaluation of assessments should demonstrate that the level and sources of task complexity match those of the construct being measured and are attuned to the level of developing expertise of the students assessed." Authentic evaluation refers to performance that is judged by experts and represents performance expected of a candidate in realistic settings—settings that resemble the environments in which candidates are expected to be qualified to work should they past the authentic evaluation.53,54 In dentistry, competency can be demonstrated only with live patients and with the results of managing the care of representative families of patients over an extended period. It requires an authentic portfolio. One-shots and simulations measure at the beginner level, but they do not measure competency.

Dentists in practice are expected to advance to higher levels of performance than competency. These include proficiency and expertise, or mastery. This expectation of continuous professional growth raises some issues regarding the notion of continued competency testing. Periodic reexamination seems inadequate to account for the growth that is expected of dentists in practice.55 Specialists are poorly served when they are required to demonstrate competency with a set of skills they no longer use but not with those skills they profess and intend to use.7

The categories of authentic evaluation are test cases, ratings, and traces or naturally occurring outcomes of practice.46 Initial licensure examinations are examples of test cases. They are samples of realistic performance under semistandardized circumstances in which no assistance is given to the candidate, but the patient’s health is protected. Ratings are professional judgments of characteristics of candidates’ performance observed over time by professionals qualified to make such judgments. Ratings also can be based on candidates’ defenses of case presentations. Traces are patient charts, records of cases completed, patient complaints, audits of infection control and emergency preparedness equipment, and any other evidence that results from patient treatment.


   THE PORTFOLIO APPROACH
 TOP
 ABSTRACT
 THE ISSUE OF VALIDITY
 PORTFOLIOS AND COMPETENCY
 THE PORTFOLIO APPROACH
 PORTFOLIO EVALUATION AND...
 PORTFOLIO EVALUATION AND THE...
 INITIAL LICENSURE IS POLITICAL...
 CONCLUSIONS
 REFERENCES
 
Switching to portfolios is preferable to further attempts to increase the reliability of one-shot initial licensure. Evaluations that have known high validity can be "stepped up" to predetermined standards for reliability by using combinations of examinations. Aiming to increase reliability as the first goal normally sacrifices validity.

A portfolio is a collection of authentic evaluations, including repeated measures from several sources.4143 Portfolio evaluation is a four-step process. First, a responsible entity establishes the competencies expected of those who are evaluated. Next, the responsible entity establishes rules for what will be accepted as evidence. The third step is defining who is qualified to serve as a judge and how the judging will be conducted and reported. Finally, there needs to be consideration of logistics, timetables and due process. Something like a portfolio evaluation system is mandated in American dental schools by the accreditation requirements.

Establishing competencies. All dental schools have written definitions of the competencies expected of graduates. For example, the University of the Pacific School of Dentistry, San Francisco, has 59 competencies. They include such items as "restore single teeth for therapeutic reasons"; "determine differential, provisional, and definitive diagnoses"; "administer and prescribe medications commonly used in dentistry, including local anesthesia, and manage their complications"; "practice consistent with sound business principles and legal requirements and regulations"; "diagnose and treat only within one’s competence"; "assume active responsibility for one’s lifelong learning"; and "participate in organized dentistry." The ADA has developed competency and proficiency standards for the 14 categories of dental professional programs accredited by the Commission on Dental Accreditation.56 The ADEA has approved a core set of competencies for both dentists57 and dental hygienists.58 State boards and testing agencies have yet to do this.

Identifying acceptable evidence. State boards generally have said that sufficient evidence for competency to begin practice includes written, standardized multiple-choice tests; live patient examinations in which the eligibility of the patient and the testing circumstances are carefully prescribed; simulations such as typodont exercises; certification of graduation; and documents showing no criminal history.

Typically, portfolio evaluation goes beyond this limited evidence. Candidates usually are given options, including "any other evidence deemed acceptable" to the examiners. In the strict testing case, the tester retains responsibility for proving whether the candidate is competent. In the portfolio approach, that responsibility is shared with the candidate. Well-known examples of portfolio evaluation include virtually all hiring, promotions in most organizations and selection for honors. Because evaluation for competency must be authentic, unnecessary standardization should be resisted.

Delegating data collection. The next step is to select qualified people to gather and evaluate the evidence and to charge them with developing a fair process for reaching their judgments. Evaluation and decision making normally are separate. State boards have deputized the ADA Joint Commission on National Dental Examinations to test didactic knowledge and law enforcement agencies to perform background checks. In only a handful of states have board members met candidates or seen their dentistry. Boards delegate one-shot testing to examining agencies under very carefully specified conditions.59 They also could delegate multiple-method evaluation to dental schools under the same sort of carefully specified conditions.

Logistics. The final step in portfolio evaluation is to develop logistics, timetables, protocols for managing candidates and others, data and records procedures; and due process standards.

Dental schools have dental records for families of patients who have been treated in comprehensive patient care settings by students over extended periods. The schools can produce patients for examination and provide opportunities for case presentations and oral, patient-based examinations. Test cases can be conducted with varying degrees of standardization and preparation. The dental school could be thought of an as in-office visit with performance requirements.

State boards develop the protocol for managing portfolio evaluation. Once it is known what the competencies are, where the evidence is and who is qualified to judge that evidence, the details of the process can be developed. Most likely, candidates will assemble their portfolios in the final six months of their dental school educations, and visits by board members and their external delegates then will begin and continue until graduation. Some candidates would be judged by boards to be competent to practice on graduation, while others whose competencies must improve or who require one-shot, standardized performance testing or other special evaluations would be appraised shortly before graduation or at a time when they meet the qualifying standards.


   PORTFOLIO EVALUATION AND RELIABILITY
 TOP
 ABSTRACT
 THE ISSUE OF VALIDITY
 PORTFOLIOS AND COMPETENCY
 THE PORTFOLIO APPROACH
 PORTFOLIO EVALUATION AND...
 PORTFOLIO EVALUATION AND THE...
 INITIAL LICENSURE IS POLITICAL...
 CONCLUSIONS
 REFERENCES
 
Figure 3Go shows a schematic view of the competency domains addressed by initial licensure and dental education and what is required to practice dentistry. Naturally, there will be arguments about the details of this representation. However, there probably will be universal agreement that there is substantial overlap, with practice demanding more than education and education covering more than initial licensure testing. The parts of practice not covered in education or initial licensure testing may not be problematic because dentists are excellent on-the-job learners. Competent beginning dentists have become proficient and even expert practitioners in predictably large numbers.60,61



View larger version (19K):
[in this window]
[in a new window]
 
Figure 3. Schematic representation of the overlap in content between competencies required of a dentist beginning practice (A), competencies taught and measured in dental schools (C), and skills and knowledge tested on initial licensure examinations (B).

 
Figure 3Go also illustrates that the combination of what examiners and educators know together about the competency of professionals is greater than what either knows independently. In particular, educators can evaluate diagnosis, patient care management, professionalism and other comprehensive dental competencies, and they have multiple opportunities to observe student performance in realistic patient care settings.

Annually, about 3 to 4 percent of students studying in American dental schools are dismissed, with no opportunity to return.62 The ADA 2000 Survey of Dental Graduates63 places the number of graduates who are licensed and professionally active within one year of graduation at 97 percent. Each year, the public makes complaints against 9 percent of American dentists, and five in 1,000 licensed dentists are disciplined.64 Disciplined dentists represent double false-negative decisions. The schools should not have graduated them, and the boards should not have passed them. In some jurisdictions, practitioners who are disciplined for incompetent practice are remediated by means of having to receive specific hours of education, with dental schools having the final say over whether practice privileges should be restored.

Pass rates on various licensure examinations and the number of newly licensed dentists compared with graduates are recorded by the ADA.65 Over the past several years, the national first-time pass rate on the one-shot examination has been 77 percent. The difference between 77 percent and the 97 percent of graduates who are licensed within a year is a result of candidates taking examinations in more than one jurisdiction or taking the tests several times. Taking tests in multiple jurisdictions also accounts for the difference between the 95 percent pass rate reported by examining agencies66 and the overall 97 percent reported by the ADA.63

It should be intuitive that a testing system that passes virtually everyone after three attempts would have weak reliability. Next, I focus on this idea, with psychometric precision based on classical test theory regarding reliability.24,26,6769 Although I do not present the full details, the discussion is precise enough to demonstrate the insight and the power provided by classical measurement theory tools.

The overall reliability of all one-shot initial licensure examinations given in the United States is approximately .40, meaning that no more than 16 percent of the variance in competence to begin independent dental practice is available in a consistent fashion for those who make licensure decisions. Such a coefficient is low compared to similar high-stakes and professional licensure examinations.23,29,30,70

This estimate is derived from the fact that reliability is the square root of the ratio of true variance to total variance. These variances are known in the case of one-shot initial licensure examinations because the variance of pass-fail tests is the product of the pass rate multiplied by failure rate. Total variance is the true variance plus the error variance, or the variance on a single examination administration. Seventy-seven percent of candidates pass each test. The true pass rate is approximated at 97 percent—the proportion who eventually earn licensure. The square root of (.97 x .03)/(.77 x .23) is .40.

One-shot initial licensure examinations typically contain four or five sections. The Spearman-Brown formula can be used to determine that each section has an average consistency in the range of r = .12 to .15.24 This is remarkably similar to the estimates obtained by Chambers and Loos40 for simulations of a single section of an initial licensure examination in crown preparations.

It is difficult to reconcile the numbers on overall initial and eventual pass rates for initial licensure examinations with the explanation that candidates learn between administrations of these tests. If candidates pass each of three administrations of the tests at the same rate (77 percent), the cumulative pass rate of this series is 97.8 percent. This is essentially the same as the 97 percent overall national pass rate. The ultimate pass rate can be explained entirely in terms of retesting, with no need to assume that learning is occurring. The actual second-time national pass rate is 75 percent.65

Figure 4Go shows the theoretical relationship between reliability and validity. The letter X appears approximately where the initial licensure process might be currently with respect to reliability. What matters is how we propose to change this by any reasonable modification of the process. Simulations, elimination of patients, moving one-shot tests into schools or any restriction of the system for the sake of objectivity will move the X along the path labeled S for simulation.10,71,72 The Objective Standardized Clinical Evaluation is not properly a level of testing. It is a format for presenting items, which may be authentic, but usually are simulations. A better alternative is portfolios—multiple replicates of multiple sources of evidence regarding the range of skills, understanding and values required to start independent practice.



View larger version (31K):
[in this window]
[in a new window]
 
Figure 4. Schematic representation of current initial licensure examination reliability and validity and potential alternatives. X: Approximate current reliability and validity. S: Change to approaches based on simulation (beginner level). P: Changes to multiple replicate, multiple source authentic evaluation (portfolios).

 

   PORTFOLIO EVALUATION AND THE INITIAL LICENSURE DECISION
 TOP
 ABSTRACT
 THE ISSUE OF VALIDITY
 PORTFOLIOS AND COMPETENCY
 THE PORTFOLIO APPROACH
 PORTFOLIO EVALUATION AND...
 PORTFOLIO EVALUATION AND THE...
 INITIAL LICENSURE IS POLITICAL...
 CONCLUSIONS
 REFERENCES
 
Portfolio evaluation is not a single method; it is an approach designed to reduce the risk of poor decisions being made by using multiple measures and multiple balanced sources of information. The four-step process that follows is only one potential way to realize the approach. I offer it here to make the concept more understandable and to demonstrate its feasibility.

Step one. Each licensure jurisdiction (not each testing agency) should start by defining the competencies necessary to begin the independent practice of dentistry. This step normally can be accomplished in a few weekends, each separated by a few months.73 Jurisdictions that generate different competency sets logically would want to use different evaluation systems. The competencies become the evaluation blueprint. Evidence would include candidates’ possessing understanding, skills and supporting values for every aspect of dentistry defined by the boards as being essential to independent dental practice.

A common mistake has been to make licensure decisions with only a general notion of what is required of practitioners. Guidelines for Valid and Reliable Dental Licensure Clinical Examinations35 was based on surveys of some of the most frequently performed procedures and opinions about what would be easiest to test. That work is useful, but it is incomplete to protect the public.

Step two. State boards define what evidence they will need to make their decisions about licensure and what standards this evidence must satisfy to be accepted. Boards should be able to justify their decisions based on evidence about the quality of their decisions. (Several boards are under court-ordered monitoring precisely because they have not been able to provide this assurance.) Boards might say, for example, that candidates are required to present results of written and patient-based examinations, evidence of satisfactory completion of various tasks that dentists perform, and records of ethical conduct that collectively demonstrate competence in all defined areas with a reasonable probable error rate. Note that the responsibility of the candidate is increased, and the board is controlling the quality of its decisions more precisely. (It is a technical matter, but probable error24 is a measurable function of three components: validity [addressed through competency statement blueprints], reliability [addressed through portfolio construction] and measured candidate performance.)

A common mistake is to aim for the single best testing approach possible. The best may not be good enough, as educators and examiners would be quick to tell an incompetent but motivated candidate. An evaluation plan should be designed by putting together the variety of evidence required. When requisite reliability cannot be achieved by satisfactory consistency in the existing evaluation (for example, standardization of conditions, calibration of examiners or clarity of interpretation of standards), it can be achieved by increasing the number of evaluations.74 Test theory has shown that consistency of evaluation can be achieved in two ways: increase internal consistency or increase the number of replications.24,26,3840,68,69,75,76 These approaches are psychometrically interchangeable, meaning that committing to a one-shot testing scheme in advance unnecessarily compromises the quality of licensure decisions.

Here is an example of crafting a portfolio designed to reach a predetermined, acceptable level of confidence. Assume that we are locked into an average reliability for each piece of evidence at r = .15. The portfolio consists of data gathered in dental school under the supervision of boards of dentistry. It includes two endodontic test cases and two crown placements, three comprehensive periodontal management cases, a removable prosthesis and a surgery case, three randomly selected chart reviews and defense of three completed cases with predetermined parameters. This portfolio has a projected reliability of r = .82, even though each part is reliable at the r = .15 level. It explains two-thirds of the variance and is four times as consistent as the average one-shot initial licensure testing system. If evidence of case management, ethical conduct, diagnostic skill and professional involvement were added, the reliability and validity of the licensure decision would be higher. (The psychometric foundation for these calculations is found in classical test theory,24,26,69 the Spearman-Brown formula24 and powerfully in generalizability theory.75,76)

Step three. The concept of delegated authority rests on differentiating examination from licensure or distinguishing between the test and the decision. State boards delegate the gathering of almost all the evidence on which they base licensure decisions, and that system has proven to be workable.59 What boards need to ensure in their delegation processes is that standards exist, that they are monitored and made available to those who need to know about them, and that they are used to manage the relationship between boards and those to whom they delegate. Organizations that perform assessments should be selected based on their performance. There is a tremendous need for boards to make their standards known and to share outcomes data. A chair of the examination committee for the Northeast Regional Testing Boards said that there are no acceptable statistical models to determine reliability of clinical examinations.77 It is possible that such factually inaccurate statements prompted the ADA to consider studies of predictive validity at several House of Delegate meetings.

Higher-quality data now are available at almost no additional cost to the licensure decision process.

There is risk associated with the view that only those who have examined candidates can decide who should be licensed. That approach is not applied uniformly across all evidence boards used. It would be interesting to speculate what sources of evidence would be selected if we started over today and asked those with relevant data to compete based on predetermined standards.

Step four. Cost and fairness are especially important in logistics considerations. The argument seems sound that the cost of one-shot licensure examination (for example, fees, travel, patient concerns) is small relative to the magnitude of the decision being made. That same argument might be extended to justify the collection of the additional data needed to bring the reliability of the decision into the r ≥ .80 range that is considered acceptable by other professional licensing groups.

There are two ways that acceptable reliability can be achieved while actually lowering costs. First, higher-quality data now are available at almost no additional cost to the licensure decision process. These are the data normally collected by highly calibrated faculty members in dental schools. If dental schools demonstrate that they meet the evaluation standards set by boards, just as testing agencies do, they should be able to provide the needed information at a reduced cost. They also can provide information that testing agencies have eschewed.

Because of legal challenges to the one-shot testing system, boards and agencies have placed barriers between themselves and the candidates that restrict access to evidence other than the work product. Subjective determinants of practice such as character, ethics, patient care management and case management are the primary causes of action against licensed dentists, but they are not part of the one-shot format. Subjective data are valid in any decision in which such data will be used to judge performance in practice,78,79 and they have shown good predictive validity in dental education.80 In a series of court decisions, including one by the U.S. Supreme Court, the right of educators to make such judgment decisions has been established. Horowitz v. Curators of the University of Missouri stated that educational evaluation is inherently subjective and that courts will not substitute their decisions for the decisions of educators who are qualified to make such judgments.81

The second way that in-school assembly of portfolio evaluation information reduces cost is that there is a longer period over which the evaluation can be spaced.65 Testing everyone on a weekend after graduation may be superfluous in the cases of some candidates, but it certainly provides too little information in the cases of candidates who are near the cut-off score. Ambiguous results should be explored until they can be resolved. The rule should be to test until a predetermined acceptable level of confidence is established that the candidate either is or is not competent. The level of confidence in protecting the public should be predetermined and not varied; instead, the amount of evidence needed to reach this standard should be varied. If some of the testing takes place in dental schools, the amount of postgraduation testing can be reduced substantially, hence lowering the overall cost of initial licensure examination.

The ultimate criterion for fairness in initial licensure decisions is a high level of predictive validity. It is patently unfair to design examinations to protect testing agencies. It is the public boards are empowered to protect. Fortunately, alternatives to one-shot practical tests, such as the portfolio system, are fairer in both the details of test administration and in the likely validity of licensure decisions.

Because state boards are strapped for resources, they have delegated the data-gathering portion of their responsibility but not their decision-making responsibility regarding initial licensure to regional testing agencies. The use of portfolios does not require expenditure of any further resources by boards because portfolios are a richer and alternative way of gathering data. The use of portfolios by regional testing agencies can be mandated by state boards or mediated and delegated in part to dental schools.


   INITIAL LICENSURE IS POLITICAL AS WELL AS RATIONAL
 TOP
 ABSTRACT
 THE ISSUE OF VALIDITY
 PORTFOLIOS AND COMPETENCY
 THE PORTFOLIO APPROACH
 PORTFOLIO EVALUATION AND...
 PORTFOLIO EVALUATION AND THE...
 INITIAL LICENSURE IS POLITICAL...
 CONCLUSIONS
 REFERENCES
 
It is unlikely that initial licensure decisions will be made differently in the future. Alternatives are available now that outperform the current system in terms of reliability, validity, cost and fairness. Licensure is a political decision. Boards recently have been sunsetted in Texas and California, and legislatures have found ways around one-shot tests in New York, California and North Dakota. Boards in Hawaii and Florida are under court-appointed scrutiny, and the ADA and ADEA have passed policy resolutions calling on boards and testing agencies to discuss alternatives to the present system. It is, however, unlikely that any appreciable change will take place based on what is known about psychometrics or the science of making sound decisions.

It is necessary to inventory what interests might be threatened by a portfolio system that substantially increases the soundness of initial licensure decisions and does so at less cost and with greater fairness than the current system.

Marginal dental students may object to such a system. Decisions that are based on repeated trials of a test with low reliability produce substantial numbers of false-positive decisions when the results of early trials are not considered. Candidates who fail one-shot tests are given multiple opportunities to prove that the initial decision was wrong; those who pass are not re-examined to prove that the decision was correct. The current system appears to protect more adequately the rights of the candidate than the rights of the public.

Some dental educators will find fault with portfolios on the grounds that dental boards will be coming into the schools. Boards and schools, however, are in the same business, and they can be more effective working together than independently. It may be inconvenient for schools to open their records to examiners, and it may be uncomfortable for them to discuss frankly how they determine competency. However, it will be better for the profession and the public if this is done. In the end, examiners can improve the quality of dental education just as educators can provide information about candidates that strengthens the boards’ charge of protecting the public.

Regional testing agencies might be jealous of any new partner that the boards would include. If they are, they should state why. The matter of trusting our professional partners to be free of conflicts of interest is a challenge that goes both ways. It is better to define operationally what must be done to ensure a working relationship than to begin with an untested assumption that others cannot be trusted.71

The greatest objection to portfolio evaluation will come from those who are unwilling to stand behind their professional judgments. Perhaps a few will have read this entire article without believing the statement "It is not the test that matters: it is the decision." Competence to join a profession is determined by comprehensive judgment. No one but the members of state dental boards can make such judgments. They have available a much broader range of quality evidence on which to make such judgments than they have used so far. The public would welcome the additional level of protection afforded them through expanding the portfolio of initial competency evidence.


   CONCLUSIONS
 TOP
 ABSTRACT
 THE ISSUE OF VALIDITY
 PORTFOLIOS AND COMPETENCY
 THE PORTFOLIO APPROACH
 PORTFOLIO EVALUATION AND...
 PORTFOLIO EVALUATION AND THE...
 INITIAL LICENSURE IS POLITICAL...
 CONCLUSIONS
 REFERENCES
 
A portfolio process for evaluating professional competence to begin practice increases the scope and amount of data available to examiners while reducing cost. At no time would boards relinquish their control over defining what competency is, what evidence is acceptable for demonstrating competence, who will judge competence, what standards they must meet and how the process is to function. What will be reduced is the cost of producing all the sources of evidence and the restrictions that costs and tradition have placed on the range of information considered.

The portfolio approach meets the seven criteria for an effective system I discussed at the beginning of this article. Portfolios will reduce the number of false-positive licensure decisions by providing more valid evidence than the one-shot, restricted range test does. Using a portfolio approach is preferable to abandoning live patients, substituting advanced education as a bypass or using simulations. At every stage, boards control the licensure process as they take advantage of all the partners who are able to help in this responsibility.



View larger version (127K):
[in this window]
[in a new window]
 
Dr. Chambers is associate dean for academic affairs and scholarship, School of Dentistry, University of the Pacific, 2155 Webster St., San Francisco, Calif. 94115, e-mail "dchambers{at}pacific.edu". Address reprint requests to Dr. Chambers.

 


   REFERENCES
 TOP
 ABSTRACT
 THE ISSUE OF VALIDITY
 PORTFOLIOS AND COMPETENCY
 THE PORTFOLIO APPROACH
 PORTFOLIO EVALUATION AND...
 PORTFOLIO EVALUATION AND THE...
 INITIAL LICENSURE IS POLITICAL...
 CONCLUSIONS
 REFERENCES
 
  1. Buchanan R. Problems related to the use of human subjects in clinical evaluation/responsibility for follow-up care. J Dent Educ 1991;55:797–801.[Medline]

  2. Chiodo GT, Tolle SW. An ethics perspective on licensure by state board examinations. Gen Dent 1996;44:18–25.[Medline]

  3. Formicola AJ, Shub JL, Murphy FJ. Banning live patients as test subjects on licensing examinations. J Dent Educ 2002;66:605–9.[Abstract]

  4. Hasegawa TK. Ethical issues of performing invasive/irreversible dental treatment for purposes of licensure. J Am Coll Dent 2002;69(2):43–6.[Medline]

  5. Jenson LE. Is it ethical to involve patients in state board examinations? J Am Coll Dent 2002;69(2):39–42.

  6. Berry T. The board examination: a true test or only a rite of passage? Oper Dent 1995;20:85.[Medline]

  7. Dugoni AA. Licensure: entry level examinations—strategies for the future. J Dent Educ 1992;56:251–3.[Medline]

  8. Howard WW. What do state dental board examinations really accomplish? J Acad Gen Dent 1991;39:310.

  9. Kalkwarf K. Commentary on Melnick, Pattalochi, and Formicola papers. J Dent Educ 2002;66:610–1.

  10. Pattalochi RE. Patients on clinical board examinations: an examiner’s perspective. J Dent Educ 2002;66:600–4.[Medline]

  11. Collins W. Conflicting interests of state boards and the public welfare. J Dent Educ 1985;49:743–5.[Medline]

  12. Damiano PC, Shugars DA, Freed JR. Clinical board examinations: variation found in pass rates. JADA 1992;123:68–73.

  13. Gaines JH. Licensure. J Dent Educ 1993;57:221–2.[Medline]

  14. Hangorsky U. Clinical competency levels of fourth-year dental students as determined by board examiners and faculty members. JADA 1981;102:35–7.

  15. Meskin L. Time for a dental board checkup. JADA 1994;125: 1418–20.

  16. Meskin L. Dental licensure revisited. JADA 1996;127:292–4.

  17. American Dental Association. Supplement to annual reports and resolutions. Vol. 2. Chicago: American Dental Association; 2000:5090–3.

  18. American Dental Association, Office of Student Affairs. Dental boards and licensure information for the new graduate. Chicago: American Dental Association; 2001.

  19. American Dental Education Association. ADEA policy statement: licensure and certification goals. J Dent Educ 2001;65:656.

  20. Swankin DA. Assuring the continued competence of health care professionals in the public interest. Citizen Advocacy Center. J Am Coll Dent 1997;64(4):35–8.[Medline]

  21. Kirkham D. A fresh look at how dental schools prepare dentists for today’s practice. J Am Coll Dent 2001;68(3):16–9.

  22. Meeske J, Fortman J, Feil P. Clinical board examinations: feedback from graduates (abstract 74). J Dent Educ 1995;59:351.

  23. American Educational Research Association, American Psychological Association, National Council on Measurement in Education. Standards for educational and psychological testing. Washington: American Psychological Association; 1985.

  24. Cronbach LJ. Essentials of psychological testing. 5th ed. New York: Harper & Row; 1990.

  25. Cronbach LJ. Construct validation after thirty years. In: Humphreys LG, Linn RL, ed. Intelligence: measurement, theory, and public policy. Urbana, Ill.: University of Illinois Press; 1989.

  26. Crocker L, Algina J. Introduction to classical and modern test theory. New York: Holt, Rinehart, and Winston; 1986.

  27. Messick S. The interplay of evidence and consequences in the validation of performance assessments. Educ Res 1994;23(2):13–23.

  28. Cronbach LJ. Five perspectives on validity arguments. In: Wainer H, ed. Test validity. Hillsdale, N.J.: Erlbaum; 1988:58–83.

  29. Kane M. The validity of licensure examinations. Am Psychol 1982;37:911–8.

  30. LaDuca A. Validation of professional licensure examinations: professions theory, test design, and construct validity. Eval Health Prof 1994;17:178–97.[Abstract/Free Full Text]

  31. Linn RL, Baker EL, Dunbar SB. Complex, performance-based assessment: expectations and validation criteria. Educ Res 1991;20(8):5–21.

  32. Messick S. Validity. In: Linn RL, ed. Educational measurement. 3rd ed. Washington: American Council on Education and National Council on Measurement in Education; 1989:372–411.

  33. Moss PA. Shifting conceptions of validity in educational measurement: implications for performance assessment. Rev Educ Res 1992;62:229–58.

  34. Smith IL, Hambelton RK. Content validity studies of licensing examinations. Educ Meas Issues Prac 1990;9(4):7–10.

  35. American Dental Association, American Association of Dental Examiners Licensure Committee. Guidelines for valid and reliable dental licensure clinical examinations. Chicago: American Dental Association and American Association of Dental Examiners; 1992:19, 27.

  36. Kane M. Validating the performance standards associated with passing scores. J Educ Res 1994;64:425–61.

  37. Moss PA. Can there be validity without reliability? Educ Res 1994;23:5–12.

  38. Koretz D, Stechner B, Klein S, McCaffrey D. The Vermont portfolio assessment program. Educ Meas Issues Prac 1994;13:5–18.

  39. Shavelson RJ, Baxter GP, Gao X. Sampling variability of performance assessments. J Educ Meas 1993;30:215–32.

  40. Chambers DW, Loos L. Analyzing the sources of unreliability in fixed prosthodontics mock board examinations. J Dent Educ 1997;61:346–53.[Abstract]

  41. Arter JA, Spandel V. Using portfolios of student work in instruction and assessment. Educ Meas Issues Prac 1992:11(1):36–44.

  42. Jensen GM, Saylor C. Portfolios and professional development in the health professions. Eval Health Prof 1994;17:344–57.[Abstract/Free Full Text]

  43. Moss PA, Beck JS, Matson B, Muchmore J, Steele D, Taylor C. Portfolios, accountability, and an interpretative approach to validity. Educ Meas Issues Prac 1992;11:12–21.

  44. Chambers, DW. Toward a competency-based curriculum. J Dent Educ 1993;57:790–3.[Medline]

  45. Chambers DW. Competencies: a new view of becoming a dentist. J Dent Educ 1994;58:342–5.[Medline]

  46. Chambers DW, Glassman P. A primer on competency-based evaluation. J Dent Educ 1997;61:651–66.[Medline]

  47. Baxter GP, Glaser R. Investigating the cognitive complexity of science assessments. Educ Meas Issues Prac 1998;17:37–45.

  48. Fitzpatrick R, Morrison EJ. Performance and product evaluation. In: Thorndike RL, ed. Educational measurement. Washington: American Council on Education; 1971:237, 270.

  49. Stiggins RJ. The design and development of performance assessments. Educ Meas Issues Prac 1988;6:33–42.

  50. Chambers DW, Gilmore CJ, Maillet JO, Mitchell BE. Another look at competency-based education in dietetics. J Am Diet Assoc 1996;96:614–7.[Medline]

  51. Chambers DW. Competency-based dental education in context. Eur J Dent Educ 1998;2(1):8–13.[Medline]

  52. Ebel RL. Measuring educational achievement. Englewood Cliffs, N.J.: Prentice-Hall; 1965.

  53. Wiggins G. A true test: toward more authentic and equitable assessment. Phi Delta Kappa 1989;79:703–13.

  54. Wiggins G. Standards should mean qualities, not quantities. Educ Week Jan. 24, 1990:25, 36.

  55. Chambers DW. Continuing competency does not equal current qualification. The Daily Grind July/August 1997:7, 15.

  56. Commission on Dental Education. Accreditation standards for dental education programs. Chicago: American Dental Association; 1998.

  57. American Dental Education Association. Competencies for the new dentist. J Dent Educ 1997;61(7):556–8.

  58. American Dental Education Association. Competencies for entry into the profession of dental hygiene. J Dent Educ 2000;64(7):512–6.

  59. Cole JR 2nd. Legal status of dentistry and licensure. J Am Coll Dent 2002;69(2):6–12.

  60. Chambers DW, Eng WR Jr. Practice profile: the first twelve years. J Calif Dent Assoc 1994;22:25–32.

  61. Chambers DW, Budenz AW, Fredekind RE, Nadershahi NA. Debt and practice profiles of beginning dental practitioners. J Calif Dent Assoc 2002;30:909–14.

  62. Division of Educational Policy and Research. 1999/2000 Survey of dental educators. Chicago: American Dental Association; 2002.

  63. American Dental Association. The 2000 survey of dental graduates. Chicago: American Dental Association; 2002.

  64. American Association of Dental Examiners. Composite. Chicago: American Association of Dental Examiners; 1997.

  65. American Dental Association. 2000 Survey of clinical testing agencies. Chicago: American Dental Association; 2001.

  66. Low D. The changing face of the clinical examination. ADA News Nov. 18, 2002:2.

  67. Chambers DW. The roles of evidence and the baseline in dental decision making. J Amer Coll Dent 1999;66(2):60–8.

  68. Feldt LS, Brennan RL. Reliability. In: Linn RL, ed. Educational measurement. Washington: American Council on Education and the National Council on Measurement in Education; 1989:318–71.

  69. Stanley JC. Reliability. In: Thorndike R, ed. Educational measurement. Washington: American Council on Education; 1971:356–442.

  70. McGaghie W. Professional competence evaluation. Educ Res 1991;20:3–9.

  71. Cole JR 2nd, Maitland RI. A response from the American Association of Dental Examiners. J Am Coll Dent 2002;69(2):47–9.

  72. Yaple N, Metzler J, Wallace W. Results of the Ohio non-patient dental board examinations for 1990 and 1991. J Dent Educ 1992;56:248–50.[Medline]

  73. Chambers DW, Gerrow JD. Manual for developing and formatting competency statements. J Dent Educ 1994;58:361–6.[Medline]

  74. Reckase MD. Portfolio assessment: a theoretical estimate of score reliability. Educ Meas Issues Prac 1995;14:12–14, 31.

  75. Brennan RL. Elements of generalizability theory. Iowa City, Iowa: American College Testing Program; 1983.

  76. Cronbach LJ. The dependability of behavioral measurements: theory of generalizability for scores and profiles. New York: Wiley; 1972.

  77. Cartwright CB. NERB’s methods in developing and administering examinations and evaluating results. Paper presented at: North East Regional Board symposium, Ohio Dental Association; 1994; Columbus, Ohio.

  78. Chambers DW. Objectivity is not enough. JADA 1975;91:1157–61.

  79. Phillips DC. Subjectivity and objectivity: an objective inquiry. In: Eisner EW, Peshkin A, eds. Qualitative inquiry in education: The continuing debate. New York: Teachers College Press; 1990.

  80. Chambers DW, Leknius C, Woodson R. Predictive validity of instructor judgment in preclinical technique courses. J Dent Educ 1997;61:736–40.[Medline]

  81. Horowitz v. Board of Curators of the University of Missouri, 74CV47-W-3 (Sup. Ct. 1975).




This article has been cited by other articles:


Home page
J Dent EducHome page
L. M. Neumann and R. L. MacNeil
Revisiting the National Board Dental Examination
J Dent Educ., October 1, 2007; 71(10): 1281 - 1292.
[Abstract] [Full Text] [PDF]


Home page
J Dent EducHome page
C. M. Stewart, R. E. Bates Jr., and G. E. Smith
Relationship Between Performance in Dental School and Performance on a Dental Licensure Examination: An Eight-Year Study
J Dent Educ., August 1, 2005; 69(8): 864 - 869.
[Abstract] [Full Text] [PDF]


Home page
J Dent EducHome page
R. J. Beltran-Neira and E. D. Beltran-Aguilar
Taxonomy for Competency-Based Dental Curricula
J Dent Educ., September 1, 2004; 68(9): 978 - 984.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow