, Jamie Bayliss2
1Department of Physical Therapy, University of Dayton, Dayton, OH, USA
2Department of Physical Therapy, Mount Saint Joseph University, Cincinnati, OH, USA
© 2025 Korea Health Personnel Licensing Examination Institute
This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Authors’ contributions
Conceptualization: SG, JB. Data curation: SG, JB. Methodology/formal analysis/validation: SG. Project administration: SG. Funding acquisition: not applicable. Writing–original draft: SG. Writing–review & editing: SG, JB.
Conflict of interest
No potential conflict of interest relevant to this article was reported.
Funding
None.
Data availability
Data files are available from https://doi.org/10.7910/DVN/DWRN8Q
Dataset 1. Student and Clinical Instructor Performance Instrument validation study raw data.
Acknowledgments
None.
| Validity evidence type | Meaning | SCIPAI evidence |
|---|---|---|
| Content | Content represents construct being assessed | Student performance content based upon prior instruments; CI performance content based upon accreditation standards; Student and CI performance items developed by credentialed CIs and reviewed by academic clinical educator content experts |
| Response process | Evidence of data integrity | Student and CI raters uniformly trained in use of instrument; simple, feasible scoring instructions and process; clean data entry & management |
| Internal structure | Relationships of items within the assessment tool (e.g., reliability, internal consistency) | Inter-item correlations, corrected item-total correlations, and split-half analyses all support acceptable internal structure reliability (See Supplement 3. Internal structure analyses) |
| Relations with other variables | Assessment item associations with variables external to the assessment (e.g., concurrent and predictive validity, time/group-based score differences) | Concurrent and predictive validity analyses; progression of scores analyses (see Tables 2–5) |
| Consequences | Impact on examinee, teaching, and learning | As a low-stakes, formative tool, the intention is for the use of the tool to increase communication to allow addressing of opportunities for improvement of teaching and learning. Comments on later SCIPAIs indicate addressing items highlighted on earlier SCIPAIs |
| Generalizability/reliability | Adequate number of cases, assessment encounters, raters | Power analysis for comparisons (N=199) and correlations (N=89) each exceeded by hundreds of cases, with N of raters equivalent to N of cases. Each rater/ratee pair had 4 assessment encounters |
| Final SCIPAI item | Final CPI item | Clinical instructor as rater | Student as rater | ||||
|---|---|---|---|---|---|---|---|
| No. of participants | Effect sizea) | Cohen categoryb) | No. of participants | Effect sizea) | Cohen categoryb) | ||
| Concurrent validity item pairs | |||||||
| Evaluation | Examination | 609 | 0.362 | Medium | 661 | 0.300 | Medium |
| Evaluation | Evaluation | 609 | 0.355 | Medium | 661 | 0.293 | Small |
| Treatment | Plan of care | 610 | 0.380 | Medium | 661 | 0.392 | Medium |
| Treatment | Procedural intervention | 610 | 0.386 | Medium | 661 | 0.404 | Medium |
| Treatment | Educational intervention | 610 | 0.372 | Medium | 661 | 0.401 | Medium |
| Communication | Communication | 608 | 0.305 | Medium | 660 | 0.285 | Small |
| Professional behavior | Professional behavior | 608 | 0.203 | Small | 659 | 0.137 | Small |
| Sum of SCIPAI scores (0–400) | Sum of CPI scoresd) (18–378) | 615 | 0.415 | Medium | 660 | 0.388 | Medium |
| Predictive validity item pairs | |||||||
| Sum of scores SCIPAIc) #1 (0–400) | Sum of scores final CPId) (18–378) | 630 | 0.275 | Small | 672 | 0.150 | Small |
| Sum of scores SCIPAIc) #2 (0–400) | Sum of scores final CPId) (18–378) | 697 | 0.331 | Medium | 728 | 0.253 | Small |
| Sum of scores SCIPAIc) #3 (0–400) | Sum of scores final CPId) (18–378) | 650 | 0.334 | Medium | 689 | 0.266 | Small |
SCIPAI, Student & Clinical Instructor Performance Assessment Instrument; CPI, Clinical Performance Instrument.
a) P-values for all analyses significant, P<0.01.
b) Cohen’s r guidelines for correlational effect sizes (small 0.10–<0.30; medium 0.30–<0.50; large ≥0.50).
c) Sum of 4 SCIPAI items each rated on a 0–100 scale; total possible sum 0–400.
d) Sum of 18 CPI items each rated on a 1–21 scale; total possible sum 18–378.
| Item | Rater | No. of participants | SCIPAI 1 mean score | SCIPAI 4 mean score | Z-scorea) | P-valuea) |
|---|---|---|---|---|---|---|
| CI performance | ||||||
| Supervision | CI | 478 | 91.6 | 90.1 | 3.105 | 0.002* |
| Supervision | Student | 590 | 95.9 | 94.1 | 1.231 | 0.218 |
| Feedback | CI | 477 | 91.7 | 93.1 | 5.610 | <0.001* |
| Feedback | Student | 588 | 96.6 | 96.1 | 3.553 | <0.001* |
| Student performance | ||||||
| Evaluation | CI | 506 | 52.8 | 86.6 | 18.393 | <0.001* |
| Evaluation | Student | 572 | 49.9 | 84.2 | 19.573 | <0.001* |
| Treatment | CI | 514 | 51.1 | 86.4 | 18.653 | <0.001* |
| Treatment | Student | 585 | 49.5 | 84.9 | 20.271 | <0.001* |
| Communication | CI | 518 | 79.4 | 94.4 | 14.447 | <0.001* |
| Communication | Student | 587 | 78.9 | 93.3 | 15.628 | <0.001* |
| Professional behavior | CI | 520 | 95.5 | 99.1 | 7.959 | <0.001* |
| Professional behavior | Student | 587 | 94.0 | 98.8 | 12.069 | <0.001* |
| Sum of scores | CI | 521 | 270.6 | 363.3 | 18.679 | <0.001* |
| Sum of scores | Student | 582 | 275.0 | 361.12 | 20.456 | <0.001* |
| Hypothesis | Validity assessed | Statistical analysis | Result threshold | Hypothesis/validity outcome |
|---|---|---|---|---|
| Student SCIPAI scores will increase during CEEs. | Construct | Wilcoxon signed rank | P<0.05 | Confirmed |
| CI SCIPAI scores will increase during CEEs. | Construct | Wilcoxon signed rank | P<0.05 | Not confirmed |
| Student final SCIPAI scores will correlate with like-item final CPI scores. | Concurrent | Spearman rank correlation | rs≥0.30a) | Confirmed |
| Student scores on earlier SCIPAIs will correlate with like-item final CPI scores. | Predictive | Spearman rank correlation | rs≥0.30a) | Confirmed |
| Validity evidence type | Meaning | SCIPAI evidence |
|---|---|---|
| Content | Content represents construct being assessed | Student performance content based upon prior instruments; CI performance content based upon accreditation standards; Student and CI performance items developed by credentialed CIs and reviewed by academic clinical educator content experts |
| Response process | Evidence of data integrity | Student and CI raters uniformly trained in use of instrument; simple, feasible scoring instructions and process; clean data entry & management |
| Internal structure | Relationships of items within the assessment tool (e.g., reliability, internal consistency) | Inter-item correlations, corrected item-total correlations, and split-half analyses all support acceptable internal structure reliability (See |
| Relations with other variables | Assessment item associations with variables external to the assessment (e.g., concurrent and predictive validity, time/group-based score differences) | Concurrent and predictive validity analyses; progression of scores analyses (see |
| Consequences | Impact on examinee, teaching, and learning | As a low-stakes, formative tool, the intention is for the use of the tool to increase communication to allow addressing of opportunities for improvement of teaching and learning. Comments on later SCIPAIs indicate addressing items highlighted on earlier SCIPAIs |
| Generalizability/reliability | Adequate number of cases, assessment encounters, raters | Power analysis for comparisons (N=199) and correlations (N=89) each exceeded by hundreds of cases, with N of raters equivalent to N of cases. Each rater/ratee pair had 4 assessment encounters |
| Final SCIPAI item | Final CPI item | Clinical instructor as rater | Student as rater | ||||
|---|---|---|---|---|---|---|---|
| No. of participants | Effect size |
Cohen category |
No. of participants | Effect size |
Cohen category |
||
| Concurrent validity item pairs | |||||||
| Evaluation | Examination | 609 | 0.362 | Medium | 661 | 0.300 | Medium |
| Evaluation | Evaluation | 609 | 0.355 | Medium | 661 | 0.293 | Small |
| Treatment | Plan of care | 610 | 0.380 | Medium | 661 | 0.392 | Medium |
| Treatment | Procedural intervention | 610 | 0.386 | Medium | 661 | 0.404 | Medium |
| Treatment | Educational intervention | 610 | 0.372 | Medium | 661 | 0.401 | Medium |
| Communication | Communication | 608 | 0.305 | Medium | 660 | 0.285 | Small |
| Professional behavior | Professional behavior | 608 | 0.203 | Small | 659 | 0.137 | Small |
| Sum of SCIPAI scores (0–400) | Sum of CPI scores |
615 | 0.415 | Medium | 660 | 0.388 | Medium |
| Predictive validity item pairs | |||||||
| Sum of scores SCIPAI |
Sum of scores final CPI |
630 | 0.275 | Small | 672 | 0.150 | Small |
| Sum of scores SCIPAI |
Sum of scores final CPI |
697 | 0.331 | Medium | 728 | 0.253 | Small |
| Sum of scores SCIPAI |
Sum of scores final CPI |
650 | 0.334 | Medium | 689 | 0.266 | Small |
| Item | Rater | No. of participants | SCIPAI 1 mean score | SCIPAI 4 mean score | Z-score |
P-value |
|---|---|---|---|---|---|---|
| CI performance | ||||||
| Supervision | CI | 478 | 91.6 | 90.1 | 3.105 | 0.002 |
| Supervision | Student | 590 | 95.9 | 94.1 | 1.231 | 0.218 |
| Feedback | CI | 477 | 91.7 | 93.1 | 5.610 | <0.001 |
| Feedback | Student | 588 | 96.6 | 96.1 | 3.553 | <0.001 |
| Student performance | ||||||
| Evaluation | CI | 506 | 52.8 | 86.6 | 18.393 | <0.001 |
| Evaluation | Student | 572 | 49.9 | 84.2 | 19.573 | <0.001 |
| Treatment | CI | 514 | 51.1 | 86.4 | 18.653 | <0.001 |
| Treatment | Student | 585 | 49.5 | 84.9 | 20.271 | <0.001 |
| Communication | CI | 518 | 79.4 | 94.4 | 14.447 | <0.001 |
| Communication | Student | 587 | 78.9 | 93.3 | 15.628 | <0.001 |
| Professional behavior | CI | 520 | 95.5 | 99.1 | 7.959 | <0.001 |
| Professional behavior | Student | 587 | 94.0 | 98.8 | 12.069 | <0.001 |
| Sum of scores | CI | 521 | 270.6 | 363.3 | 18.679 | <0.001 |
| Sum of scores | Student | 582 | 275.0 | 361.12 | 20.456 | <0.001 |
| Item | Rater | No. of participants | CIs with positive changes SCIPAI 1–SCIPAI 4 | CIs with no changes SCIPAI 1–SCIPAI 4 | CIs with negative changes SCIPAI 1–SCIPAI 4 |
|---|---|---|---|---|---|
| Supervision | CI | 478 | 172 | 231 | 75 |
| Supervision | Student | 590 | 141 | 370 | 79 |
| Feedback | CI | 477 | 177 | 238 | 62 |
| Feedback | Student | 588 | 134 | 393 | 61 |
| Total | All | 2,133 | 624 | 1,232 | 277 |
| Hypothesis | Validity assessed | Statistical analysis | Result threshold | Hypothesis/validity outcome |
|---|---|---|---|---|
| Student SCIPAI scores will increase during CEEs. | Construct | Wilcoxon signed rank | P<0.05 | Confirmed |
| CI SCIPAI scores will increase during CEEs. | Construct | Wilcoxon signed rank | P<0.05 | Not confirmed |
| Student final SCIPAI scores will correlate with like-item final CPI scores. | Concurrent | Spearman rank correlation | rs≥0.30 |
Confirmed |
| Student scores on earlier SCIPAIs will correlate with like-item final CPI scores. | Predictive | Spearman rank correlation | rs≥0.30 |
Confirmed |
SCIPAI, Student and Clinical Instructor Performance Assessment Instrument; CI, clinical instructor.
SCIPAI, Student & Clinical Instructor Performance Assessment Instrument; CPI, Clinical Performance Instrument. P-values for all analyses significant, P<0.01. Cohen’s r guidelines for correlational effect sizes (small 0.10–<0.30; medium 0.30–<0.50; large ≥0.50). Sum of 4 SCIPAI items each rated on a 0–100 scale; total possible sum 0–400. Sum of 18 CPI items each rated on a 1–21 scale; total possible sum 18–378.
SCIPAI, Student & Clinical Instructor Performance Assessment Instrument; CI, clinical instructor. *P<0.05 (statistically significant). Repeated measure Wilcoxon signed rank test.
CI, clinical instructor; SCIPAI, Student & Clinical Instructor Performance Assessment Instrument.
SCIPAI, Student and Clinical Instructor Performance Assessment Instrument; CEEs, clinical education experiences; CI, clinical instructor; CPI, Clinical Performance Instrument. Cohen’s guidelines for social science research: medium effect rs ≥0.30, large effect rs ≥0.50.