Title: | Observers Needed to Evaluate Subjective Tests |
---|---|
Description: | This ONEST software implements the method of assessing the pathologist agreement in reading PD-L1 assays (Reisenbichler et al. (2020 <doi:10.1038/s41379-020-0544-x>)), to determine the minimum number of evaluators needed to estimate agreement involving a large number of raters. Input to the program should be binary(1/0) pathology data, where “0” may stand for negative and “1” for positive. Additional examples were given using the data from Rimm et al. (2017 <doi:10.1001/jamaoncol.2017.0013>). |
Authors: | Gang Han [aut, cre], Baihong Guo [aut] |
Maintainer: | Gang Han <[email protected]> |
License: | GPL-3 |
Version: | 0.1.0 |
Built: | 2025-02-02 05:41:24 UTC |
Source: | https://github.com/hangangtrue/onest |
A precomputed dataset only used by the ONEST_vignettes function to create the vignettes.
data('empirical')
data('empirical')
columns : different permutations
rows : agreement percentage
NCCN 22c3 tumor data from Rimm et al. (2017) used to illustrate the ONEST method.
data('NCCN_22c3_t')
data('NCCN_22c3_t')
columns : different raters
rows : different cases
[1] Rimm DL, Han G, Taube JM, Yi ES, Bridge JA, Flieder DB, et al. A Prospective, Multi-institutional, Pathologist-Based Assessment of 4 Immunohistochemistry Assays for PD-L1 Expression in Non-Small Cell Lung Cancer. JAMA Oncol. 2017;3(8):1051-8, DOI: 10.1001/jamaoncol.2017.0013, PMID: 28278348.
NCCN SP142 stromal data from Rimm et al. (2017) used to illustrate the ONEST method.
data('NCCN_sp142')
data('NCCN_sp142')
columns : different raters
rows : different cases
[1] Rimm DL, Han G, Taube JM, Yi ES, Bridge JA, Flieder DB, et al. A Prospective, Multi-institutional, Pathologist-Based Assessment of 4 Immunohistochemistry Assays for PD-L1 Expression in Non-Small Cell Lung Cancer. JAMA Oncol. 2017;3(8):1051-8, DOI: 10.1001/jamaoncol.2017.0013, PMID: 28278348.
NCCN SP142 tumor data from Rimm et al. (2017) used to illustrate the ONEST method.
data('NCCN_sp142_t')
data('NCCN_sp142_t')
columns : different raters
rows : different cases
[1] Rimm DL, Han G, Taube JM, Yi ES, Bridge JA, Flieder DB, et al. A Prospective, Multi-institutional, Pathologist-Based Assessment of 4 Immunohistochemistry Assays for PD-L1 Expression in Non-Small Cell Lung Cancer. JAMA Oncol. 2017;3(8):1051-8, DOI: 10.1001/jamaoncol.2017.0013, PMID: 28278348.
The ONEST software implements a statistical method in Reisenbichler et al. (2020[1]), to determine the minimum number of evaluators needed to estimate agreement involving a large number of raters. This method could be utilized by regulatory agencies, such as the FDA, when evaluating agreement levels of a newly proposed subjective laboratory test. Input to the program should be binary(1/0) pathology data, where “0” may stand for negative and “1” for positive. The example datasets in this software are from Rimm et al. (2017[2]) (the SP142 assay), and Reisenbichler et al. 2020. This program can run in R version 3.5.0 and above.
[1] Reisenbichler ES, Han G, Bellizzi A, Bossuyt V, Brock J, Cole K, et al. Prospective multi-institutional evaluation of pathologist assessment of PD-L1 assays for patient selection in triple negative breast cancer. Mod Pathol. 2020, DOI: 10.1038/s41379-020-0544-x; PMID: 32300181.
[2] Rimm DL, Han G, Taube JM, Yi ES, Bridge JA, Flieder DB, et al. A Prospective, Multi-institutional, Pathologist-Based Assessment of 4 Immunohistochemistry Assays for PD-L1 Expression in Non-Small Cell Lung Cancer. JAMA Oncol. 2017;3(8):1051-8, DOI: 10.1001/jamaoncol.2017.0013, PMID: 28278348.
This function uses the binary (0/1) pathology data to test if the agreement proportion will converge to 0 with an increasing number of observers.
ONEST_inflation_test(Y)
ONEST_inflation_test(Y)
Y |
pathology data as a matrix (missing values are allowed) |
p_value: chi-square p-value, a small p-value indicates significant evidence that the observers’ agreement will converge to a non-zero proportion.
data("sp263_bin") ONEST_inflation_test(sp263_bin)
data("sp263_bin") ONEST_inflation_test(sp263_bin)
This is the ONEST main function taking binary (0/1) pathology data as input. This function utilizes the plotline function to produce the ONEST graphs and estimates.
ONEST_main(data)
ONEST_main(data)
data |
a matrix containing the binary pathology data. Each row is the data from one case, and each column is the data from one rater. Missing values are allowed and can be denoted as NA or left blank. If there are n cases and k raters, the input ‘data’ is a matrix with dimension n by k. |
consist_p: a vector of length k-1, indicating proportion of identical reads among a set of pathologists. For example, the first element of “consist_p” is the estimate of agreement percentage for 2 raters. The k-1 th element is the estimate of agreement percentage for k raters.
consist_low: a vector of length k-1, indicating the lower bound of the agreement percentage with 95 percent confidence level corresponding to “consist_p”.
diff_consist: a vector of length k-2, indicating the difference between the consist_p. For example, the first element of “diff_consist” is the estimated difference of agreement percentage after increasing from 2 to 3 raters. The k-2 th element is the difference of agreement percentage after increasing from k-1 to k raters.
diff_high: a vector of length k-2, indicating the upper bound of the change of agreement percentage corresponding to “diff_consist” with 95 percent confidence level.
size_case: number of cases n
size_rater: number of raters k
p: the probability of of being rated positive among the proportion of ‘1-p_plus-p_minus’ cases.
p_plus: proportion of the cases rated positive by all raters.
p_minus: proportion of the cases rated negative by all raters.
empirical: a matrix of dimension k-1 by 3, including the empirical estimate of the agreement percentage, and the empirical 95 percent confidence intervals (CI) of the agreement percentage with equal tail probabilities on the two sides. The empirical estimate and CI were calculated by permuting the raters with 1000 random permutations, and using the mean, 2.5th percentile, and 97.5th percentile.
data("sp142_bin") ONEST_main(sp142_bin)
data("sp142_bin") ONEST_main(sp142_bin)
This function is only used to create vignettes. Although it can get the same results from ONEST_main(sp142_bin), it uses some precomputed data to decrease the time to get the results. Therefore, it can only be applied to the sp142_bin dataset. Please use ONEST_main instead.
ONEST_vignettes(data,empirical)
ONEST_vignettes(data,empirical)
data |
a matrix containing the binary pathology data. Each row is the data from one case, and each column is the data from one rater. Missing values are allowed and can be denoted as NA or left blank. If there are n cases and k raters, the input ‘data’ is a matrix with dimension n by k. |
empirical |
a matrix containing data used to plot the empirical confidence intervals for the sp142_bin dataset. |
consist_p: a vector of length k-1, indicating proportion of identical reads among a set of pathologists. For example, the first element of “consist_p” is the estimate of agreement percentage for 2 raters. The k-1 th element is the estimate of agreement percentage for k raters.
consist_low: a vector of length k-1, indicating the lower bound of the agreement percentage with 95 percent confidence level corresponding to “consist_p”.
diff_consist: a vector of length k-2, indicating the difference between the consist_p. For example, the first element of “diff_consist” is the estimated difference of agreement percentage after increasing from 2 to 3 raters. The k-2 th element is the difference of agreement percentage after increasing from k-1 to k raters.
diff_high: a vector of length k-2, indicating the upper bound of the change of agreement percentage corresponding to “diff_consist” with 95 percent confidence level.
size_case: number of cases n
size_rater: number of raters k
p: the probability of of being rated positive among the proportion of ‘1-p_plus-p_minus’ cases.
p_plus: proportion of the cases rated positive by all raters.
p_minus: proportion of the cases rated negative by all raters.
empirical: a matrix of dimension k-1 by 3, including the empirical estimate of the agreement percentage, and the empirical 95 percent confidence intervals (CI) of the agreement percentage with equal tail probabilities on the two sides. The empirical estimate and CI were calculated by permuting the raters with 1000 random permutations, and using the mean, 2.5th percentile, and 97.5th percentile.
data('sp142_bin') data('empirical') ONEST_vignettes(sp142_bin,empirical)
data('sp142_bin') data('empirical') ONEST_vignettes(sp142_bin,empirical)
This function uses the binary (0/1) pathology data to plot the percentage of agreement.
plotline(path,indi=1,color="red")
plotline(path,indi=1,color="red")
path |
pathology data as a matrix (missing values are allowed) |
indi |
an indicator; if indi=0: then do not make plot; indi=1: then make plot, original scale. Labels on the x and y-axis are on the original scale |
color |
the color used to draw pictures |
A matrix with percentage of agreement from the original scale
data("sp142_bin") plotline(sp142_bin,1,"red")
data("sp142_bin") plotline(sp142_bin,1,"red")
A pathology reads dataset of triple negative breast cancer in Reisenbichler et al. (2020). A value of 0 means negative and a value of 1 means positive.
data('sp142_bin')
data('sp142_bin')
columns : different raters
rows : different cases
[1] Reisenbichler ES, Han G, Bellizzi A, Bossuyt V, Brock J, Cole K, et al. Prospective multi-institutional evaluation of pathologist assessment of PD-L1 assays for patient selection in triple negative breast cancer. Mod Pathol. 2020, DOI: 10.1038/s41379-020-0544-x; PMID: 32300181.
A pathology reads dataset of triple negative breast cancer in Reisenbichler et al. (2020). A value of 0 means negative and a value of 1 means positive.
data('sp263_bin')
data('sp263_bin')
columns : different raters
rows : different cases
[1] Reisenbichler ES, Han G, Bellizzi A, Bossuyt V, Brock J, Cole K, et al. Prospective multi-institutional evaluation of pathologist assessment of PD-L1 assays for patient selection in triple negative breast cancer. Mod Pathol. 2020, DOI: 10.1038/s41379-020-0544-x; PMID: 32300181.