TABLE 5

Pairwise error rate by human review.

Category
(# of samples)
CC
(50)
UU
(50)
CU
(100)
UC
(100)
PSERPLERError rate
(splitting + lumping)
Authority 20092%6%57%46%9.3%12.5%11.9% = 1.8% + 10.1%
Our clustering2%6%43%54%23.1%3.2%9.9% = 7.7% + 2.2%

PrecisionRecallF-score

Authority 200987.5%97.5%92.2%
Our clustering96.8%89.3%92.9%
-