Singla, Y. K., Krishna, S., Shah, R. R., & Chen, C. (2022). Using Sampling to Estimate and Improve Performance of Automated Scoring Systems with Guarantees. Proceedings of the AAAI Conference on Artificial Intelligence, 36(11), 12835-12843. https://ojs.aaai.org/index.php/AAAI/article/view/21563