Determining an inter-rater agreement metric for researchers evaluating student pathways in problem solving