Topic modelling was used to determine common topics across the wholecorpus. Sixty-five topics were found (of which 60 were used) using theApache Mallet Toolkit Latent Dirichlet Allocation (LDA) algorithm.
The authors used LDA with k=60 across full text case studies. The Apache Mallet implementation was used.