Dataset for paper 'Analysing Semantic Textual Similarity of University Module Catalogue Entries for Linking Modules Comparable in Content'
This dataset is an export of the SQL database backing the Newcastle University module catalogue (Module Catalogue - Global Opportunities - Newcastle University, ncl.ac.uk), as used in the paper 'Analysing Semantic Textual Similarity of University Module Catalogue Entries for Linking Modules Comparable in Content'. The export contains 27 tables, of which modules.csv is used directly in the analysis. modules.csv holds metadata for the module catalogue entries, including module names, semantic descriptions and tabular data. In addition, test_pairs_labelled.txt provides a set of paired module codes with their corresponding semantic similarities.
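As a starting point for exploring modules.csv, the following is a minimal sketch that lists the column names and the first few rows of a CSV file. It uses only the Python standard library. The helper name preview_csv is hypothetical, and the sketch assumes the file has a header row; the actual column names are not stated here, so the code discovers them rather than hard-coding any.

```python
import csv
from itertools import islice


def preview_csv(source, n=5):
    """Return (column_names, first n rows as dicts) from an open CSV file object.

    Hypothetical helper for illustration; assumes the first line is a header row.
    """
    reader = csv.DictReader(source)
    # Read at most n data rows so that large exports are not loaded in full.
    rows = list(islice(reader, n))
    # fieldnames is populated from the header line once reading has started.
    return reader.fieldnames, rows
```

For example, opening the file with open("modules.csv", newline="", encoding="utf-8") and passing the file object to preview_csv returns the header and the first five entries. The UTF-8 encoding is an assumption and may need adjusting for this export.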
The corresponding repository is lukekaye/sts-university-modules (github.com): Analysing Semantic Textual Similarity of university module catalogue entries for linking modules comparable in content.