Publication Classification Dataset


This classification dataset contains 383 scientific publications from AAN manually classified into 31 research areas using session information. The session information was compiled using the session information from ACL, COLING and EMNLP. We have manually annotated all the publications from ACL 2005-08.

Here is a complete README which explains the selection process for the publications, sessions, the annotation process and the format of the different files.

Click here to download this data set.

Back to Datasets