Recognising Activity Labeling Styles in Business Process Models
Abstract

Quality assurance is a serious issue for large-scale process modelling initiatives. While formal control flow analysis has been studied extensively in prior research, there is little work on how the textual content of a process model, in particular its activity labels, can be analysed systematically. In this context, it is a major challenge to assess label quality systematically and, consequently, to assure that it is high. As many large process model collections contain more than a thousand models, each including several activity labels, there is a strong need for automatic detection of labels that might be of bad quality. Recent research has shown that different grammatical styles correlate with the potential ambiguity of a label. In this paper, we propose an algorithm for the recognition of activity labeling styles. The algorithm exploits natural language processing techniques, e.g., part-of-speech tagging and analysis of the grammatical structure. We also study how lexical resources such as WordNet can support the solution. We conduct a thorough evaluation of the developed techniques using about 6,000 activity labels from the SAP Reference Model. The evaluation shows that spurious labels can be identified with a significant level of precision and recall. In this way, our approach can serve as a means of quality assurance for process repository management by listing bad-quality labels, which a human modeler should then correct.
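To make the idea of labeling-style recognition concrete, the following is a minimal sketch and not the algorithm proposed in the paper. It assumes two styles commonly distinguished in the process modelling literature, verb-object ("Create invoice") and action-noun ("Invoice creation"), and it replaces a real part-of-speech tagger and WordNet with two small hand-written word lists. The names `VERBS`, `ACTION_NOUNS` and `classify_label` are hypothetical and chosen only for this illustration.

```python
# Toy stand-in for a part-of-speech tagger: a few imperative verbs (assumption).
VERBS = {"create", "check", "send", "approve", "process"}

# Toy stand-in for a lexical resource such as WordNet: nouns that
# denote an action (assumption, not taken from the paper).
ACTION_NOUNS = {"creation", "verification", "approval", "processing"}


def classify_label(label: str) -> str:
    """Return a coarse labeling style for an activity label.

    The rules are deliberately simplistic:
    - first word is a known verb        -> "verb-object"
    - first or last word is a known
      action noun                       -> "action-noun"
    - anything else                     -> "unclassified"
    """
    words = label.lower().split()
    if not words:
        return "unclassified"
    if words[0] in VERBS:
        return "verb-object"
    if words[0] in ACTION_NOUNS or words[-1] in ACTION_NOUNS:
        return "action-noun"
    return "unclassified"


if __name__ == "__main__":
    for example in ["Create invoice", "Invoice verification", "Invoice"]:
        print(example, "->", classify_label(example))
```

In a realistic setting the word lists would be replaced by a tagger and a lexical database, since short labels such as "Process order" are ambiguous between a verb and a noun reading; the sketch resolves this case only by checking the verb list first.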
Authors who publish with this journal agree to the following terms: Authors retain copyright and grant the journal 'Enterprise Modelling and Information Systems Architectures - International Journal of Conceptual Modeling' and the Gesellschaft für Informatik e.V. (GI) the permission of first publication, and the non-exclusive, irrevocable and non-time limited publication permission for the submitted work including the permissions to store, copy, distribute and reproduce their work in printed and electronic form for the duration of the legal copyright. This includes the right of translation. Authors grant the journal 'Enterprise Modelling and Information Systems Architectures - International Journal of Conceptual Modeling' and the Gesellschaft für Informatik e.V. (GI) the permission to license their work under a Creative Commons BY-SA 4.0 license that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book) given an acknowledgement of its initial publication in this journal.