Semi-Automated Analysis of Large Privacy Policy Corpora

dc.contributor.authorDima, Alden
dc.contributor.authorMassey, Aaron
dc.date.accessioned2020-12-24T19:57:57Z
dc.date.available2020-12-24T19:57:57Z
dc.date.issued2021-01-05
dc.description.abstractRegulators, policy makers, and consumers are interested in proactively identifying services with acceptable or compliant data use policies, privacy policies, and terms of service. Academic requirements engineering researchers and legal scholars have developed qualitative, manual approaches to conducting requirements analysis of policy documents to identify concerns and compare services against preferences or standards. In this research, we develop and present an approach to conducting large-scale, qualitative, prospective analyses of policy documents with respect to the wide-variety of normative concerns found in policy documents. Our approach uses techniques from natural language processing, including topic modeling and summarization. We evaluate our approach in an exploratory case study that attempts to replicate a manual legal analysis of roughly 200 privacy policies from seven domains in a semi-automated fashion at a larger scale. Our findings suggest that this approach is promising for some concerns.
dc.format.extent10 pages
dc.identifier.doi10.24251/HICSS.2021.563
dc.identifier.isbn978-0-9981331-4-0
dc.identifier.urihttp://hdl.handle.net/10125/71180
dc.language.isoEnglish
dc.relation.ispartofProceedings of the 54th Hawaii International Conference on System Sciences
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subjectPrivacy and Economics
dc.subjectnatural language processing
dc.subjectprivacy policy analysis.
dc.subjectrequirements analysis
dc.titleSemi-Automated Analysis of Large Privacy Policy Corpora
prism.startingpage4641

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
0456.pdf
Size:
924.99 KB
Format:
Adobe Portable Document Format