Skip to content

Latest commit

 

History

History
14 lines (8 loc) · 1.15 KB

README.md

File metadata and controls

14 lines (8 loc) · 1.15 KB

GermanAnglicisms

The file "Anglicism_Matchlist_July18.txt" is a list of potential German-language infinitives and past participles generated from English-language verbs.

The files "GermanTweets_aa" to "GermanTweets_af" should be joined to create a compressed list of the Tweet IDs of 36,240,530 German-language tweets collected from Twitter's APIs in April 2018.

To join the files, download them to an empty directory and (in Linux) run cat German* > GermanTweets.tar.gz. Decompress the resulting file to get the large list of Tweet IDs.

To download the tweets from Twitter, feed in the Tweet IDs using Tweepy, Twarc, or a similar tool.

For more information, please see this paper, and if you make use of these resources, please cite it:

Coats, Steven. (2018). Variation of new German verbal Anglicisms in a social media corpus. In Reinhild Vandekerckhove, Darja Fišer and Lisa Hilte (eds.), Proceedings of the 6th Conference on CMC and Social Media Corpora for the Humanities, 27–32. Antwerp, Belgium: University of Antwerp.