- Fixed bug where
lexicon_nrc_vad()
didn't have column names. (#53)
- Update path to correctly path source for NRC lexicon.
- Fixed documentation to be HTML5 friendly.
cache_info()
function has to added to allow for quick overview of cacheing size.- Update download url for
lexicon_nrc()
.
embedding_glove6b()
,embedding_glove27b()
,embedding_glove42b()
, andembedding_glove840b()
have been added to give access to the Stanford NLP Global Vectors for Word Representations pre-trained word vectors (@jonthegeek, #26).manual_download
argument have been added to all functions to allow the user to manual place file download at right place.
lexicon_nrc_eil()
has been added to give access to the NRC Emotion Intensity Lexicon (aka Affect Intensity Lexicon) v0.5.lexicon_nrc_vad()
has been added to give access to the The NRC Valence, Arousal, and Dominance Lexicon.- The argument
clean
have been added to all functions to allow deletion of intermediate files. - An optional information prompt is implemented. This will be turned off by default and turned on by original authors request.
dataset_nrc()
got improved url for faster and more reliable downloads.
dataset_imdb()
has been added to give access to the IMDb Large Movie Review Dataset.dataset_trec()
has been added to give access to the TREC-6 and TREC-50 classification datasets.dataset_dbpedia()
has been added to give access to DBpedia Ontology classification dataset.dataset_ag_news()
has been added to give access to AG's News Topic classification dataset.- Functions will now notify the user about the download mechanism used to download the data. http/https etc. (#12).
lexicon_nrc()
has been added to give access to the NRC Emotion lexicon (@juliasilge, #11).