-
Notifications
You must be signed in to change notification settings - Fork 12
/
First look.Rmd
82 lines (59 loc) · 2.29 KB
/
First look.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
---
title: "Gun.io First Look"
author: "Lucy D’Agostino McGowan"
date: "June 13, 2016"
output: html_document
---
```{r}
library(tm)
library(SnowballC)
library(wordcloud)
```
Pull in lead data from close.io
```{r}
dat<-read.csv("~/Documents/gunio/Gun.io leads 2016-06-13 16-16.csv")
```
Pull in LOE data
```{r}
loe<-read.csv("~/Documents/gunio/signed_LOE.csv")
LOE<-dat[dat$primary_contact_primary_email %in% loe$Email | (dat$name %in% loe$Description & dat$name != "") | (dat$display_name %in% loe$Description & dat$display_name != "") ,]
dat[dat$name %in% loe$Description & dat$name != "","name"]
#check who is not in there
loe[!(loe$Description %in% LOE$primary_contact_name | loe$Email %in% LOE$primary_contact_primary_email),]
#delitor -- Glenn Devitt ,
#Accudraw -- should be tim meyer
#wonderthings, redishade, zaapt, not in Close
LOE<-dat[dat$primary_contact_primary_email %in% loe$Email | (dat$name %in% loe$Description & dat$name != "") | (dat$display_name %in% loe$Description & dat$display_name != "") | dat$display_name %in% c("Tim Meyer","Glenn Devitt") ,]
#create indicator for LOE
dat$loe<-ifelse(dat$id %in% LOE$id,1,0)
#create times communicated before loe
dat<-merge(dat,loe[,c("Email","Created..UTC.")],by.x="primary_contact_primary_email",by.y="Email",all.x=T)
#eek not possible because we don't have dates...
```
Just look at primary email
```{r}
email<-dat[,c("id","display_name","name","status_label","date_created","primary_contact_primary_email","times_communicated","primary_opportunity_value")]
#subset to 2016
email$date<-as.Date(gsub(" .*$","",email$date_created))
email$year_created<-as.numeric(gsub("-.*$","",email$date_created))
email<-email[email$year==2016,]
#check which are admins, etc
email$email_name<-gsub( "@.*$", "", email$primary_contact_primary_email )
#word cloud
emailCorpus <- Corpus(VectorSource(email$email_name))
emailCorpus <- tm_map(emailCorpus, PlainTextDocument)
wordcloud(emailCorpus,min.freq=2)
```
Big general words:
- info
- hello
- admin
- jobs
- team
```{r}
#generic email name category
email$email_cat<-ifelse(email$email_name=="info" | email$email_name=="hello" | email$email_name=="admin" | email$email_name=="jobs"|email$email_name=="team",1,0)
#categorize times communicated >1 <1
email$comm_cat<-ifelse(email$times_communicated<=1,1,0)
```
Dead end