This repository contains R code executed on various datasets in RStudio. I hope it is helpful for anyone who wants to build a career in data science or big data. I am a beginner in this field, so please forgive any silly mistakes. Suggestions by email for improving the analysis are always welcome. E-mail: mandarmakhi007@gmail.com
You will need RStudio to run the code, so install it first and then go through the scripts below. To download RStudio, click here.
To begin with the basics of data science, go through the Practice(Basics) folder in the repository.
1. Basics practice.r
2. Confidence Interval Confidence_Interval.r
3. Probability Probability.r
Next, we will do descriptive statistics analysis, also known as exploratory data analysis (EDA).
1. Carbon Dioxide (CO2) Descriptive_Stats_CO2.r
2. Air Quality Descriptive_Stats_airquality.r
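As a rough illustration of what a descriptive statistics pass can look like, here is a minimal sketch on the `airquality` dataset that ships with R. It is not the content of `Descriptive_Stats_airquality.r`; the choice of the `Ozone` column and the variable names `na_counts`, `ozone_mean`, and `ozone_sd` are assumptions made for the example.

```r
# airquality is a built-in dataset, so no file download is needed
data(airquality)

# Structure: number of rows, column names and column types
str(airquality)

# Min, quartiles, mean, max and NA count for every column
summary(airquality)

# Number of missing values per column
na_counts <- colSums(is.na(airquality))
print(na_counts)

# Centre and spread of Ozone, ignoring the missing readings
ozone_mean <- mean(airquality$Ozone, na.rm = TRUE)
ozone_sd <- sd(airquality$Ozone, na.rm = TRUE)
cat("Ozone mean:", ozone_mean, " sd:", ozone_sd, "\n")

# Distribution of Ozone by month
boxplot(Ozone ~ Month, data = airquality,
        main = "Ozone by month", xlab = "Month", ylab = "Ozone")
```

Checking the missing-value counts first matters here because functions such as `mean()` return `NA` unless `na.rm = TRUE` is set.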
Now let's go through various algorithms.
- Hypothesis Testing Hypothesis Testing.r
1. Newspaper Data NewspaperData.CSV Newspaper_LinearRegression.r
2. Waist Circumference-Adipose Tissue WC-AT.csv WC-AT_LinearRegression.r
1. Cars Cars.csv Cars_Multi_Linear_Regression.r
2. Corolla Toyota_Corolla.csv Toyota_Multi_Linear_Regression.r
- Claimants Claimants.csv Logistic Regression.r
1. Titanic Titanic.csv Titanic_Association_Rule.r
1. Cat Cat.jpg Example1_PCA.r
2. University Universities.csv Universities_PCA.r
1. Universities Univesities.csv Universities_Heirarchical_Clustering.r K-Means_Clustering.r
1. Unemployment Survival_Unemployment.csv Survival_Unemployment.r
- Cancer KNN.csv K-Nearest_Neighbour.r
- Iris Available in R Datasets random_forest.r
- Concrete concrete.csv Concrete_Neural_Network.r
- Letter Data LetterData.csv LetterData_Support_Vector_Machine.r
- SMS Spam sms_spam.csv Naive_Bayes_Sms_Spam.r
- Amtrak Amtrak.csv | Predict_new.xlsx | Amtrak_Forecasting.r
- Aviation Aviation.csv Aviation_Exponential_Smooting_Forecasting.r
We require lists of positive words and negative words for the analysis.
- Emotion Mining Amazon Nokia Lumia Reviews.txt Emotion_Mining_Amazon.r
- Sentiment Analysis McD_Small.csv Sentiment Analysis_McD.r
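To show why positive and negative word lists are needed, here is a minimal sketch of lexicon-based scoring in base R. It is not the method used in the scripts above. The short word lists, the sample reviews, and the helper `score_sentiment()` are all hypothetical; in practice you would load the full word lists from files.

```r
# Hypothetical, tiny word lists; replace with the full lexicon files
positive_words <- c("good", "great", "love", "excellent")
negative_words <- c("bad", "poor", "hate", "terrible")

# Score = number of positive words minus number of negative words
score_sentiment <- function(text, pos, neg) {
  # Lower-case the text and replace everything except letters and spaces
  cleaned <- gsub("[^a-z ]", " ", tolower(text))
  # Split on whitespace and drop empty tokens
  words <- strsplit(cleaned, "\\s+")[[1]]
  words <- words[words != ""]
  sum(words %in% pos) - sum(words %in% neg)
}

# Hypothetical sample reviews
reviews <- c("Great phone, I love the camera",
             "Terrible battery and poor screen",
             "It is a phone")

scores <- vapply(reviews, score_sentiment, numeric(1),
                 pos = positive_words, neg = negative_words,
                 USE.NAMES = FALSE)
print(scores)
```

A score above zero suggests a positive review, below zero a negative one, and zero a neutral or unmatched one. This simple count ignores negation ("not good"), so treat it as a starting point only.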
If you want to extract the reviews of a particular product from Amazon, run the code below in RStudio.
This code is valid only for products on Amazon.
The code varies from site to site.
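```r
# Install the packages once; comment these lines out on later runs
install.packages("rvest")
install.packages("XML")
install.packages("magrittr")

library(rvest)
library(XML)
library(magrittr)

# Amazon Reviews #############################
# Paste the product reviews URL here. It should end with the name of the
# page-number query parameter, because "=<page number>" is appended below.
aurl <- "URL of Product Reviews page"
amazon_reviews <- NULL

# Collect the review text from the first 10 review pages
for (i in 1:10) {
  page <- read_html(paste(aurl, i, sep = "="))
  # Named page_reviews so that base::rev() is not masked
  page_reviews <- page %>%
    html_nodes(".review-text") %>%
    html_text()
  amazon_reviews <- c(amazon_reviews, page_reviews)
}

length(amazon_reviews)  # number of reviews collected
write.table(amazon_reviews, "apple.txt", row.names = FALSE)
```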
I have run this code to extract reviews of the Apple MacBook Air; do check it out.
- Buyer Ratio .pptx BuyerRatio.csv BuyerRatio.r
- Customer Order Form .pptx Customer+OrderForm.csv Customer+OrderForm.r
- Cutlet Diameter .pptx Cutlets.csv Cutlet_Hyp_Test.r
- Fantaloons .pptx Fantaloons.csv Fantaloons.r
- Calories Consumed .txt Calories_Consumed.csv Calories_Simple_Linear.r
- Delivery Time Data .txt Delivery_Time.csv Delivery_Simple_Linear_Regression.r
- Employee Data .txt Emp_Data.csv Emp_Simple_Linear.r
- Salary Data .txt Salary_Data.csv Salary_Simple_Linear.r
- 50 Startup .txt 50_Startups.csv 50_Startup_Multi_Linear.r
- Computer Data .txt Computer_Data.csv Computer_Data_Multi_Linear.r
- Toyota Corolla .txt ToyotaCorolla.csv ToyotaCorolla_Multi_Linear.r
- Credit Card .txt Creditcard.csv Creditcard_Logistic_regression.r
- Groceries [.txt](https://github.com/mandarmakhi/DataScience-R-code/blob/master/2.%20Algorithms%20on%20Datasets/Association%20Rule/groceries/Problem_Statment.txt) Groceries.csv Groceries.r
- Movies .txt My_Movies.csv My_Movies.r
- Crime Data .txt Crime_Data.csv Crime_Data_Clustering.r
- East West Airlines .txt EastWestAirlines.xlsx EastWestAirlines_Cluster.r
- Wine .txt Wine.csv Wine_PCA.r
- Company Data .txt Company_Data.csv Company_Data.r
- Fraud Check .txt Fraud_Check.csv Fraud_Check.r
- Company Data .txt Company_Data.csv Company_Data.r
- Fraud Check .txt Fraud_Check.csv Fraud_Check.r
- 50 Startups 50_Startups.csv 50_Startups.r
- Concrete Concrete.csv Concrete.r
- ForestFires Forestfires.csv Forestfires.r
- Forest Fires .txt Forestfires.csv Forestfires.r
- Salary Data .txt Salary_Data_Train.csv, Salary_Data_Test.csv SalaryData.r
- Salary_Data .txt SalaryData_Train.csv, SalaryData_Test.csv SalaryData.r
- Sms Data .txt Sms_Raw_NB.csv Sms_Raw_NB.r
- Airlines Data .txt Airlines+Data.xlsx Airlines+Data.r
- Coca Cola Sales .txt CocaCola_Sales_Rawdata.xlsx CocaCola_Sales_Rawdata.r
- Plastic Sales .txt PlasticSales.csv PlasticSales.r
You require lists of positive words, negative words, and stop words for this analysis.
- Amazon iPhone Review .txt iphone Reviews.txt Amazon_iphone_Reviews.r
- IMDb Money Heist Web Series Review .txt Money heist_Reviews.txt Money Heist.r
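The stop-word list mentioned above is used to drop very common words before counting or scoring. Here is a minimal sketch of that step in base R, assuming a hypothetical short stop-word list and two made-up reviews; it is not taken from the scripts listed here.

```r
# Hypothetical, tiny stop-word list; replace with the full stop-word file
stop_words <- c("the", "is", "a", "and", "it", "i", "this")

# Hypothetical sample reviews
reviews <- c("The phone is great and the camera is great",
             "This phone is a bad deal")

# Lower-case, keep only letters and spaces, then split into words
cleaned <- gsub("[^a-z ]", " ", tolower(reviews))
tokens <- unlist(strsplit(cleaned, "\\s+"))

# Drop empty tokens and stop words
tokens <- tokens[tokens != "" & !(tokens %in% stop_words)]

# Word frequencies, most frequent first
word_freq <- sort(table(tokens), decreasing = TRUE)
print(word_freq)
```

Without this filter, words such as "the" and "is" would dominate the frequency table and hide the words that carry the opinion.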