Sai Charan Thotapalli
01/25/2015
Data description
First, is need to know the amount of information this analysis will involve, in this section a general review of data.
Number of rows, this mean the number of observations to be analysed. 21,061 observations are found.
## [1] 21061
Number of columns, this mean the number of variables to be analysed.
## [1] 12
The original names of the variables.
## [1]
## [4]
## [7]
## [10]
"day"
"platform"
"orders"
"add_to_cart"
"site"
"visits"
"gross_sales"
"product_page_views"
"new_customer"
"distinct_sessions"
"bounces"
"search_page_views"
Data dictionary.
Data dictionary is a set of information to explain the variables that are up to be analysed.
Where variables can be explained.
•
•
•
•
•
•
•
•
•
•
•
•
day | The calendar day. site | Company site visited by users. new_customer | 0 = returning customer; 1 = new customer; null = neither platform | The type of device used by a website visitor visits | The number of distinct website visits; 1 session may have multiple visits distinct_sessions | The number of distinct website visitors; 1 session may have multiple visits orders | The number of website orders gross_sales | The total gross sales for website orders bounces | The number of visits that only viewed one page add_to_cart | The number of visits that added a product to cart product_page_views | The number of product pages viewed search_page_views 1 The number of search pages viewed
Exploratory data analysis
First we explore relevant data, company site visited by users is described by next table:
##
##
Acme
7392
Botly Pinnacle
804
5725
Sortly
5532
Tabular Widgetry
804
804
Acme site, result to be the more visited followed by Pinnacle and Sortly.
In platforms according to next table, the most visitors users use iOS, followed by Android devices and Windows systems. In following figure the missing platform is cause of missing data of databe origin.
##
##
##
##
##
##
410