RoyN8586799Tutor: Jenny Houtsma | [BSB123 - Data analysis research report] | Analyzing the relationships between different variables in relation to one year returns within the superannuation industry. | Contents 1.0 Introduction 2 2.0 Outliers 3 3.0 Historical Analysis 4 4.0 Current Data (One Variable Analysis 5 5.0 Bivariate and Trivariate Analysis 6 5.1 Impact of Investment Strategy on One Year Returns 6 5.2 Impact of Three Year Returns on One Year Returns 8 5.3 Impact
Premium Investment Rate of return
(Batman Begins)‚ and 33.90 (Wedding Crashers). Values that are less than z-score of -3 or larger than z-score of +3 should also be considered outliers‚ so‚ if using only this criteria‚ War of the Worlds (z-score: 3.586)‚ Harry Potter and the Goblet of Fire (z-score: 4.944)‚ and Star Wars: Episode III (z-score: 5.248) would have still been considered outliers. Total Gross Sales As with the opening
Free Median Standard deviation Mode
Christine Marie Florece Date Performed: June 20‚ 2013 Karmelie Jane Monaya Date Submitted: June 27‚ 2013 Kean Gerard Sumayo Experiment 1 APPLICATION OF STATISTICAL CONCEPTS IN THE DETERMINATION OF WEIGHT VARIATION IN SAMPLES I. OBJECTIVES 1. To determine the use of the different statistical concepts 2. To perform the proper applications of the statistical methods/ concepts on determining the weight variations of samples II. RESULTS and DISCUSSIONS
Premium Statistics Normal distribution
minute before the step test. This was only 2 beats more than the average pulse rate of the female subjects‚ which was 67 beats per minute. The pulse rates of all subjects‚ male or female‚ before the step test were about the same‚ excluding a couple of outliers‚ as seen in Figure 1. The average pulse rate of the male subjects after the step test was 93 beats per minute‚ which was significantly lower than the average pulse rate of the female subjects after the step test‚ which was 100 beats per minute
Free Heart rate
formula. Evaluate the coefficient of determination. 4. Rerun the regression‚ and drop the point (20‚000‚ $26‚000) as an outlier. Compare the results from this regression to those for the regression in Requirement 3. Which is better? SOLUTION: 1. [pic] The overall relationship looks reasonably linear—although the data point for the first quarter may be an outlier. 2. Using the high-low method: Variable power cost = [pic] = $1.13 (rounded) Fixed power cost = $42‚500
Premium Costs Regression analysis Polynomial
Max: 5678 The histogram above shows the Credit Balance variable of the 50 customers surveyed. The histogram is almost symmetrical with one outlier which is the credit balance of $2‚000. While it being symmetrical you can almost fold the y-axis in half to have it look the same. While observing the histogram‚ its skewed to the left because of the outlier‚ and the skew is -.015043. Using the Anderson-Darling Normality Test‚ the P-value for Credit Balance is 0.400‚ and A^2 is 0.38. Throughout the
Premium Standard deviation Normal distribution Median
data. Data mining is practiced on static data collection‚ called ‘DATA WAREHOUSE’‚ rather than ‘online’ databases which keep on updating. 8 FORMS OF DATA MINING CLASS DESCRIPTION ASSOCIATION ANAYLSIS CLASS DISCRIMINATION OUTLIER ANALYSIS CLUSTER ANALYSIS CLUSTER ANALYSIS 9 1) CLASS DESCRIPTION: Class description deals with identifying properties that characterize a given group of data items‚ whereas class discrimination deals with identifying
Premium Data mining Data analysis
Figure 4 from the paper displays the results from the automated monkey Movement-Assessment Panel (mMAP) testing system. In this experiment‚ the motor performance times of monkey’s retrieving small food items were tested using an automated mMAP. There were 12 trials conducted with a 30 second delay between each trial. There were three different levels of difficulty: platform task was the lowest in difficulty‚ the straight rod task was moderate in difficulty‚ and the q-mark task was the most difficult
Premium Psychology Operant conditioning Behaviorism
data in a scatterplot to determine if there is a possible linear relationship. Compute and interpret the linear correlation coefficient‚ r. Determine the regression equation for the data. Graph the regression equation and the data points. Identify outliers and potential influential observations. Compute and interpret the coefficient of determination‚ r2. Obtain the residuals and create a residual plot. Decide whether it is reasonable to consider that the assumptions for regression analysis are met
Premium Regression analysis Errors and residuals in statistics Linear regression
business‚ engineering and computer science. • It is unique - there is only one answer. • Useful when comparing sets of data. Disadvantages: Outliers can change the mean a lot... making it much lower/higher than it should be. Affected by extreme values (outliers) Median: Advantages: Finds the middle number of a set of data‚ so outliers have little or no effect. Disadvantages: If the gap between some numbers is large‚ while it is small between other numbers in the data‚ this can cause
Premium Arithmetic mean Mode Average