Probability and Statistics Research Project

Better Essays

Probability and Statistics Research Project

Name: Lakeisha M. Henderson
ID: @02181956

Spring 2007

Abstract

Table of Contents

Principle Component Analysis (PCA) Definition.4 Uses of PCA5 Illustrative Example of PCA5 Method to Determine PCA..6 Basic Analysis of Variance (ANOVA) Purpose and Definition of ANOVA12 Illustrative Example of ANOVA.12

Risk Based Design Concepts Definition.15 Predictions and Relation to Risk Based Designs.15

Principle Components Analysis (PCA)

Definition:

Principal Components Analysis is a method that reduces data dimensionality by performing a covariance analysis between factors. As such, it is suitable for data sets in multiple dimensions. It is a way of identifying patterns in data, and expressing the data in such a way as to highlight their similarities and differences. Since patterns in data can be hard to find in data of high dimension, where the luxury of graphical representation is not available, PCA is a powerful tool for analyzing data. The other main advantage of PCA is that once you have found these patterns in the data, and you compress the data, i.e. by reducing the number of dimensions, without much loss of information.
Technically speaking, PCA is an orthogonal linear transformation that transforms the data to a new coordinate system such that the greatest variance by any projection of the data comes to lie on the first coordinate (called the first principal component), the second greatest variance on the second coordinate, and so on. PCA can be used for dimensionality reduction in a data set while retaining those characteristics of the data set that contribute most to its variance, by keeping lower-order principal components and ignoring higher-order ones. Such low-order components often contain the "most

References: DeMuth, James E. Basic Statistics and Statistical Applications. New York: Marcel Dekker Publications, 1999. Dunteman, George H. Principal Component Analysis. Chicago: Sage Publications Inc., 1989. Frantzen, Kurt A. Risk Based Analysis for Environmental Managers. New York: CRC Press, 2002. Iversen, Gudmund R., Norpoth, Helmut. Analysis of Variance. St. Louis: Sage Publications, 1987. Jolliffe, Ian T. Principal Component Analysis. New York: Springer, 2002. Todinov, Michael. Risk-Based Reliability Analysis and Generic Principles for Risk Reduction. Texas: Elsevier, 2006. Yeung & Ruzzo (2001) Principal component analysis for clustering gene expression data. Bioinformatics 17(9): 763-74.

Probability and Statistics Research Project

You May Also Find These Documents Helpful

Bloodstain Pattern Analysis In Criminal Investigation

Bloodstain Pattern Analysis In Criminal Investigation

Statistics Project

Statistics Project

Statistics Project

Statistics Project

WEEK 5 Preparing to Conduct Business Research Part 4

WEEK 5 Preparing to Conduct Business Research Part 4

Hcs/438 Dq's

Hcs/438 Dq's

cluster analysis

cluster analysis

Factor Analysis

Factor Analysis

GROUNDED THEORY DESIGNS IN QUALITATIVE ANALYSIS

GROUNDED THEORY DESIGNS IN QUALITATIVE ANALYSIS

The K-Nearest Neighbors Poem Analysis

The K-Nearest Neighbors Poem Analysis

BN2102 Info

BN2102 Info

Support Vector Machine

Support Vector Machine

Product Life Cycle

Product Life Cycle

ISLR Fourth Printing

ISLR Fourth Printing

Speech Recognition

Speech Recognition

Artificial Neural Network

Artificial Neural Network

Related Topics