Preview

Sparse Matrix Factorization 1311

Better Essays
Open Document
Open Document
8763 Words
Grammar
Grammar
Plagiarism
Plagiarism
Writing
Writing
Score
Score
Sparse Matrix Factorization 1311
Sparse Matrix Factorization
Behnam Neyshabur1 and Rina Panigrahy2

arXiv:1311.3315v3 [cs.LG] 13 May 2014

1

Toyota Technological Institute at Chicago bneyshabur@ttic.edu 2
Microsoft Research rina@microsoft.com Abstract. We investigate the problem of factoring a matrix into several sparse matrices and propose an algorithm for this under randomness and sparsity assumptions. This problem can be viewed as a simplification of the deep learning problem where finding a factorization corresponds to finding edges in different layers and also values of hidden units. We prove that under certain assumptions on a sparse linear deep network with n nodes in each layer, our algorithm is able to recover the structure of the
˜ 1/6 ). network and values of top layer hidden units for depths up to O(n
We further discuss the relation among sparse matrix factorization, deep learning, sparse recovery and dictionary learning.
Keywords: Sparse Matrix Factorization, Dictionary Learning, Sparse
Encoding, Deep Learning

1

Introduction

In this paper we study the following matrix factorization problem. The sparsity π(X) of a matrix X is the number of non-zero entries in X.
Problem 1 (Sparse Matrix-Factorization). Given an input matrix Y factorize it is as Y = X1 X2 . . . Xs so as minimize the total sparsity si=1 π(Xi ).
The above problem is a simplification of the non-linear version of the problem that is directly related to learning using deep networks.
Problem 2 (Non-linear Sparse Matrix-Factorization). Given matrix Y , minimize si=1 π(Xi ) such that σ(X1 .σ(X2 .σ(. . . Xs ))) = Y where σ(x) is the sign function (+1 if x > 0, −1 if x < 0 and 0 otherwise) and σ applied on a matrix is simply applying the sign function on each entry. Here entries in Y are 0, ±1.
Connection to Deep Learning and Compression: The above problem is related to learning using deep networks (see [3]) that are generalizations of neural networks. They are layered network of nodes connected by edges between
successive



References: 3. Y. Bengio. Learning deep architectures for ai. Foundations and Trends in Machine Learning, 2009. 13. R. Salakhutdinov and G. E. Hinton. Deep boltzmann machines. Journal of Machine Learning Research, 5:448–455, 2009. 15. Li. Wan, Matthew. Zeiler, Sixin. Zhang, Yann. LeCun, and Rob. Fergus. Regularization of neural networks using dropconnect. ICML, 2013. 16. P. M. Wood. Universality and the circular law for sparse random matrices. The Annals of Applied Probability, 22(3):1266–1300, 2012.

You May Also Find These Documents Helpful

  • Good Essays

    Nt1310 Unit 7 Lab Report

    • 493 Words
    • 2 Pages

    The Equations (5.19) to (5.21) denotes the calculation of the outputs in the forward propagation…

    • 493 Words
    • 2 Pages
    Good Essays
  • Satisfactory Essays

    Pt1420 Unit 4

    • 4123 Words
    • 17 Pages

    For obtaining the solution of dual of the following Linear Programming Problem, how many slack and/or surplus, and artificial variables are required?…

    • 4123 Words
    • 17 Pages
    Satisfactory Essays
  • Powerful Essays

    Pt1420 Unit 7 Homework

    • 2430 Words
    • 10 Pages

    From my table of values above, it is clear that the change of sign from negative…

    • 2430 Words
    • 10 Pages
    Powerful Essays
  • Satisfactory Essays

    It 240 Week 2 Appendixb

    • 565 Words
    • 3 Pages

    How would the pieces and components of this network relate to each other? Define all the components of this type of network.…

    • 565 Words
    • 3 Pages
    Satisfactory Essays
  • Satisfactory Essays

    Lan operating systems

    • 404 Words
    • 2 Pages

    How would the pieces and components of this network relate to each other? Define all the components of this type of network.…

    • 404 Words
    • 2 Pages
    Satisfactory Essays
  • Good Essays

    Income Tax

    • 950 Words
    • 11 Pages

    8 Subtract line 7 from line 6. If zero or less, enter -0-. However, if line 7…

    • 950 Words
    • 11 Pages
    Good Essays
  • Better Essays

    In 1990, a novel was written by philosopher Judith Butler titled Gender Trouble. The importance of this novel was evident as it was a very controversial yet interesting analysis of the way we humans look at the topic of gender and sex. She explains throughout the book that our "gender norms" have been created by our ancestors and society. To many, crossing this boundary set by society is very deviant. Eight years after Gender Trouble was written, Disney released a very feminist cartoon movie called Mulan. During this story, the main character, a girl, joins the Chinese army to fight because she doesn't want her dad to get hurt. Girls were not allowed in the Chinese army so Mulan had to hide her identity doing a variety of different things. Many people speculate that Gender Trouble played a role in the creation of Mulan. Judith Butler believes that gender parody and bodily performance of possible alternatives to established gender norms are means for overthrowing these oppressive gender norms. I believe that Mulan uses Butler’s theory to overthrow oppressive gender norms during three specific parts of the movie. The first part in which this occurs is in the beginning when Mulan attempts to impress the women and become an honorable lady. The second part is when Mulan decides to enter the army, and the third part is when she is at the army base and is part of the army. There are numerous examples throughout Mulan that have to do with feminist issues.…

    • 1308 Words
    • 6 Pages
    Better Essays
  • Good Essays

    I then used 4 for the value of n. In this case, the formula was y = x4 and its inverse was x = y1/4.…

    • 2092 Words
    • 9 Pages
    Good Essays
  • Satisfactory Essays

    Math Portfolio

    • 1029 Words
    • 5 Pages

    This means that the function will stay y=xn but here only n will change. The parameters will stay the same a=0 and b=1.…

    • 1029 Words
    • 5 Pages
    Satisfactory Essays
  • Satisfactory Essays

    Asdas

    • 522 Words
    • 3 Pages

    Neutral networks are those which involve in pattern or image recognition. This helps companies to get the required information for processing in second life.…

    • 522 Words
    • 3 Pages
    Satisfactory Essays
  • Good Essays

    • Rough ER: A network is a network of interconnected flattened sacs with two main functions: To make more…

    • 669 Words
    • 3 Pages
    Good Essays
  • Good Essays

    Max 0.61 X3R + 0.73X4R + 0.20X5R + 0.33X1D + 0.31X2D + 0.61X3D + 0.73X4D + 0.20X5D + 0.19Y1 + 0.16Y2 + 0.50Y3 + 0.54Y4…

    • 516 Words
    • 3 Pages
    Good Essays
  • Powerful Essays

    The Development of Empathy

    • 10590 Words
    • 43 Pages

    Carr, L., Iacoboni, M., Dubeau, M. C., Mazziotta, J. C., & Lenzi, G. L. (2003). Neural…

    • 10590 Words
    • 43 Pages
    Powerful Essays
  • Good Essays

    This book provides a comprehensive introduction to the modern study of computer algorithms. It presents many algorithms and covers them in considerable depth, yet makes their design and analysis accessible to all levels of readers. We have tried to keep explanations elementary without sacrificing depth of coverage or mathematical rigor. Each chapter presents an…

    • 242616 Words
    • 971 Pages
    Good Essays
  • Powerful Essays

    Ols introduction

    • 1274 Words
    • 14 Pages

    y  x 1  1  x 2  2 . . . x K  K  …

    • 1274 Words
    • 14 Pages
    Powerful Essays