Non-negative Matrix Factorization via Alternating Updates

Yingyu Liang, Princeton University

Joint work with Andrej Risteski and Yuanzhi Li

Simons workshop, Berkeley, Nov 2016



Non-convex Problems in ML

• Dictionary learning

• Matrix completion

• Deep learning

NP-hard to solve in the worst case


• In practice, often solved by “local improvement”

• Gradient descent and variants

• Alternating update

Goal: provable guarantees of simple algos under natural assumptions

When and why do such simple algos work for the hard problems?

This work: alternating update for Non-negative Matrix Factorization


Non-negative Matrix Factorization (NMF)

• Given: matrix $Y$

• Find: non-negative matrices $A$ and $X$ such that $Y \approx AX$
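
As a concrete illustration of the problem statement, the minimal NumPy sketch below builds a non-negative data matrix from two non-negative factors and measures how well a candidate factorization reconstructs it. The names `A_true`, `X_true` and `relative_error`, as well as the sizes, are assumptions chosen for illustration and are not notation from the slides.

```python
import numpy as np

def relative_error(Y, A, X):
    """Relative Frobenius error of the approximation Y ~ A @ X."""
    return np.linalg.norm(Y - A @ X) / np.linalg.norm(Y)

rng = np.random.default_rng(0)
m, n, N = 6, 3, 10           # hypothetical sizes: rows, inner rank, columns
A_true = rng.random((m, n))  # non-negative factor (entries in [0, 1))
X_true = rng.random((n, N))  # non-negative factor (entries in [0, 1))
Y = A_true @ X_true          # data matrix to be factorized

# The generating factors reconstruct Y up to floating-point error.
exact_error = relative_error(Y, A_true, X_true)
# A random non-negative guess of the same shape generally does not.
guess_error = relative_error(Y, rng.random((m, n)), rng.random((n, N)))
```

The NMF task is the reverse direction: only `Y` is observed, and the factors have to be recovered.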


NMF in ML Applications

Basic tool in machine learning

• Topic modeling [Blei-Ng-Jordan03, Arora-Ge-Kannan-Moitra12, Arora-Ge-Moitra12,…]

[Figure: a word-by-document matrix (rows: the, soccer, physics, rain, ……; columns: Doc1, Doc2, Doc3, ……) factorized into a word-by-topic matrix (columns: Weather, Sport, Science, ……) and a topic-by-document matrix]


• Computer vision [Lee-Seung97, Lee-Seung99, Buchsbaum-Bloch02,…]

• Many others: network analysis, information retrieval, ……


Worst Case vs. Practical Heuristic

Worst case analysis [Arora-Ge-Kannan-Moitra12]

• Upper bound:

• Lower bound: no algo, assuming ETH

Alternating updates: typical heuristic, suggested by Lee-Seung
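
For readers unfamiliar with the heuristic, one widely used instance is the multiplicative update rule for the squared Frobenius objective, commonly attributed to Lee and Seung. The sketch below is a minimal version of that classic rule, not the algorithm analyzed in this talk; the function name `multiplicative_nmf`, the iteration count and the small constant `eps` are assumptions made for illustration.

```python
import numpy as np

def multiplicative_nmf(Y, rank, iters=200, eps=1e-10, seed=0):
    """Alternate multiplicative updates of X and A to reduce ||Y - A X||_F."""
    rng = np.random.default_rng(seed)
    m, N = Y.shape
    A = rng.random((m, rank)) + eps
    X = rng.random((rank, N)) + eps
    errors = []
    for _ in range(iters):
        # Update X with A fixed, then A with X fixed. Every factor in the
        # ratios is non-negative, so A and X stay non-negative throughout.
        X *= (A.T @ Y) / (A.T @ A @ X + eps)
        A *= (Y @ X.T) / (A @ X @ X.T + eps)
        errors.append(np.linalg.norm(Y - A @ X))
    return A, X, errors

rng = np.random.default_rng(1)
Y = rng.random((8, 3)) @ rng.random((3, 20))  # synthetic non-negative data
A, X, errors = multiplicative_nmf(Y, rank=3)
```

The rule needs no step size and preserves non-negativity automatically, which is part of why it is popular in practice even though it carries no worst-case recovery guarantee.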


Analyzing Non-convex Problems

• Generative model: the input data is generated from (a distribution defined by) a ground-truth solution

• Warm start: good initialization not far away from the ground-truth

[Figure: a warm start followed by updates]


Beyond Worst Case for NMF

• Separability-based assumptions [Arora-Ge-Kannan-Moitra12]

• Motivated by topic modeling: each column of the feature matrix (a topic) has an anchor word

• Lots of subsequent work [Arora-Ge-Moitra12, Arora-Ge-Halpern-Mimno-Moitra-Sontag-Wu-Zhu12, Gillis-Vavasis14, Ge-Zhou15, Bhattacharyya-Goyal-Kannan-Pani16, …]

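[Figure: a word-by-topic matrix with columns Weather, Sport, Science and rows the, soccer, physics, rain, ……; sample rows include (0.05, 0.05, 0.05) and (0.2, 0, 0), the latter being non-zero in a single topic]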
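
The anchor-word condition can be checked mechanically: a word is an anchor for a topic if its row in the word-by-topic matrix is non-zero in that topic only. The sketch below is a minimal check under that reading. The helper names `find_anchor_words` and `is_separable` are hypothetical, and the toy matrix reuses a few values visible on the slide with a row and column placement that is assumed.

```python
import numpy as np

def find_anchor_words(A, tol=1e-12):
    """Map each topic (column) to the words (rows) that are non-zero only there."""
    anchors = {k: [] for k in range(A.shape[1])}
    for word, row in enumerate(A):
        support = np.flatnonzero(row > tol)
        if len(support) == 1:
            anchors[int(support[0])].append(word)
    return anchors

def is_separable(A, tol=1e-12):
    """True if every topic has at least one anchor word."""
    return all(len(words) > 0 for words in find_anchor_words(A, tol).values())

# Rows: the, rain, soccer, physics. Columns: Weather, Sport, Science.
# The placement of the values is an assumption made for this example.
A = np.array([
    [0.05, 0.05, 0.05],  # "the": appears in every topic, not an anchor
    [0.2,  0.0,  0.0],   # "rain": anchor for Weather
    [0.0,  0.3,  0.0],   # "soccer": anchor for Sport
    [0.0,  0.0,  0.1],   # "physics": anchor for Science
])
anchors = find_anchor_words(A)
```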


• Variational inference [Awasthi-Risteski15]

• Alternating update method on objective

• Requires relatively strong assumptions on the feature matrix, and/or a warm start depending on its dynamic range (not realistic)


Outline

• Introduction

• Our model, algorithm and main results

• Analysis of the algorithm


Generative Model

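Each column of the data matrix is an i.i.d. example from the generative model

• (A1): columns of the feature matrix are linearly independent

• (A2): the coordinates of the weight vector are independent random variables, with a distribution governed by a parameter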
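
To make the two assumptions concrete, the sketch below samples synthetic data from one model that satisfies them. The distribution in (A2) did not survive in this transcript, so the Bernoulli weights used here (each coordinate equal to 1 with probability `p`, independently) are purely an assumption for illustration, as are the names `sample_examples` and `A_true` and the chosen sizes.

```python
import numpy as np

def sample_examples(A_true, p, num_examples, rng):
    """Draw columns y = A_true @ x with independent 0/1 coordinates in x.

    The Bernoulli(p) weights are an assumed stand-in for (A2).
    """
    n = A_true.shape[1]
    X = (rng.random((n, num_examples)) < p).astype(float)
    return A_true @ X, X

rng = np.random.default_rng(0)
m, n = 8, 4
A_true = rng.random((m, n))  # hypothetical non-negative ground truth

# (A1): the columns are linearly independent, i.e. full column rank.
rank = np.linalg.matrix_rank(A_true)

Y, X = sample_examples(A_true, p=0.25, num_examples=1000, rng=rng)
```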


Our Algorithm

Parameters:

Known as Rectified Linear Units (ReLU) in Deep Learning
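
The formulas on this slide did not survive extraction, so the following is only a sketch of what a ReLU-style decoding step could look like: apply the pseudo-inverse of the current feature matrix to an example, then zero out every coordinate below a threshold. The function `decode`, the threshold `alpha` and the use of `np.linalg.pinv` are assumptions, not the slide's definition.

```python
import numpy as np

def relu(z):
    """Rectified linear unit: max(z, 0) applied elementwise."""
    return np.maximum(z, 0.0)

def decode(A, y, alpha=0.0):
    """Assumed decoding: pinv(A) @ y, with entries below alpha set to zero.

    With alpha = 0 this reduces to the plain ReLU of pinv(A) @ y.
    """
    z = np.linalg.pinv(A) @ y
    return np.where(z >= alpha, relu(z), 0.0)

rng = np.random.default_rng(0)
A = rng.random((6, 3))         # hypothetical feature matrix, full column rank
x = np.array([1.0, 0.0, 1.0])  # non-negative weights
y = A @ x                      # noiseless example
x_hat = decode(A, y, alpha=0.5)
```

With an exact feature matrix and no noise, the decoding returns the weights that generated the example.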


$y, y'$ are two independent examples, and $x, x'$ are their decodings
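
The update rule itself is missing from this transcript. The sketch below shows one plausible form of an alternating update built from two independent examples and their decodings: average the products $(y - y')(x - x')^\top$ over a batch and blend the result into the current feature matrix. The exact rule, the parameter names `alpha`, `eta` and `r`, and the 0/1 weights used to generate data are all assumptions made for illustration.

```python
import numpy as np

def decode(A, Y, alpha):
    """Thresholded decoding: apply pinv(A), then zero every entry below alpha."""
    Z = np.linalg.pinv(A) @ Y
    return np.where(Z >= alpha, Z, 0.0)

def alternating_update(A, Y1, Y2, alpha, eta, r):
    """One update from two independent batches of examples (assumed form)."""
    X1, X2 = decode(A, Y1, alpha), decode(A, Y2, alpha)
    # Empirical average of (y - y')(x - x')^T over the paired columns.
    delta = (Y1 - Y2) @ (X1 - X2).T / Y1.shape[1]
    return (1.0 - eta) * A + eta * r * delta

rng = np.random.default_rng(0)
m, n, N, p = 10, 5, 20000, 0.3
A_true = rng.random((m, n))
X1 = (rng.random((n, N)) < p).astype(float)  # assumed 0/1 weights
X2 = (rng.random((n, N)) < p).astype(float)
Y1, Y2 = A_true @ X1, A_true @ X2

# For independent 0/1 weights, E[(x - x')(x - x')^T] = 2 p (1 - p) I,
# so this choice of r makes the ground truth an approximate fixed point.
r = 1.0 / (2.0 * p * (1.0 - p))
A_next = alternating_update(A_true, Y1, Y2, alpha=0.5, eta=0.5, r=r)
drift = np.linalg.norm(A_next - A_true) / np.linalg.norm(A_true)
```

Differencing two examples removes the mean of the weights, which is why the averaged product is close to a multiple of the ground-truth matrix when the decodings are accurate.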


Warm Start

• (A3): warm start with error

[Annotations on the formula: aligned with truth; not too much]


Main Result


Analysis: Overview

Show the algo will

1. Maintain and

2. Decrease the potential function


Analysis: Effect of ReLU


Much less noise after applying ReLU
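
A small numerical experiment can illustrate the kind of denoising this slide refers to. The sketch assumes sparse 0/1 weights, additive Gaussian noise and a thresholded ReLU with threshold `alpha`; none of these specifics come from the slides. Noise on the zero coordinates falls below the threshold and is removed, so only the noise on the few active coordinates remains.

```python
import numpy as np

def thresholded_relu(z, alpha):
    """Keep entries that are at least alpha and set the rest to zero."""
    return np.where(z >= alpha, z, 0.0)

rng = np.random.default_rng(0)
n = 1000
x = (rng.random(n) < 0.1).astype(float)   # sparse non-negative signal (assumed)
noisy = x + 0.1 * rng.standard_normal(n)  # zero-mean noise on every coordinate
cleaned = thresholded_relu(noisy, alpha=0.5)

err_before = np.linalg.norm(noisy - x)
err_after = np.linalg.norm(cleaned - x)
```

The effect relies on non-negativity: because the true weights are never negative, coordinates that decode to small or negative values can safely be set to zero.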


Analysis: Change of Error Matrix

• Therefore,


More General Results

1. Each column of the data matrix is generated i.i.d. by the generative model with added noise

• The decoding is

• So can* tolerate large adversarial noise,

• and tolerate zero-mean noise much larger than signal

2. Distribution of the weights only needs to satisfy some moment conditions


Conclusion

• Beyond worst-case analysis of NMF

• Generative model with a mild condition on the feature matrix

• Provable guarantee of alternating update algorithm

• Strong denoising effect by ReLU + non-negativity

Thanks! Q&A