Proximal Gradient Descent and Proximal Coordinate Descent for the Lasso Problem

Asked Oct 21 '17 at 13:49

Active Dec 04 '19 at 05:07


Why is proximal coordinate descent much less affected by bad conditioning than proximal gradient descent?

For example, consider the Lasso problem $\min_x \frac{1}{2}\|Ax-b\|^2_2 + \lambda\|x\|_1$, where $A$ has columns $a_1, \dots, a_n$.
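
For reference, here is a sketch of the two standard updates for this objective. The symbols $S_\tau$, $L$, $L_j$ and $a_j$ (the $j$-th column of $A$) are notation introduced here, not part of the original question, and the step sizes shown are the usual textbook choices rather than the only possible ones.

$$S_\tau(z) = \operatorname{sign}(z)\,\max(|z| - \tau,\, 0)$$

$$\text{Proximal gradient:}\quad x^{k+1} = S_{\lambda/L}\!\left(x^k - \tfrac{1}{L} A^\top (A x^k - b)\right), \qquad L = \lambda_{\max}(A^\top A)$$

$$\text{Proximal coordinate descent:}\quad x_j \leftarrow S_{\lambda/L_j}\!\left(x_j - \tfrac{1}{L_j}\, a_j^\top (A x - b)\right), \qquad L_j = \|a_j\|_2^2$$

Since $L_j = e_j^\top A^\top A\, e_j \le \lambda_{\max}(A^\top A) = L$ for every $j$, each coordinate step is at least as long as the global step $1/L$.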

If $A$ has a large condition number, how can we show that proximal coordinate descent is much less affected than proximal gradient descent?
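
The behaviour can be explored numerically before attempting a proof. Below is a minimal pure-Python sketch, not a demonstration in the mathematical sense. It assumes the usual step sizes: a global step $1/L$ for proximal gradient (here with $L$ taken as the safe upper bound $\operatorname{trace}(A^\top A) \ge \lambda_{\max}(A^\top A)$) and a per-coordinate step $1/\|a_j\|_2^2$ for cyclic coordinate descent. The data `cols`, `b` and `lam` are a hypothetical toy instance whose columns are mildly correlated but scaled by 1, 0.1 and 0.01, so that $A^\top A$ has a condition number of roughly $10^4$. All function names are illustrative.

```python
import math


def soft_threshold(z, tau):
    """Proximal operator of tau * |.|: shrink z toward zero by tau."""
    return math.copysign(max(abs(z) - tau, 0.0), z)


def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))


def residual(cols, b, x):
    """Return Ax - b, where cols[j] is the j-th column of A."""
    return [sum(c[i] * xj for c, xj in zip(cols, x)) - b[i] for i in range(len(b))]


def objective(cols, b, x, lam):
    """0.5 * ||Ax - b||_2^2 + lam * ||x||_1."""
    r = residual(cols, b, x)
    return 0.5 * dot(r, r) + lam * sum(abs(xj) for xj in x)


def proximal_gradient(cols, b, lam, iters):
    # trace(A^T A) >= lambda_max(A^T A), so 1/L is a safe global step size.
    # The bound is nearly tight here because one column dominates.
    L = sum(dot(c, c) for c in cols)
    x = [0.0] * len(cols)
    for _ in range(iters):
        r = residual(cols, b, x)
        x = [soft_threshold(xj - dot(c, r) / L, lam / L) for c, xj in zip(cols, x)]
    return x


def coordinate_descent(cols, b, lam, epochs):
    x = [0.0] * len(cols)
    r = residual(cols, b, x)
    for _ in range(epochs):
        for j, c in enumerate(cols):
            Lj = dot(c, c)  # coordinate-wise Lipschitz constant ||a_j||^2
            new = soft_threshold(x[j] - dot(c, r) / Lj, lam / Lj)
            # Keep the residual in sync with the updated coordinate.
            r = [ri + (new - x[j]) * ci for ri, ci in zip(r, c)]
            x[j] = new
    return x


# Hypothetical toy data: mildly correlated columns scaled by 1, 0.1, 0.01.
cols = [[1.0, 0.0, 0.0], [0.03, 0.1, 0.0], [0.003, 0.003, 0.01]]
b = [1.069, 0.209, 0.03]  # equals A x for x = (1, 2, 3)
lam = 1e-5

# One coordinate-descent epoch costs about as much as one gradient iteration.
x_pg = proximal_gradient(cols, b, lam, iters=100)
x_cd = coordinate_descent(cols, b, lam, epochs=100)
print("proximal gradient :", x_pg, objective(cols, b, x_pg, lam))
print("coordinate descent:", x_cd, objective(cols, b, x_cd, lam))
```

On this instance the coordinate-wise steps act like a diagonal rescaling of the problem, so the coefficient attached to the smallest column should be close to its optimum after the same budget, while proximal gradient is still held back by the single global step. This only illustrates ill-conditioning caused by column scaling. When the bad conditioning comes from strongly correlated columns, coordinate descent also slows down, so the sketch should not be read as a general guarantee.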

edited Dec 04 '19 at 05:07

Geoffrey Negiar

asked Oct 21 '17 at 13:49

aferjani
