 |
 |
Manthey, B. and Röglin, H.
( 2009)
Improved smoothed analysis of the -means method.
In: Proceedings of the 20th Annual ACM-SIAM Symposium on Discrete Algorithms, SODA 2009, 4-6 Jan 2009, New York, NY, USA.
.
SIAM.
ISBN not assigned
Full text available as: Official URL: http://www.siam.org/proceedings/soda/2009/SODA09_051_mantheyb.pdf  AbstractThe -means method is a widely used clustering algorithm. One of its distinguished features is its speed in practice. Its worst-case running-time, however, is exponential, leaving a gap between practical and theoretical performance. Arthur and Vassilvitskii aimed at closing this gap, and they proved a bound of poly( ) on the smoothed runningtime of the -means method, where is the number of data points and is the standard deviation of the Gaussian perturbation. This bound, though better than the worstcase bound, is still much larger than the running-time observed in practice.
We improve the smoothed analysis of the -means method by showing two upper bounds on the expected running-time of -means. First, we prove that the expected running-time is bounded by a polynomial in and Second, we prove an upper bound of poly( ), where is the dimension of the data space. The polynomial is independent of and , and we obtain a polynomial bound for the expected running-time for
Finally, we show that -means runs in smoothed polynomial time for one-dimensional instances. | Item Type: | Conference or Workshop Paper (Full Paper, Talk) |
|---|
| Research Group: | EWI-DMMP: Discrete Mathematics and Mathematical Programming |
|---|
| Research Program: | CTIT-IE&ICT: Industrial Engineering and ICT |
|---|
| Uncontrolled Keywords: | Smoothed analysis, -means method, Clustering |
|---|
| ID Code: | 16976 |
|---|
| Status: | Published |
|---|
| Deposited On: | 10 December 2009 |
|---|
| Refereed: | Yes |
|---|
| International: | Yes |
|---|
| More Information: | statisticsmetis |
|---|
Export this item as: To correct this item please ask your editor Repository Staff Only: edit this item
|
 |
 |