Viewing Issue Advanced Details
[ Jump to Notes ]
|
[ View Simple ]
[ Issue History ]
[ Print ]
|
ID |
Category |
Severity |
Reproducibility |
Date Submitted |
Last Update |
0000643 |
[ALGLIB] Data analysis |
feature |
have not tried |
2015-01-22 16:01 |
2018-01-25 18:07 |
|
Reporter |
SergeyB |
View Status |
public |
|
Assigned To |
SergeyB |
Priority |
normal |
Resolution |
implemented |
Platform |
|
Status |
resolved |
|
OS |
|
Projection |
none |
|
OS Version |
|
ETA |
none |
Fixed in Version |
3.10.0 |
Product Version |
|
|
Target Version |
3.10.0 |
Product Build |
|
|
Summary |
0000643: IMPLEMENTED: faster k-means clusterization (blocked, parallel) |
Description |
New incarnation of k-means is much faster than previous version due to several improvements:
* initialization algorithm was remastered - we can choose between random initialization, k-means++ (high-quality but slower and non-parallelizable), and "fast-greedy" algorithm which gives reasonably good results while being cache-efficient and parallelizable.
* main k-means iteration is now blocked and parallelized, which significantly improves performance on large-scale datasets |
Steps To Reproduce |
|
Additional Information |
|
|
Programming language |
Unspecified |
|
Attached Files |
|
|