Smart technology produces massive amounts of data and requires massive computation. Clustering algorithms are a fundamental data analysis tool, and GPUs are a promising technology for massive computation. We aim to implement some basic clustering algorithms for GPUs.
We are developing the k-means algorithm and its Bayesian variants in CUDA, targeting the Tesla K40. The functions will be exported to R through a simple wrapper.
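To make the GPU mapping concrete, here is a minimal sketch of the k-means assignment step in CUDA, with one thread per data point. It is an illustration under stated assumptions, not the project's actual code: the names assign_clusters and assign_clusters_host, the row-major float layout, and the block size of 256 are all hypothetical, and error checking of the CUDA calls is omitted for brevity.

```cuda
#include <cfloat>
#include <vector>
#include <cuda_runtime.h>

// Assignment step of k-means: each thread handles one point and picks the
// centroid with the smallest squared Euclidean distance.
// Assumed layout (not from the project): points is n x d, centroids is k x d,
// both row-major; labels has n entries.
__global__ void assign_clusters(const float* points, const float* centroids,
                                int* labels, int n, int d, int k)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;  // threads past the last point do nothing

    float best_dist = FLT_MAX;
    int best = 0;
    for (int c = 0; c < k; ++c) {
        float dist = 0.0f;
        for (int j = 0; j < d; ++j) {
            float diff = points[i * d + j] - centroids[c * d + j];
            dist += diff * diff;
        }
        if (dist < best_dist) {
            best_dist = dist;
            best = c;
        }
    }
    labels[i] = best;
}

// Hypothetical host helper: copies the inputs to the device, launches the
// kernel, and copies the labels back. Error checking is omitted.
std::vector<int> assign_clusters_host(const std::vector<float>& points,
                                      const std::vector<float>& centroids,
                                      int n, int d, int k)
{
    float* d_points = nullptr;
    float* d_centroids = nullptr;
    int* d_labels = nullptr;
    cudaMalloc(&d_points, points.size() * sizeof(float));
    cudaMalloc(&d_centroids, centroids.size() * sizeof(float));
    cudaMalloc(&d_labels, n * sizeof(int));

    cudaMemcpy(d_points, points.data(), points.size() * sizeof(float),
               cudaMemcpyHostToDevice);
    cudaMemcpy(d_centroids, centroids.data(), centroids.size() * sizeof(float),
               cudaMemcpyHostToDevice);

    int threads = 256;                         // assumed block size
    int blocks = (n + threads - 1) / threads;  // enough blocks to cover n points
    assign_clusters<<<blocks, threads>>>(d_points, d_centroids, d_labels, n, d, k);

    // cudaMemcpy on the default stream waits for the kernel to finish.
    std::vector<int> labels(n);
    cudaMemcpy(labels.data(), d_labels, n * sizeof(int), cudaMemcpyDeviceToHost);

    cudaFree(d_points);
    cudaFree(d_centroids);
    cudaFree(d_labels);
    return labels;
}
```

A full k-means iteration would follow this step with a centroid update (averaging the points assigned to each cluster) and repeat until the labels stop changing. That part is left out here, as are the Bayesian variants. On a working CUDA device, calling assign_clusters_host with the four 2-D points (0,0), (0,1), (10,10), (10,11) and the centroids (0,0.5), (10,10.5) should assign the first two points to centroid 0 and the last two to centroid 1.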
The project summary page can be found here.