K-Means implemented from scratch using CUDA

Usage

KMeans_CUDA model (data, N, D, K); // Where data is AoS, N is number of data points, D is number of dimensions and K is number of clusters
model.one_epoch(); // Trains one epoch
model.print_predictions(); // Prints the classifications. Can be commented out.
// printf ("Error: %f\n", model.compute_error()); // Uncomment to print error

Kernels

// Updates each centroid using d_sum and d_count
//    where the index is d * centroid number (out of k).
// d: number of dimensions
// k: number of clusters
__global__ void update_centroids(
    float *d_centroids,
    const float *d_sum,
    const int *d_count,
    int d,
    int k
);

// Computes the sum (d_sum) and count (d_count) 
//    for each of the k clusters labeled in d_centroids.
// n: number of data points
// d: number of dimensions
// k: number of clusters
// Uses shared memory of 3*k*d
__global__ void sum_and_count(
    const float *d_data,
    const float *d_centroids,
    float *d_sum,
    int *d_count,
    int n,
    int d,
    int k
);

// Computes error and updates d_error
// n: number of data points
// d: number of dimensions
// k: number of clusters
__global__ void calculate_error(
    const float *d_data,
    const float *d_centroids,
    float *d_error,
    int n,
    int d,
    int k
) {

Performance

Go to: https://github.com/Tyler-Hilbert/CUDA-KMeans/tree/52db75728794449dc152989c648e03b632d24c08 for most recent performance tests.

Performance compared to scikit-learn and ArrayFire

It is shown that this implementations of K-Means outperforms scitkit-learn and ArrayFire on a T4.

Usage Continued

To compile the test program:
$git clone https://github.com/Tyler-Hilbert/CUDA-KMeans.git
$cd CUDA-KMeans
$nvcc main.cpp KMeans_CUDA.cu -o kmeans
$./kmeans

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
Benchmark_ArrayFire		Benchmark_ArrayFire
Benchmark_scikit-learn		Benchmark_scikit-learn
Performance		Performance
TestData		TestData
KMeans_CUDA.cu		KMeans_CUDA.cu
KMeans_CUDA.h		KMeans_CUDA.h
README.md		README.md
main.cpp		main.cpp

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

K-Means implemented from scratch using CUDA

Usage

Kernels

Performance

Performance compared to scikit-learn and ArrayFire

Usage Continued

About

Languages

Tyler-Hilbert/CUDA-KMeans

Folders and files

Latest commit

History

Repository files navigation

K-Means implemented from scratch using CUDA

Usage

Kernels

Performance

Performance compared to scikit-learn and ArrayFire

Usage Continued

About

Topics

Resources

Stars

Watchers

Forks

Languages