Scalable ML Model with Real-Time Inference Endpoint for Customer Segmentation

Overview:

This repository showcases the development of a scalable machine learning model for customer segmentation, capable of handling real-time inference through a REST API. The project tackles the business challenge of segmenting customers using unstructured data to create tailored marketing strategies. The solution incorporates complex workflows such as data ingestion, transformation, model training, and deployment, leveraging modern tools like FastAPI, Docker, Kubernetes, and GitHub Actions.

Business Problem

Customer segmentation is vital for businesses to understand and address the diverse needs of their customer base. By grouping customers based on their behaviors and characteristics, companies can tailor marketing efforts, enhancing customer engagement and retention. This project develops a machine learning model that segments customers and provides real-time predictions via a REST API.

Workflow

The project workflow includes several interconnected components:

1. Data Ingestion: Load and prepare the customer dataset for analysis.
2. Data Transformation: Clean and preprocess the data to make it suitable for model training.
3. Model Training: Utilize the K-Means clustering algorithm to segment customers and determine the optimal number of clusters using the Elbow Method.
4. Real-Time Inference with FastAPI: Develop a REST API using FastAPI to serve the model and provide real-time predictions.
5. Containerization with Docker: Use Docker to containerize the FastAPI application, ensuring consistent deployment across various environments.
6. Deployment with Kubernetes: Deploy the containerized application to a Kubernetes cluster for scalability and high availability.
7. CI/CD Pipeline with GitHub Actions: Automate the build and deployment process using GitHub Actions, enabling continuous integration and continuous deployment.
8. Testing: Verify the model's performance and accuracy using Postman for API endpoint testing and a Gradio interface hosted on Hugging Face Spaces for user-friendly testing.

Project Structure

.
├── .github/workflows       # GitHub Actions workflows
│   └── deploy.yml
├── artifacts               # Directory for storing artifacts like models
├── src                     # Source code for the application
│   ├── components          # Data ingestion, transformation, and model training
│   ├── pipeline            # Prediction pipeline
│   ├── utils               # Utility functions
├── Dockerfile              # Docker configuration file
├── deployment.yaml         # Kubernetes deployment configuration
├── service.yaml            # Kubernetes service configuration
├── requirements.txt        # Python dependencies
├── main.py                 # FastAPI application
└── README.md               # Project description

Key Technologies

FastAPI: Framework for developing the REST API.
Docker: Containerization tool for creating consistent deployment environments.
Kubernetes: Orchestrates deployment, scaling, and management of containerized applications.
GitHub Actions: Automates the CI/CD pipeline for continuous integration and deployment.
Azure: Uses Azure Container Registry for storing Docker images and Azure Kubernetes Service for hosting the application.

Testing

Postman: Utilized for testing the API endpoints.
Gradio Interface: Developed a user-friendly interface hosted on Hugging Face Spaces to test the model in real-time.

Conclusion

This project provides an end-to-end solution for scalable customer segmentation with real-time inference capabilities. By leveraging modern tools and technologies, the model is robust, scalable, and easy to deploy, helping businesses tailor their marketing strategies effectively.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Real-Time-Customer-Segmentation-with-Scalable-Kubernetes-Deployment-and-CI-CD-Integration-main		Real-Time-Customer-Segmentation-with-Scalable-Kubernetes-Deployment-and-CI-CD-Integration-main
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scalable ML Model with Real-Time Inference Endpoint for Customer Segmentation

Overview:

Business Problem

Workflow

Project Structure

Key Technologies

Testing

Conclusion

About

Releases

Packages

Languages

namanngala/Real-Time-Customer-Segmentation

Folders and files

Latest commit

History

Repository files navigation

Scalable ML Model with Real-Time Inference Endpoint for Customer Segmentation

Overview:

Business Problem

Workflow

Project Structure

Key Technologies

Testing

Conclusion

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages