I help startups and enterprises build, deploy, and scale AI-powered systems — from RAG pipelines and LLM integrations to cloud infrastructure and DevOps automation.
What I Do
End-to-end delivery — from system design to production deployment.
Production-grade AI systems — RAG pipelines, LLM integrations, AI agents, and intelligent automation. From prototype to deployed product.
Take ML models from notebook to production. Model serving, monitoring, versioning with MLflow, and end-to-end pipelines using Airflow and cloud platforms.
Design and implement CI/CD pipelines, containerize applications with Docker, orchestrate with Kubernetes, and automate infrastructure with Terraform and IaC.
Architect scalable, cost-efficient infrastructure on AWS, Azure, and GCP. Cloud migration, modernization, and ongoing optimization for performance and cost.
High-performance backend systems using Python and FastAPI. Async processing, message queues (RabbitMQ/Kafka), database design, and scalable API development.
End-to-end data pipelines and transformation workflows. Snowflake, Airflow, SQL stored procedures, and seamless integration with downstream ML models.
Work
Real systems built and shipped.
Production-grade travel itinerary generator using Retrieval-Augmented Generation. Custom knowledge base via a FAISS vector store, intelligent fallback to general LLM knowledge, containerized with Docker and deployed to the cloud.
Real-time MLOps dashboard for monitoring model training runs. Integrates with MLflow for experiment tracking, backed by PostgreSQL and Redis caching, with live metric updates via multithreading.
Automated data pipeline for promotions analytics. Complex SQL transformations in Snowflake, orchestrated via Apache Airflow, outputs feeding directly into ML model pipelines.
High-throughput async data ingestion API with customizable parsing and standardization mechanisms. Built for diverse data sources with asynchronous task processing via message queues.
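The ingestion pattern above can be sketched with the standard library alone. This is a minimal, hypothetical sketch: an `asyncio.Queue` stands in for RabbitMQ/Kafka, and the parser registry and `standardize` helper are illustrative names, not the project's actual API.

```python
import asyncio
import csv
import io
import json

# Sketch of an async ingestion pipeline: producers enqueue raw payloads,
# a worker pool parses each one with a format-specific parser and emits
# records in one standardized shape. The queue stands in for a message broker.

def parse_json(raw: str) -> list[dict]:
    data = json.loads(raw)
    return data if isinstance(data, list) else [data]

def parse_csv(raw: str) -> list[dict]:
    return list(csv.DictReader(io.StringIO(raw)))

PARSERS = {"json": parse_json, "csv": parse_csv}  # pluggable per data source

def standardize(record: dict) -> dict:
    # Normalize keys so downstream consumers see one schema.
    return {k.strip().lower(): v for k, v in record.items()}

async def worker(queue: asyncio.Queue, results: list[dict]) -> None:
    while True:
        item = await queue.get()
        if item is None:               # sentinel: shut this worker down
            queue.task_done()
            return
        fmt, raw = item
        results.extend(standardize(r) for r in PARSERS[fmt](raw))
        queue.task_done()

async def ingest(payloads: list[tuple[str, str]], n_workers: int = 2) -> list[dict]:
    queue: asyncio.Queue = asyncio.Queue()
    results: list[dict] = []
    workers = [asyncio.create_task(worker(queue, results)) for _ in range(n_workers)]
    for payload in payloads:
        queue.put_nowait(payload)
    for _ in workers:
        queue.put_nowait(None)         # one sentinel per worker
    await queue.join()
    await asyncio.gather(*workers)
    return results

records = asyncio.run(ingest([
    ("json", '{"ID": 1, "City": "Pune"}'),
    ("csv", "ID,City\n2,Berlin\n3,Oslo\n"),
]))
print(records)
```

Swapping the in-process queue for a broker and the dict parsers for per-source plugins gives the same shape at production scale: parsing stays isolated per format, and standardization guarantees one schema downstream.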
About
I'm Pritam Bhutada — an AI Engineer and DevOps Architect with 8+ years of experience building and deploying intelligent systems at scale.
My work spans the full stack of modern AI delivery: designing RAG-based applications, integrating LLMs into production systems, building MLOps pipelines, and deploying everything on cloud infrastructure with proper CI/CD, containerization, and monitoring.
I work with startups and SMBs that want to move fast without breaking things — bringing the engineering discipline of enterprise teams to organizations that need results, not overhead.
Work With Me
Writing
Deep dives on system design, AI engineering, and backend architecture.
A deep dive into designing a recommendation system that scales to billions of interactions — architecture decisions, trade-offs, and production considerations.
Read on Medium →
How to evolve a news feed from a simple monolith to a distributed system that aggregates and personalizes content at scale for social platforms.
Read on Medium →
Exploring notification systems as essential infrastructure in connected applications — architecture patterns for delivering millions of notifications reliably.
Read on Medium →
Real-time communication patterns for live chat, notifications, and collaborative dashboards — building distributed WebSocket systems that actually hold up under load.
Read on Medium →
A practical look at consistent hashing — why naive approaches fail under node changes, and how consistent hashing keeps distributed systems stable at scale.
Read on Medium →
In microservices and scalable APIs, a robust rate limiter is a necessity. A complete implementation using FastAPI, Redis, and Lua scripts for atomic operations.
Read on Medium →
How Retrieval-Augmented Generation overcomes LLM limitations for factual accuracy — building and deploying a full RAG pipeline from scratch to production.
Read on Medium →
Contact
Have a project in mind? Let's talk.