MLOps · 6 min read
How to Deploy LLMs to Kubernetes with vLLM: A Production Guide
Running LLMs in production is as much an infrastructure problem as an AI problem. Here's the exact setup we use to put language models into production for AI startups: GPU node pools, vLLM on Kubernetes, autoscaling, and request routing.
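As a taste of what the guide covers, here is a minimal sketch of a vLLM Deployment manifest. It assumes a cluster with a GPU node pool and the NVIDIA device plugin installed; the model name, replica count, and resource sizes are illustrative placeholders, not the production values discussed later.

```yaml
# Minimal sketch: vLLM's OpenAI-compatible server as a Kubernetes Deployment.
# Assumes a GPU node pool with the NVIDIA device plugin; values are illustrative.
apiVersion: apps/v1
kind: Deployment
metadata:
  name: vllm-server
spec:
  replicas: 1
  selector:
    matchLabels:
      app: vllm-server
  template:
    metadata:
      labels:
        app: vllm-server
    spec:
      containers:
        - name: vllm
          image: vllm/vllm-openai:latest  # official vLLM server image
          args:
            - "--model"
            - "meta-llama/Llama-3.1-8B-Instruct"  # example model, swap in your own
          ports:
            - containerPort: 8000  # vLLM's default HTTP port
          resources:
            limits:
              nvidia.com/gpu: 1  # forces scheduling onto a GPU node
```

A Service (and typically an Ingress or gateway) in front of this Deployment would then handle the request routing the article describes.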
March 7, 2026