AI Systems Engineer | Backend & Distributed AI Infrastructure
I build scalable AI systems, LLM-powered agents, and production-grade ML pipelines that bridge cutting-edge AI models with real-world applications.
My work focuses on distributed inference, low-latency AI pipelines, and fault-tolerant backend systems.
I design and deploy AI solutions that operate at scale and deliver measurable impact:
- Distributed LLM Systems: Multi-agent pipelines, prompt orchestration, and GPU-accelerated inference using Ray, LangChain, and Python.
- Real-Time Predictive Analytics: Predictive maintenance and event detection systems using Spark, AWS, and streaming architectures.
- Multimodal AI Pipelines: Computer vision + NLP integrations for interactive applications, leveraging Azure, FastAPI, and cloud storage.
- RAG / LLM Applications: Retrieval-Augmented Generation pipelines for enterprise knowledge, task automation, and intelligent agents.
Tech: Python, FastAPI, Vision + LLM, Real-Time Agent
- A real-time AI agent that interprets interfaces, understands tasks, and executes actions autonomously.
- Combines computer vision and LLM-based reasoning for multi-step agent orchestration.
Tech: Python, FastAPI, React, Azure, Computer Vision
- Developed a vision-based virtual try-on system for window blinds with real-time inference pipelines.
- Integrated backend + frontend to provide a production-ready visual AI experience.
Tech: Python, LangChain, Pinecone, LLMs
- Built an AI commerce assistant with RAG capabilities, enabling automated responses and product recommendations.
- Designed multi-step agent orchestration pipelines with low-latency query retrieval.
Tech: Python, FastAPI, LangChain, OpenAI API
- LLM-powered assistant for job search automation and application tracking.
- Implemented structured prompts and backend services for real-time recommendations.
AI & ML: LLMs, NLP, RAG, Computer Vision, Multimodal AI, Real-Time Inference
Backend: Python, FastAPI, Flask, Distributed Systems, Ray
Cloud & Infra: Azure, AWS, Docker
Frontend & Full Stack: React, TypeScript, Node.js
Databases: Milvus, Pinecone, SQL/NoSQL