who I am
ABOUT4+
yrs exp
6+
companies
10+
shipped
Founding Engineer building LLM-driven systems that work for real users.
Based in San Jose, CA. I've spent the last 4+ years building production systems across fintech, EdTech, and AI: from compliance platforms at Deloitte to multimodal RAG at Astranetix and agentic evaluation pipelines at SJSU.
I care deeply about systems that are observable, maintainable, and actually solve real problems, not just impressive demos. My stack is Python-first, cloud-native, and increasingly agent-driven.
Published research on Hindi NLP. Outside engineering: basketball, sketching, and building things with my hands.
education
M.S. Software Engineering
San Jose State University
2023 – 2025 · GPA 3.6
B.E. Engineering
Osmania University, Hyderabad
2016 – 2020 · GPA 3.5
things I've built
PROJECTSScorePAL: Agentic Evaluation Platform
SJSU
Multi-agent evaluation system using CrewAI where agents autonomously decompose rubrics, retrieve context via Weaviate vector search, and score submissions using Gemini on GCP. Multimodal RAG over text and image inputs reduced manual grading effort by 60%.
AI Suspect Sketch Generator
Text-to-image sketch generation using Stable Diffusion + ControlNet for shape and style consistency. End-to-end full-stack with async job handling and serverless image processing on AWS. 45% latency reduction via embedding caching and request batching.
Multimodal RAG System
Production RAG combining text and document embeddings. Weaviate + GraphQL retrieval, tuned HNSW parameters. Scalable inference on AWS Lambda and S3.
LangChain + Neo4j Knowledge Graph
Knowledge graph-powered RAG using LangChain and Neo4j. Hybrid search combining vector similarity and graph traversal for richer LLM responses.
Movie Ticket Booking System
Full-stack booking platform with React, Node.js, and GraphQL for real-time seat selection. Microservice backend with MongoDB, containerized via Docker, deployed with CI/CD.
Expira: ID Expiry Tracker
Flutter mobile app scanning ID documents and tracking expiry dates. OCR-based extraction, push notifications, offline-first, cross-platform iOS & Android.
what I work with
SKILLSLanguages
AI / ML
Frameworks
Databases
Cloud & DevOps
Data
where I've worked
EXPERIENCEFounding Engineer
· Now- Built LLM-driven workflows that convert raw user input into structured business profiles and narrative-style stories, powering personalized discovery and community engagement for 1,000+ active users.
- Developed recommendation workflows leveraging user interaction signals and embedding-based similarity to customize content and profile ranking.
- Designed service APIs using Python (FastAPI) and TypeScript, integrated with MongoDB Atlas and AWS to support inference, context assembly, and response orchestration.
- Operated containerized services with production-grade observability, telemetry, and cost-aware scaling through CI/CD pipelines supporting frequent deployments.

Teaching Assistant
- Supported courses in Machine Learning, Networking, and Information Security. Mentored students on distributed systems, consistency models, and high-availability design.
- Led lab sessions and debugging walkthroughs covering model evaluation, network protocols, and Linux system internals, helping students apply theoretical concepts to practical implementations.

AI Engineer
- Developed and productionized multimodal Retrieval-Augmented Generation solutions combining text and document embeddings to ground large language model outputs.
- Established chunking, embedding, and vector retrieval workflows using Weaviate and GraphQL, tuning HNSW parameters to improve semantic recall and reduce inference latency.
- Evaluated models from GCP Model Garden to inform production model selection; operated scalable inference services on AWS Lambda and S3 with a focus on throughput control, fault tolerance, and cost efficiency.

AI Intern
- Evaluated advanced RAG techniques including GraphRAG and RAPTOR to enhance multi-hop reasoning and contextual grounding over large document corpora.
- Adapted embedding models and task-specific language models using domain datasets; constructed offline evaluation workflows to assess relevance, precision, recall, and latency.
- Executed controlled A/B experiments across chunking strategies, embedding choices, and retrieval parameters to inform production-ready configuration decisions.

Associate Software Analyst
- Contributed to data-intensive, compliance-focused fintech platforms by developing Python-based backend services integrated with Angular frontends.
- Implemented REST and GraphQL interfaces in Python to enable analytics, reporting, and integration with downstream data-driven and intelligent systems.
- Prototyped AI-oriented pipelines for text classification, information extraction, and similarity search on enterprise datasets to assess automation feasibility.
- Strengthened automated testing, release validation, and CI/CD processes, improving deployment reliability and reducing post-release defects by ~25%.
- Recognized with a Spot Award for delivering high-impact solutions and consistently exceeding performance and quality expectations.

Trainee Software Engineer
- Implemented JVM-based backend services in Scala supporting data-centric workflows for EdTech and media streaming applications.
- Created RESTful interfaces enabling analytics ingestion, personalization logic, and multi-client content delivery.
- Improved AWS DynamoDB performance through optimized key design and query patterns, reducing API response times by roughly 30%.

research
PUBLICATIONLearning-Based Approach for Hindi Text Sentiment Analysis using Naive Bayes Classifier
International Journal of Innovative Engineering Research and Technology (IJIERT) · 2020
Proposed a machine learning approach for sentiment classification of Hindi-language text using a Naive Bayes classifier, addressing the unique morphological and syntactic challenges of Hindi NLP.
let's talk
CONTACT