Engineering Tomorrow: Soham Katkar
A Full-Stack Developer & Data Systems Engineer specializing in scalable C++ data models, PostgreSQL, and LLM-driven automation.
"Bridging robust backend engineering with modern AI and cloud-native architectures."
About Me
I am currently pursuing a Master's in Information Systems at UMBC, building upon my three years of professional experience as a **Software Engineer at HSBC Technology India**. My expertise lies in high-performance backend systems, specifically optimizing scalable, multi-threaded **C++ data models** and highly efficient **SQL queries** for global payment systems.
I specialize in modern DevOps practices, leveraging **Kubernetes, Docker, Jenkins, and GitHub** to maintain robust and secure microservices. My commitment to innovation is highlighted by my experience developing an LLM-based monitoring tool to automate incident summarization, demonstrating a keen interest in applying AI to streamline operations.
I am actively seeking a full-time role where I can combine my skills in high-volume transaction systems and full-stack development (React.js, Flask) to drive efficiency and deliver cutting-edge technical solutions.
Experience & Education
Master's, Information Systems
Dec 2025University of Maryland, Baltimore County (UMBC)
Currently pursuing a Master's degree, focusing on advanced systems architecture and data management. GPA: 3.62/4.0.
Software Engineer
Aug 2021 - Dec 2023HSBC Technology India, Pune
- Boosted system efficiency by **15%** by developing scalable, multi-threaded C++ data models and optimizing SQL queries.
- Resolved **200+ real-time JIRA issues** for the Global Payment System, leveraging Kubernetes, Docker, and Jenkins CI/CD.
- Developed a proof-of-concept **LLM-based monitoring tool** for payment transaction log analysis, automating incident summarization.
- Increased backend test coverage by **35%** with JUnit test cases and automated server installations using shell scripting.
Data Analyst Intern
Feb 2021 - Jul 2021Incentius Solution Ltd., Pune
- Streamlined sales data processing across multiple European countries, cutting processing time by **25%** using SQL and ETL tools.
- Developed scalable, rule-based data workflows with Python for cleaning and categorization, improving speed and accuracy by **30%**.
- Enhanced stakeholder reporting by designing interactive dashboards in Power BI and Excel, increasing report usage by **50%**.
Core Technologies
C++, Python, Java
Core LanguagesPostgreSQL & Neo4j
Data & QueriesReact.js & Flask
Full-Stack DevKubernetes & Docker
CI/CD & CloudLLMs & Ollama
AI/NLP ToolsSpark & Kafka
Big Data ProcessingFeatured Projects
Retrieval Augmented Generation (RAG)
An advanced LLM system leveraging RAG architecture (LangChain, Hugging Face, FAISS) to generate accurate, context-aware answers grounded in external documents.
Healthcare AI Prediction
An AI/ML project focused on predictive modeling for healthcare outcomes, demonstrating data processing, model training, and deployment.
AI Study Coach (HackUMBC)
Intelligent advisory system using **Neo4j graph database** and **local AI (Ollama)** to generate personalized study recommendations.
Full-Stack Tic-Tac-Toe Game
Web game featuring CRUD operations, game persistence, and real-time player statistics using a React, Express, and PostgreSQL stack.
More Projects Coming Soon...
Find all my repositories on my GitHub profile.
Get In Touch
I'm currently looking for new opportunities. Whether you have a question or just want to say hi, my inbox is always open.
Say Hello!Or connect with me on LinkedIn.