AI Platform & Performance Engineer · Pune, India

Shubham Pagare I turn complex AI systems into business outcomes.

Senior engineer with 5+ years designing scalable AI systems, LLM observability platforms, and enterprise performance solutions across Azure, AWS, and GCP — improving system performance, cutting cloud costs, and shipping AI for Fortune 500 clients.

Core stack AWS Azure GCP Python LLM LangChain LangGraph Langfuse
scroll
About

Engineer, translator, builder

I sit where business goals meet technical reality — and make sure both win.

I'm a senior AI and performance engineer who understands the business behind the build. I don't just write code — I turn ideas into shipped, scalable reality, and I make sure the engineering serves a real outcome.

Much of my work happens in the gap between stakeholders and development: clarifying what's actually needed, surfacing tradeoffs early, and translating ambition into systems that perform. My favorite thing to do is make complicated things feel simple — for users, for teams, and for the people making the call.

I understand the business, not just the code
I turn ideas into shipped reality
I bridge the gap between stakeholders and development
I make critical, complex things simple
120%
response-time gain on WebSocket bottlenecks
10×
execution speedup, Python → C++
~6 min
analysis cut from a 1-hour task
100%
tool-licensing savings vs LoadRunner
How I work

The way I think

Beyond the tools — the engineering instincts I bring to every system, team, and hard problem.

Architect Mindset

See the whole system before the parts — design for scale, trace the data, anticipate where it breaks.

Critical Thinking

Question assumptions, follow evidence, and reason from first principles instead of surface symptoms.

Problem Solving

Break ambiguous, multi-layer issues into root causes — then ship the fix that actually holds.

Deep Diagnostics

Comfortable in the weeds: thread dumps, memory leaks, latency hotspots, token-level profiling.

Stakeholder Clarity

Translate complex engineering into decisions leaders can act on — straight to the people who matter.

Continuous Learning

Stay ahead of a fast-moving field — from load testing to agentic AI — and bring teams along.

Experience

Where I've shipped

A run through the roles, the clients, and the measurable outcomes — from AI agent platforms to deep performance forensics.

Dec 2024 — Present

Senior Performance & AI Engineer

Waynautic Technologies
Client: PwC US— AI-based product on Azure Cloud
  • Designed and deployed multi-agent AI workflows with LangChain, LangGraph, and Azure OpenAI to automate QA and business processes.
  • Built a custom E2E trace tracking tool with Langfuse for deep LLM pipeline visibility — surfacing latency hotspots, token inefficiencies, and failure points.
  • Led AI performance engineering across LLM analysis, load-testing strategy, and Azure optimization, driving measurable cost reduction.
  • Identified critical architectural bottlenecks and shaped design decisions in direct review with PwC US stakeholders.
Sept 2023 — Nov 2024

Performance Engineer

Zensoft Services
Client: PwC US
  • Achieved 120% response-time improvement and ~0.1% error rate resolving multi-layer WebSocket performance issues.
  • Built a custom JMeter + Selenium framework replacing LoadRunner TrueClient — 100% savings on tool licensing.
  • Automated test execution and analysis on Azure ADO with Python, cutting insight-delivery time by 50%.
Aug 2021 — Sept 2023

Performance Engineer

NTT Data Services
Client:UNIQLO Uniqlo (UQ)
  • Led an in-house open-source monitoring & auto-analysis tool, reducing a 1-hour task to ~6–7 minutes.
  • Diagnosed memory leaks, packet drops, and microservice issues across AWS and GCP, significantly improving production stability.
  • Built precision JMeter load models and implemented database replication for improved reliability.
  • Star Award (Foresight & Hard Work) and KK Award (Outstanding Performance of the Year).
Jan 2021 — June 2021

Engineering Intern

Softnautics LLP
  • Migrated a codebase from Python to C++, achieving a 10× improvement in execution speed.
Toolkit

What I work with

From agentic orchestration to thread-dump forensics — the stack behind the outcomes.

Generative AI & Agentic Systems
LangChainLangGraphLangfuseAI Agent DevLLM ObservabilityE2E Trace AnalysisPrompt Engineering
Performance & Monitoring
JMeterGatlingAzure Load TestingLoadRunnerGrafanaDatadogDynatraceCloudWatchApp Insights
Cloud & DevOps
Azure AKSAWS ECSGCPJenkinsAzure ADODockerGit / GitHubMicroservices
Programming & Analysis
PythonJavaScriptReactDjangoC++Thread / Heap DumpAWRLLM Token Profiling
Selected Work

Things I built

Platforms born from the same instinct — making complex AI systems measurable and accountable. Hover to watch them run.

hover to run
01 / AGENTS

AI Agent Platform

LangGraph-based agents that talk to each other to handle QA and business automation.

Integrated enterprise tools and workflows, reducing manual effort and improving operational efficiency.

LangGraphMulti-AgentAutomation
hover to run
02 / OBSERVABILITY

LLM Observability Platform

End-to-end trace analysis on Langfuse — watch spans flow through the pipeline.

Tracks latency, token consumption, failures, and cost for root-cause analysis.

LangfuseTracingToken Profiling
hover to run
03 / NOW

Currently Exploring

Sharpening fundamentals — data structures & algorithms and competitive programming.

Going deeper on agentic AI patterns and system design to build faster, sturdier systems.

DSACompetitive ProgrammingSystem DesignAgentic AI
Foundations

Education & recognition

DEGREE

B.Tech, Computer Science (AI/ML)

MIT ADT University, Pune
CGPA 8.32/10 · Distinction · Aug 2017 — July 2021
Star Award — Foresight & Hard Work · NTT Data KK Award — Outstanding Performance of the Year · NTT Data
CERTIFICATIONS
2024Site Reliability Engineering — Google / Coursera
2019Data Science Orientation — IBM / Coursera
Contact

Let's build something observable.

Open to roles and collaborations in AI platform engineering, LLM observability, and performance. The fastest way to reach me is below.