Scaling Postgres with pgvector — what we learned at 2M embeddings
Lessons from running a production HNSW vector index on Postgres for code search at scale. Recall, latency, and the operational tradeoffs nobody warns you about.
Full-stack developer based in Lahore, Pakistan. I work with startups and growing teams to ship production web apps — from Python and Java backends to React frontends, deployed on AWS.


I'm a full-stack developer with a Bachelor's degree in Computer Science from UMT, Lahore. For the past three years I've been building production systems for clients across fintech, AI, and developer tools — usually as the engineer who can take an idea from a Figma file to a deployed, monitored, and documented system.
I work mostly across the Python, Java, and JavaScript ecosystems. I'm comfortable on the database side (Postgres, MySQL, MongoDB), the cloud side (Docker, AWS), and the UI side (React, Next.js, Tailwind).
I take on a small number of projects at a time so I can give each one the attention it deserves. If you're a founder or team looking for someone reliable to ship a real product — let's talk.

I write code that's tested, monitored, and ready to handle real users — not demos.
From the database schema to the React component. One person who can take a project all the way through.
Async-friendly, no theatrics. You'll always know where the project stands and what's next.
I optimize for the codebase you'll inherit six months from now — not just the demo on launch day.
Twenty-five technologies hardened across three years of shipping production. Grouped by what they're for.
6 technologies
3 technologies
5 technologies
3 technologies
4 technologies
4 technologies
Three projects I led end-to-end. Each one is its own page with the full story — the problem, the architecture, what shipped.
Six things I do well. Send me a brief and I'll tell you honestly whether I'm the right fit.
Production-grade web apps from data layer to pixel-perfect UI. Django, Flask, or Node.js on the backend; React + Next.js on the frontend.
Scalable backend systems engineered for the long term. Microservices, event-driven architectures, and database design that scales past 1M rows.
AWS-native infrastructure with Docker containers, CI/CD pipelines, and observability baked in. From zero to deployed in days, not months.
Bring large language models into your existing product. OpenAI, Claude, Gemini — implemented with caching, streaming, and cost controls that scale.
End-to-end ML pipelines from data ingest to model serving. Scikit-learn, Pandas, NumPy on classic ML; LLM-based systems for modern workloads.
Architecture review, code audits, and technical decision-making for early-stage teams. Get a senior engineer's opinion before you commit to a 6-month build.
"Ali rebuilt our reconciliation pipeline in three weeks. Cut what used to take five days down to under eight hours. He thinks like an owner — flagged three architectural risks we hadn't noticed and fixed two of them before we even prioritized."
"Most contractors ship code. Ali ships systems — with monitoring, with docs, with a runbook. Our oncall actually thanked me for hiring him. I've never had that happen before."
"We were burning $14k/month on a managed observability stack. Ali designed and shipped a self-hosted alternative that's now cheaper, faster, and easier to query. Paid for itself in 60 days."
"He integrated Claude into our editor and the cache hit rate is sitting at 78%. Token bill dropped by more than half month-over-month. Excellent communicator — async-friendly, no theatrics, just delivery."
"Hired Ali for a one-week architecture review. Found a P0 race condition we'd been chasing for a quarter. Stayed on for the full backend rebuild after that. One of the strongest engineers I've worked with this year."
"I needed a full-stack person who could ship without holding my hand. Ali delivered the MVP, got it on AWS, and walked me through everything in a 90-minute Loom. Already lined him up for v2."
Lessons from production — Postgres, AWS, AI, and the operational tradeoffs that don't make it into tutorials.
Lessons from running a production HNSW vector index on Postgres for code search at scale. Recall, latency, and the operational tradeoffs nobody warns you about.
A practical walkthrough of the four biggest cost wins on serverless: right-sizing memory, ARM architecture, provisioned concurrency, and the invocation patterns nobody profiles.
Six months of shipping with the App Router and RSC. The mental model shifts, the gotchas, and the patterns that emerged after the hype settled.
Send me a quick brief and I'll respond within 24 hours. If we're a fit, we'll set up a call. If not, I'll happily point you in the right direction.