The Good Tech Companies - Abhigyan Khaund on the Systems Engineering Behind AI Applications

Episode Date: July 16, 2025

This story was originally published on HackerNoon at: https://hackernoon.com/abhigyan-khaund-on-the-systems-engineering-behind-ai-applications. Abhigyan Khaund builds AI... infrastructure that scales—powering fraud detection, real-time apps, and resilient systems that keep AI reliable under pressure. Check more stories related to machine-learning at: https://hackernoon.com/c/machine-learning. You can also check exclusive content about #ai-systems-engineering, #abhigyan-khaund, #backend-infrastructure-ai, #real-time-fraud-detection, #scalable-ai-applications, #microsoft-meta-engineer, #palantir-software-engineer, #good-company, and more. This story was written by: @jonstojanjournalist. Learn more about this writer by checking @jonstojanjournalist's about page, and for more stories, please visit hackernoon.com. Abhigyan Khaund, a systems engineer at Palantir and former Microsoft/Meta contributor, builds the backend infrastructure that keeps AI systems fast, reliable, and scalable. From latency fixes to fraud detection, his work ensures AI applications work in real-world, high-stakes conditions—where reliability matters as much as intelligence.

Transcript
Discussion (0)
Starting point is 00:00:00 This audio is presented by Hacker Noon, where anyone can learn anything about any technology. Abhigyan Count on the systems engineering behind AI applications, by John Stoyan Journalist. Artificial intelligence has the ability to automate logistics, scan vast datasets for hidden insights, and power tools that respond to users in real time, fundamentally changing how many industries operate. But for these systems to function reliably at scale, they need more than just well-trained models. They need a stable backend infrastructure that can support fast data flow, coordinate services across multiple servers, and scale as demand grows.
Starting point is 00:00:37 That kind of reliability depends on engineers who understand the systems behind the models and know how to make them production-ready. With experience at companies like Microsoft and Meta, systems-focused software engineer Abhigyan Kound has contributed to fraud detection systems that flag anomalies in real-time latency reduction tools and secure digital framework for defense teams. He was a pioneer in bringing AI techniques like machine learning and reinforcement learning into real-world applications. Read on for a closer look at Abhigyan Kaun's early life, career and valuable contributions
Starting point is 00:01:09 to building the backend infrastructure behind technologies that operate at scale and serve millions worldwide. Getting his start at Microsoft. Tackling latency in enterprise communication. After earning a bachelor's in computer science from the Indian Institute of Technology, Mandi, Abhigyan began building real-world experience in backend systems as a software engineer at Microsoft. He worked on the digital backbone for the shared channels feature for Microsoft Teams, which lets users from different organizations collaborate in the same workspace.
Starting point is 00:01:40 He was in charge of tackling the feature's growing latency issues. As Adoption Inc. raised, onboarding delays became more frequent, with new users having to wait several minutes before gaining access, something unacceptable at an enterprise level. The issue stemmed from the features policy evaluation flow. Each time a user was added, the system reevaluated permissions from scratch, triggering redundant checks and multiple network calls.
Starting point is 00:02:04 Abagian redesigned this flow by introducing intelligent caching of access policies, reducing unnecessary duplicate evaluations, and streamlining communication between backend services. These changes led to a tenfold improvement in onboarding time as it allowed the feature to scale effectively as usage grew. That experience gave me a deeper appreciation for systems design and made Mewent to work on the kind of infrastructure that quietly powers complex, high-stakes environments, he recalls. Shifting to meta and running into constraints for AI. Inspired by his time at Microsoft, Abagian deepened his focus on computer science with a masters at Georgia Tech.
Starting point is 00:02:42 During his studies, he joined Meta as a machine learning engineer intern, contributing to a company-wide initiative to improve fraud detection. The team used reinforcement learning, an area of eye that trains systems to make decisions based on real-time feedback, to analyze ouzer behavior and flag suspicious activity as it happened. Although critics of reinforcement learning say there are risks of false positives and algorithmic bias in large-scale fraud detection systems, Abigan and his team saw positive results. While the models performed well in testing, the production system struggled because the pipeline was built on loosely coupled microservices with no failover support, so even minor slowdowns could disrupt the entire flow. Real-time data has a lot of variance
Starting point is 00:03:25 and can trigger a lot of edge cases in the system, but systems like these do not have the luxury of failing or giving incorrect results at critical times. This wasn't Abhigyan's first time dealing with fragile systems. As an undergraduate, he worked on Icebreaker, a cold start video recommendation engine designed to operate with minimal behavioral data.
Starting point is 00:03:45 In this case, the challenge was merging signals from disparate sources such as metadata, embeddings, and sparse user history into a single pipeline that could still return relevant results from the start. The tools and states at Meta were different, but the challenge was the same, building an AI system capable of making consistent, real-time decisions under pressure and with incomplete data. While the team succeeded in improving the models, the project reinforced Abhigyan's commitment to working on application infrastructure. I learned that elegant systems aren't the ones with the fanciest architecture. Rather, they're the ones that stay standing when things go sideways.
Starting point is 00:04:21 Developing a unified operating picture framework, Abhigyan is now a software engineer at Palantir Technologies, a company known for building data platforms. There he works on backend systems that power high impact real-world outcomes. His main responsibility involves maintaining a unified operating picture framework that gives distributed teams across company branches, partner organizations, and disconnected environments a shared, real-time situational awareness. These systems integrate diverse data streams, function reliably across complex environments, and enable responsive interaction between people and technology.
Starting point is 00:04:57 Abagian focuses on keeping the framework reliable as more teams adopt it, ensuring data pipelines stay fast, consistent, and secure. This is a high-stakes environment where reliability and stability matter a lot, he explains. I've worked on making sure the backends can support large-scale and coordination securely. Building the infrastructure that keeps AI from breaking, Abhigyan has turned his attention to AI systems that can operate independently of large cloud platforms. Instead of relying on constant server access, these lightweight, task-specific agents run directly on personal devices like phone-san tablets, allowing them to respond quickly
Starting point is 00:05:34 and continue functioning even when connectivity is limited. One framework he points to as an example is the Model Context Protocol, MCP, which enables AI agents to securely and dynamically connect to external data sources. For Abhigyan, this is a crucial step toward making intelligence more useful in the environments where it's actually needed. In the long run, he sees AI engineering evolving into a hybrid discipline, part model builder, part systems thinker. He's especially interested in how model scan better respond in real time, not just predict
Starting point is 00:06:07 in batches, how they hold youp under unpredictable user behavior, and how to isolate failures without compromising trust in the system. The end goal is to make AI feel less like magic in the cloud and more like something reliable, useful, and accessible, he concludes. Through his work in fraud detection, enterprise data SaaS, and workplace software, Abhigyan Kaun's career reflects a clear principle. AI is only as useful as the technological foundation behind it. That means systems that hold up under pressure, pipelines that keep data flowing smoothly, and low-latency
Starting point is 00:06:39 tools that respond the moment they're needed. And as this technology becomes more embedded in critical operations, his work highlights the importance of not just smarter models, but systems designed top-perform reliably in real-world conditions. Thank you for listening to this Hacker Noon story, read by Artificial Intelligence. Visit HackerNoon.com to read, write, learn and publish.

There aren't comments yet for this episode. Click on any sentence in the transcript to leave a comment.