Apurva Modi — Software Development Engineer at Amazon Web Services, Seattle, WA

Experienced Software Development Engineer with 5+ years at Amazon, specializing in distributed systems, full-stack development, and enterprise SaaS applications. Key contributor to Amazon Bedrock, Amazon Q Business, AWS AppFabric, Amazon WorkDocs, and Amazon Chime. Proven track record of designing high-performance systems handling 1000+ TPS, leading cross-functional collaborations, and delivering high-impact solutions at scale.

Open to senior and staff engineering roles in AI infrastructure, distributed systems, and LLM serving. Reach me at modiapurva03@gmail.com or +1-757-670-5100.

Experience

Software Development Engineer II · Amazon — Bedrock Infrastructure (Mar 2025 — Present)

Technologies: Rust, DynamoDB, ECS, CDK, Tokio, AppConfig, ALB

Inference Scheduling Service (Rudi)

Co-architected and developed Rudi, a Rust-based Bedrock inference scheduling service handling 1000+ TPS that governs the timing and ordering of inference requests from FES to foundation models; implemented concurrency-per-variant enforcement, priority-based queuing with weighted round-robin scheduling, request throttling/load shedding, and deferred capacity allocation, enabling Provisioned Throughput V2 without hard-allocating capacity
Built on a single-threaded Tokio scheduling engine with MPSC/oneshot channel patterns for microsecond-level scheduling decisions; deployed across all Bedrock regions
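
For readers unfamiliar with the technique, the following is a minimal Python sketch of priority-based queuing with weighted round-robin scheduling. It is not Rudi's implementation, which is written in Rust; the `WeightedRoundRobin` class, the tier names, and the weights are all hypothetical and chosen only to illustrate the idea.

```python
from collections import deque


class WeightedRoundRobin:
    """Toy weighted round-robin over per-priority FIFO queues.

    Tier names and weights are hypothetical, for illustration only.
    """

    def __init__(self, weights):
        # weights: dict mapping tier name -> positive integer weight
        if any(w <= 0 for w in weights.values()):
            raise ValueError("weights must be positive integers")
        self.weights = dict(weights)
        self.queues = {tier: deque() for tier in weights}

    def enqueue(self, tier, request):
        self.queues[tier].append(request)

    def drain(self):
        """Yield requests in weighted round-robin order until all queues are empty.

        In each cycle a tier may dispatch up to `weight` requests, so a
        higher weight gets a larger share without starving lower tiers.
        """
        while any(self.queues.values()):
            for tier, weight in self.weights.items():
                queue = self.queues[tier]
                for _ in range(min(weight, len(queue))):
                    yield queue.popleft()


scheduler = WeightedRoundRobin({"high": 3, "medium": 2, "low": 1})
for i in range(4):
    scheduler.enqueue("high", f"h{i}")
    scheduler.enqueue("medium", f"m{i}")
    scheduler.enqueue("low", f"l{i}")

order = list(scheduler.drain())
print(order)
```

With weights 3:2:1, the first cycle dispatches three high-tier requests, two medium, and one low, so lower tiers keep making progress even while higher tiers are busy.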

Priority Queue Migration

Implemented two-phase zero-downtime migration from legacy 4-value (0,1,2,3) to standardized 7-tier priority system (P5–P100), with bidirectional session token normalization handling values 0–150 for rollback compatibility
Built configuration-driven priority weights, AppConfig/Rust fallback configs, and capacity allocation constraints; deployed across Beta, PreProd, and Prod with zero incidents over a 6-week rollout, enabling 2 new revenue tiers (Premium OD, Best Effort OD)

Quality of Service (QoS)

Implemented QoS tier mapping (RudiPriority → QoS 0/1/10) in schedule responses and forwarding logic for Anthropic models (Claude Sonnet, Haiku, Opus); added preemptable flag in capacity constraints and AppConfig-based model QoS detection
Built unified priority assignment avoiding separate QoS vs. non-QoS code paths; updated capacity allocator to deduct preemptable consumption only for QoS-supported models

RAMA Classification Consolidation

Led 4-team consolidation expanding from a 2-value to a 10-classification system (including PTv2, Priority Access, OnDemand, Flex, and Batch); implemented fail-closed exception throwing in Rudi and FES for unknown classifications with SEV2 alarms
Built thick client with AppConfig-based cached overrides (50+ account-level imports), shadow mode validation framework, and session token migration including RAMA classification for request lifecycle consistency

ServiceTier API

Implemented ServiceTier-to-RudiPriority mapping with configuration-driven rules engine (service_tier.json, 15+ rules) supporting FLEX/PRIORITY tiers and RAMA BEST_EFFORT integration; built dual-operation mode with RequestType fallback and feature flags for gradual rollout

Leader-Follower & Game Day

Implemented leader-follower architecture with DynamoDB coordination, request forwarding, and broadcast mechanisms; automatic failover recovery under 10 seconds
Built and executed 27 Game Day failure scenarios (routing, leader election, capacity tracking, peer communication); identified and fixed 5 critical edge cases including session token priority mismatches, TTL recovery, and ALB cookie validation

Operations & Full-CD

Resolved 800+ tickets; created 30+ composite alarms reducing false positives by 70%; reduced MTTR by 60% through variant-level monitoring, priority fairness alarms, and cache error metrics
Enabled Full-CD for AppConfig, Canary, and Service pipelines with 20+ rollback alarms and ECS circuit breakers; drove region expansion and 10+ model launches (Claude Sonnets, Haikus, Opus, Amazon Nova family) resolving GMDS fallback, health check, and AZ blockers

Software Development Engineer II · Amazon — Q Business (Jul 2024 — Mar 2025)

Technologies: Python, CloudWatch, S3, Kendra, Lambda, CDK

Accuracy Scorecard Framework

Built Phase 2 evaluation pipeline across 5 ISV connectors (SharePoint, WebCrawler, Confluence, GoogleDrive, S3) and 5 document types (HTML, PDF, DOCX, PPTX, CSV) measuring Correctness (LLM-based), CitScore (citation validation), and Retrieval metrics (Recall/Precision @1/5/10) with automated CloudWatch emission and QuickSight dashboards
Implemented GoogleDrive test corpus (115K Finance FIQA documents, 300 queries) achieving 99.3% retrieval recall@1; built SharePoint multi-document test sites with M365 token auth; created WebCrawler Amplify CDK infrastructure with HTML conversion scripts for multi-modal content (tabular, images, audio); achieved 60% reduction in manual testing via automated canary infrastructure
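
As a reference for the retrieval metrics named above, here is a minimal Python sketch of Precision@k and Recall@k using their standard definitions. It is not the evaluation pipeline's code; the function names and the sample document IDs are hypothetical, and retrieved IDs are assumed to be unique.

```python
def precision_at_k(retrieved, relevant, k):
    """Fraction of the top-k retrieved document IDs that are relevant."""
    if k <= 0:
        raise ValueError("k must be positive")
    top_k = retrieved[:k]
    hits = sum(1 for doc_id in top_k if doc_id in relevant)
    return hits / k


def recall_at_k(retrieved, relevant, k):
    """Fraction of the relevant document IDs that appear in the top-k results."""
    if k <= 0:
        raise ValueError("k must be positive")
    if not relevant:
        return 0.0
    top_k = retrieved[:k]
    hits = sum(1 for doc_id in set(top_k) if doc_id in relevant)
    return hits / len(relevant)


# Hypothetical ranked results for one query, plus its ground-truth set.
retrieved = ["d3", "d1", "d7", "d2", "d9"]
relevant = {"d1", "d2", "d4"}

for k in (1, 5, 10):
    p = precision_at_k(retrieved, relevant, k)
    r = recall_at_k(retrieved, relevant, k)
    print(f"k={k}: precision={p:.3f} recall={r:.3f}")
```

Note that Precision@k divides by k even when fewer than k results are returned, which is one common convention; some evaluations divide by the number of results actually retrieved instead.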

Data Ingestion Pipeline

Implemented integration between ProductivityIngestionService and PreProcessor service handling UPSERT workflows, GetResource, and delete events; built IngestionNotificationLambda closing failure-scenario gaps and refactoring EventIngestionQueue model
Implemented dynamic attribute mapping for Kendra index ingestion via Q Business BatchPutDocument, simplifying complex data transformations for downstream indexing

Software Development Engineer · Amazon — AWS AppFabric (Nov 2021 — Jun 2024)

Technologies: React, Java, TypeScript, Cloudscape, IAM, OAuth

Console Development & Launch

Led end-to-end development of AWS AppFabric Console from greenfield project, contributing to 75%+ of Console ORR tasks including pipeline setup, Cloudscape design system integration, localization, analytics, and security
Implemented operational dashboards (client-side and server-side), Content Security Policy violation testing using Puppeteer, and CSAT/Aperture feedback integration
Reduced console deep canary test runtime from 40 minutes to under 4 minutes while maintaining full test coverage
Proposed and delivered console analytics strategy using Panorama for user behavior analysis, accessibility testing strategy, and Lighthouse security audits

PostureHub API Design & Development

Led API design and development for PostureHub across FrontEndService and Core Observability teams, implementing 6 APIs: GetAppConfiguration, GetFinding, GetPosturePolicy, ListAppEntities, ListFindings, ListPosturePolicies
Proposed and implemented ARN structure for PostureHub resources ensuring scalability and maintainability

AppServer & Third-Party Integrations

Co-delivered AppServer before re:Invent maintaining 90%+ test coverage; built Fabric OAuth system ensuring message integrity for Asana, Slack, and G-Suite integrations
Redesigned Salesforce audit log strategy to handle both hourly and daily log intervals, implementing a kill switch for large EventLogFile sizes and an event-level pagination design
Designed and implemented ListSheets API and Smartsheet strategy end-to-end, simplifying normalized schema and guiding performance testing
Created automated weekly data extraction ETL Lambda pipeline from DynamoDB (Asana/Slack) to CSV for Business Intelligence team

IAM & Security

Developed and released AWS AppFabric managed IAM policies (FullAccess & ReadOnly) end-to-end, coordinating cross-team reviews with AppSec, IAM, KMS, and documentation teams through MCM release process
Onboarded Taj service for continuous API security testing, implementing automated security scanning workflows in CI/CD pipelines and production canaries
Participated as Guardian for threat model evaluations across EndUserPush, PostureHub, and Transmission services

Cross-Team Leadership & Operations

Mentored junior engineers, interns, and persistent team members; hosted office hours, conducted code reviews, and created comprehensive onboarding documentation
Led JDK 8/11 to JDK 17 migration initiative within AppFabric, driving timeline discussions during Ops meetings

Software Development Engineer · Amazon — Chime & WorkDocs (May 2021 — Nov 2021)

Technologies: React, Redux, Angular, Cloudscape

Designed and implemented Partner Tag Customization feature for Amazon Chime, replacing <EXTERNAL> with customizable <PARTNER> tags for trusted subsidiaries, supporting multiple partners per conversation with feature flag rollback capability
Proposed and demoed 3 rendering approaches; implemented scalable solution using UCBuzzExpress SDK supporting Web and Desktop clients with multi-language translation support
Led urgent migration of AWS WorkDocs Console from Angular to Cloudscape within 3-month deadline, driven by a critical security vulnerability; set up React testing framework with Babel configuration
Established greenfield project foundation including Polaris integration, internationalization support, and unit testing infrastructure

Full Stack Engineer · Learning Equality (Feb 2021 — May 2021)

Technologies: Django, VueJS, Python, SQL

Developed features for Kolibri, an open-source educational platform serving millions of learners worldwide, using Django, SQL, and VueJS
Led cross-functional project delivery coordinating frontend and backend milestones; optimized database queries and API endpoints improving system performance

Graduate Research Assistant · Old Dominion University (May 2019 — May 2020)

Technologies: Python, PHP, OpenCV, TensorFlow

Built quiz application used by hundreds of students for English proficiency evaluation (HTML, CSS, JavaScript, PHP, MySQL)
Developed parking space detection prototype using OpenCV, Keras/TensorFlow with 90% accuracy
Improved real-time client portal UI efficiency by 70% using MVC architecture

Projects

Alpaca Trading Bot

Automated portfolio rebalancing bot using FastAPI, Next.js, and Alpaca broker API. Fetches stock rankings, compares week-over-week data, and executes trades across S&P 400/500/600 and NASDAQ 100 indices with configurable stock count and slack parameters.

Stack: Python, FastAPI, Next.js, Alpaca API, Docker, GitHub Actions

Source on GitHub: https://github.com/apurva-modi/alpaca-trading-bot

Live demo: https://apurva-modi-trading-bot.vercel.app/

Git-Ollama-Commit

Ollama-powered Git commit message generator that automates meaningful commit messages from staged changes. Supports bash and zsh. 5 stars on GitHub.

Stack: Shell, Ollama, AI, Git

Source on GitHub: https://github.com/apurva-modi/git-ollama-commit

Build Your Own Claude Code (Rust)

LLM-powered coding assistant clone of Claude Code built from scratch in Rust as part of CodeCrafters challenge. Implements OpenAI-compatible tool calling, agent loop, and multi-tool integration using HTTP RESTful APIs.

Stack: Rust, LLM, Tool Calling, Agent Loop, REST APIs

Source on GitHub: https://github.com/apurva-modi/codecrafters-claude-code-rust

Kolibri (Open Source Contributor)

Contributed to Kolibri, the offline app for universal education serving millions of learners worldwide. Built features using Django and VueJS.

Stack: Django, VueJS, Python, Open Source

Source on GitHub: https://github.com/apurva-modi/kolibri

Kaggle Flower Classification

Achieved 96.3% accuracy using CNN-EfficientNet with transfer learning on TPU, ranking in top 34% globally in Kaggle competition.

Stack: Python, TensorFlow, TPU, Transfer Learning

Source on GitHub: https://github.com/apurva-modi/Flower-classification

Netflix Clone SPA

React-based frontend resembling Netflix with posters from TMDB API and trailer playback using YouTube search integration.

Stack: React, TMDB API, YouTube API

Source on GitHub: https://github.com/apurva-modi/netflix-clone

Apurva Blog

React-based personal blogging website for publishing technical articles and insights.

Stack: React, JavaScript

Source on GitHub: https://github.com/apurva-modi/apurva-blog

Aadhaar Paperless Electoral System

Biometric-verified digital voting system using Aadhaar (India's national ID) with Spring Boot backend and MATLAB fingerprint verification.

Stack: Java, Spring Boot, MATLAB

Source on GitHub: https://github.com/apurva-modi/Aadhaar-Based-Paperless-Electoral-System

WSPT Quiz Web Application

Full-stack quiz app with student and admin views, used by university faculties for English proficiency evaluation. 4 stars on GitHub.

Stack: PHP, MySQL, Bootstrap, AJAX

Source on GitHub: https://github.com/apurva-modi/WSPT-quiz-webApp

COVID-19 Case Tracker

Real-time COVID-19 tracker using React with Material-UI and Chart.js, fetching live data from COVID-19 API.

Stack: React, Material-UI, Chart.js

Source on GitHub: https://github.com/apurva-modi/coronacase-tracker

NLP in TensorFlow

Solutions for Natural Language Processing in TensorFlow specialization by deeplearning.ai — sentiment analysis, text generation, and sequence models.

Stack: Python, TensorFlow, NLP

Source on GitHub: https://github.com/apurva-modi/NLP-tensorflow

Blockchain in JavaScript

Basic blockchain from scratch demonstrating proof-of-work, block chaining, and transaction validation concepts.

Stack: JavaScript, Blockchain

Source on GitHub: https://github.com/apurva-modi/Blockchain-in-JS

Education

Master of Science in Computer Science · Old Dominion University (2018 — 2020)

Winner: HackU2019 Hackathon
Virginia Datathon — Certificate of Appreciation

Bachelor of Engineering in Information Technology · Visvesvaraya Technological University (2014 — 2018)

Best Outstanding Performer Award
National Level IT Fiesta'17
TCS Tech Bytes

Skills

Languages: Rust, Python, Java, JavaScript, TypeScript, HTML/CSS, PHP, SQL

Frameworks: React, Next.js, Node.js, Django, FastAPI, Spring Boot, Express, Keras, TensorFlow, Tailwind CSS

Cloud & AWS: ECS, Lambda, S3, SQS, DynamoDB, CloudFormation, CDK, IAM, KMS, Kinesis, AppConfig, ALB, CloudWatch, Bedrock, Q Business, AppFabric

Tools & Concepts: Git, Docker, REST APIs, Microservices, OAuth, CI/CD, A/B Testing, Distributed Systems, Leader Election, Tokio Async Runtime, MCP Servers, AI Tooling, Ollama, Vercel

Contact

Email: modiapurva03@gmail.com

LinkedIn: https://www.linkedin.com/in/apurva-modi/

GitHub: https://github.com/apurva-modi/
