Software Development Engineer · Amazon Web Services

Scalable services.
Generative AI. Infrastructure.

Three-plus years at AWS — Amazon Lex and Lambda. Features owned end-to-end, design through multi-region launch. Built for reliability and operational efficiency.

Open for SDE II & Senior SDE roles
Amazon Lex · Jun 2024 — Present
Shipped AI
at AWS Scale

Built Bot Analyzer — an LLM-powered configuration analysis feature on Amazon Bedrock and Step Functions, live across eleven AWS regions. Benchmarked twelve LLMs for accuracy, latency, and cost. Also designed Multi-modal Conversation for SMS capture during live voice calls.

Generative AI · Amazon Bedrock · Java · Python
AWS Lambda · Aug 2022 — Jun 2024
Infrastructure
at Zero Tolerance

Led region build automation cutting cycle time from fourteen to five days. Co-designed a multi-dimensional token-bucket throttling system — load-tested at 7× peak production traffic — protecting millions of Lambda users from cross-tenant spikes.

Distributed Systems · Reliability · Java · CloudFormation
Ed. 01 / 2026
'26
Seattle / PST

"I ship the thing.
Measure the thing.
Improve the thing."

— operating principle

// what I built

Four production features, two AWS teams.

01
Bot Analyzer

LLM-driven configuration analysis for Amazon Lex chatbots, surfacing alignment with AWS best practices through public API and the Lex console. End-to-end across 11 regions; 12 LLMs benchmarked for accuracy, latency, and cost.

amazon-lex · 2024 — GA
02
Multi-modal Conversation

SMS input capture during live IVR voice calls without interrupting the session. Extended bot channel APIs to integrate End User Messaging with Amazon Connect. Delivered as a private beta across four enterprise teams.

amazon-lex · 2024 — Private Beta
03
Region Build Automation

Led automation of Lambda's EventBridge control-plane region builds — replacing a fully manual sequence with an orchestrated CI/CD pipeline. Cycle time from 14 days to 5; zero manual steps and zero errors.

aws-lambda · 2023 — 2.8× speedup
04
Control-Plane Throttling

Co-designed a multi-dimensional token-bucket throttling system protecting the Lambda fleet from cross-tenant traffic spikes. Per-host local cache; async sync to data plane. Load-tested at 7× normal traffic; p99 overhead under 25 ms.

aws-lambda · 2023 — Shipped
Explore
01 Work Amazon Lex · AWS Lambda 02 Patents Research & Publications 03 Skills Languages & AWS Stack 04 About me Background & Contact
— Closing remarks Ed. 01 / 2026 · Seattle

Let's ship
something real.

Open for SDE II & Senior SDE roles

Role
SDE II / Senior SDE · AI infrastructure
Located
Seattle, WA · Pacific Time

— Han Cao