// experience

Two AWS teams,
three flagship features.

Amazon Lex and AWS Lambda. Three-plus years building production gen AI and distributed control planes.

[01]
amazon-web-services

Amazon Lex.

2024.06 →
present

Bot Analyzer.

LLM-driven configuration analysis for Amazon Lex chatbots, surfacing alignment with AWS best practices through public API and the Lex console. Designed and shipped across 11 AWS regions — end to end.

Benchmarked 12 LLMs on accuracy, latency, and cost. Built async pipeline with Step Functions, Bedrock, and DynamoDB. MapReduce architecture for parallel intent processing.

1000+
Analyses in 4 days
11
AWS regions
<60s
For 1k+ intent bots
12
LLMs benchmarked

Multi-modal Conversation.

SMS input capture during live voice (IVR) calls without interrupting the session. Extended bot channel APIs to integrate AWS End User Messaging with Amazon Connect.

Evaluated native channel vs. Lambda-based extensibility. Chose control-plane-driven design for backward-compatible multi-channel growth. Delivered Private Beta across 4 teams.

4
Teams coordinated
0
Call interruptions
Private Beta
Enterprise shipped
[02]
amazon-web-services

AWS Lambda.

2022.08 →
2024.06

Region Build Automation.

Led automation of Lambda EventBridge control plane region builds — on the critical path for every new Lambda region launch. Replaced a fully manual expansion process, eliminating human error from the build sequence entirely.

Redesigned orchestration model and CI/CD pipeline. Parallelised serial IA-gating tasks. Onboarded to automated region config and resource provisioning systems.

2.8×
Speedup
14 → 5
Days per build
0
Errors & manual steps

Control Plane Throttling.

Co-designed a multi-dimensional token bucket throttling system extending from account-level to Org ID and caller principal. Prevents cross-tenant traffic spikes from triggering availability incidents across the Lambda fleet.

Per-host local token cache with async background sync to central data plane. Throttle decisions removed from the request hot path. Load tested at ~7× normal traffic.

<25ms
p99 overhead
7×
Load tested
50K+
Throttle cache keys
// stack

Tools of the trade.

Java Python AWS Step Functions Amazon Bedrock DynamoDB S3 CloudFormation CloudWatch IAM EventBridge
— Closing remarks Ed. 01 / 2026 · Seattle

Let's ship
something real.

Open for SDE II & Senior SDE roles

Role
SDE II / Senior SDE · AI infrastructure
Located
Seattle, WA · Pacific Time

— Han Cao