Real-time Computer Vision for CCTV (102 countries)
Context: GoodVision needed real-time video analysis across multiple locations, serving customers in 102 countries.
Problem: Processing 1000+ camera streams with sub-second latency, scaling to handle peak loads, maintaining 99.9% uptime.
What I did: Designed edge processing architecture running on NVIDIA Jetsons, built model serving infrastructure achieving 99%+ detection accuracy, implemented MLOps pipeline for continuous deployment, optimized for GPU economics and edge deployment.
Result: 99%+ accuracy at sub-second latency, 10M+ daily requests handled reliably, cost per stream reduced by 40%.
Stack: AWS, AWS IoT, Docker, NVIDIA Jetson, Jetpack, PyTorch, TensorRT
Cost & performance architecture for GPU workloads
Context: ML workload requiring significant GPU compute with cost and latency constraints.
Problem: Balancing GPU costs, latency requirements, and scalability for variable workloads.
What I did: Architected hybrid cloud solution (on-demand + spot instances), implemented auto-scaling, optimized model inference, established cost monitoring and alerting.
Result: 90% cost reduction while maintaining latency SLAs, automated scaling handled 10x traffic spikes.
Stack: AWS EC2, ECS, CloudWatch, custom cost optimization
Engineering team scaling (0 → 20)
Context: Needed to scale engineering from founding team to support growth across 102 countries.
Problem: Hiring quality engineers, establishing technical culture, building processes for distributed team, maintaining delivery speed.
What I did: Built hiring process and technical interviews, established architecture principles, implemented CI/CD and code review practices, created onboarding system, set up cross-functional collaboration.
Result: Team grew from 0 to 20 engineers across time zones, delivery velocity increased 3x, technical debt managed systematically.
Stack: Hiring processes, technical culture, architecture governance, CI/CD, cross-functional processes