CertHub - RSS Feed Reader

Feb 5, 2026

How Convera built fine-grained API authorization with Amazon Verified Permissions

by Santhosh Veeraraman

In this post, we share how Convera used Amazon Verified Permissions to build a fine-grained authorization model for their API platform.

Read article →

Feb 4, 2026

Mastering millisecond latency and millions of events: The event-driven architecture behind the Amazon Key Suite

by Ali Ufuk Yucel

In this post, we explore how the Amazon Key team used Amazon EventBridge to modernize their architecture, transforming a tightly coupled monolithic system into a resilient, event-driven solution. We explore the technical challenges we faced, our implementation approach, and the architectural patterns that helped us achieve improved reliability and scalability. The post covers our solutions for managing event schemas at scale, handling multiple service integrations efficiently, and building an extensible architecture that accommodates future growth.

Read article →

Jan 30, 2026

Sovereign failover – Design for digital sovereignty using the AWS European Sovereign Cloud

by Ivo Kammerath

This post explores the architectural patterns, challenges, and best practices for building cross-partition failover, covering network connectivity, authentication, and governance. By understanding these constraints, you can design resilient cloud-native applications that balance regulatory compliance with operational continuity.

Read article →

Jan 30, 2026

Announcing the AWS Digital Sovereignty Well-Architected Lens

by Swapnonil Mukherjee

As organizations accelerate cloud adoption, meeting digital sovereignty requirements has become essential to build trust with customers and regulators worldwide. The challenge isn’t whether to adopt the cloud—it’s how to do so while meeting sovereignty requirements, using a multidisciplinary approach. Even though requirements vary by geography, organizations commonly address them through technical and operational controls […]

Read article →

Jan 29, 2026

How Artera enhances prostate cancer diagnostics using AWS

by Hariharan Ananthakrishnan

In this post, we explore how Artera used Amazon Web Services (AWS) to develop and scale their AI-powered prostate cancer test, accelerating time to results and enabling personalized treatment recommendations for patients.

Read article →

Jan 12, 2026

How Salesforce migrated from Cluster Autoscaler to Karpenter across their fleet of 1,000 EKS clusters

by Sana Jawad

This blog post examines how Salesforce, operating one of the world's largest Kubernetes deployments, successfully migrated from Cluster Autoscaler to Karpenter across their fleet of 1,000 plus Amazon Elastic Kubernetes Service (Amazon EKS) clusters.

Read article →

Dec 11, 2025

Architecting conversational observability for cloud applications

by Anton Aleksandrov

In this post, we walk through building a generative AI–powered troubleshooting assistant for Kubernetes. The goal is to give engineers a faster, self-service way to diagnose and resolve cluster issues, cut down Mean Time to Recovery (MTTR), and reduce the cycles experts spend finding the root cause of issues in complex distributed systems.

Read article →

Dec 10, 2025

How BASF’s Agriculture Solutions drives traceability and climate action by tokenizing cotton value chains using Amazon Managed Blockchain

by Kevin S. Ridolfi

BASF Agricultural Solutions combines innovative products and digital tools with practical farmer knowledge. This post explores how Amazon Managed Blockchain can drive a positive change in the agricultural industry by tokenizing food and cotton value chains for traceability, climate action, and circularity.

Read article →

Dec 8, 2025

She architects: Bringing unique perspectives to innovative solutions at AWS

by Kayalvizhi Kandasamy

Have you ever wondered what it is really like to be a woman in tech at one of the world's leading cloud companies? Or maybe you are curious about how diverse perspectives drive innovation beyond the buzzwords? Today, we are providing an insider's perspective on the role of a solutions architect (SA) at Amazon Web Services (AWS). However, this is not a typical corporate success story. We are three women who have navigated challenges, celebrated wins, and found our unique paths in the world of cloud architecture, and we want to share our real stories with you.

Read article →

Nov 26, 2025

Secure Amazon Elastic VMware Service (Amazon EVS) with AWS Network Firewall

by Sheng Chen

In this post, we demonstrate how to utilize AWS Network Firewall to secure an Amazon EVS environment, using a centralized inspection architecture across an EVS cluster, VPCs, on-premises data centers and the internet. We walk through the implementation steps to deploy this architecture using AWS Network Firewall and AWS Transit Gateway.

Read article →

Nov 19, 2025

Building an AI gateway to Amazon Bedrock with Amazon API Gateway

by Thomas Natschläger

In this post, we'll explore a reference architecture that helps enterprises govern their Amazon Bedrock implementations using Amazon API Gateway. This pattern enables key capabilities like authorization controls, usage quotas, and real-time response streaming. We'll examine the architecture, provide deployment steps, and discuss potential enhancements to help you implement AI governance at scale.

Read article →

Nov 19, 2025

Announcing the updated AWS Well-Architected Machine Learning Lens

by Steven DeVries

We are excited to announce the updated AWS Well-Architected Machine Learning Lens, now enhanced with the latest capabilities and best practices for building machine learning (ML) workloads on AWS.

Read article →

Nov 19, 2025

Architecting for AI excellence: AWS launches three Well-Architected Lenses at re:Invent 2025

by Anitha Selvan

At re:Invent 2025, we introduce one new lens and two significant updates to the AWS Well-Architected Lenses specifically focused on AI workloads: the Responsible AI Lens, the Machine Learning (ML) Lens, and the Generative AI Lens. Together, these lenses provide comprehensive guidance for organizations at different stages of their AI journey, whether you're just starting to experiment with machine learning or already deploying complex AI applications at scale.

Read article →

Nov 19, 2025

Announcing the updated AWS Well-Architected Generative AI Lens

by Dan Ferguson

We are delighted to announce an update to the AWS Well-Architected Generative AI Lens. This update features several new sections of the Well-Architected Generative AI Lens, including new best practices, advanced scenario guidance, and improved preambles on responsible AI, data architecture, and agentic workflows.

Read article →

Nov 18, 2025

Build priority-based message processing with Amazon MQ and AWS App Runner

by Aritra Nag

In this post, we show you how to build a priority-based message processing system using Amazon MQ for priority queuing, Amazon DynamoDB for data persistence, and AWS App Runner for serverless compute. We demonstrate how to implement application-level delays that high-priority messages can bypass, create real-time UIs with WebSocket connections, and configure dual-layer retry mechanisms for maximum reliability.

Read article →

Nov 14, 2025

Know before you go – AWS re:Invent 2025 guide to Well-Architected and Cloud Optimization sessions

by Anitha Selvan

Are you ready to maximize your Well-Architected and Cloud Optimization learning and networking time at re:Invent 2025? We have put together this comprehensive guide to help you plan your schedule and make the most of the Well-Architected and cloud optimization sessions available this year. These sessions will deliver the practical guidance your teams need to lead strategic cloud initiatives, design next-generation architectures, optimize costs, or secure AI-powered systems.

Read article →

Jul 25, 2025

How HashiCorp made cross-Region switchover seamless with Amazon Application Recovery Controller

by Dmitriy Novikov

In this post, we discuss HashiCorp’s journey from manual, stress-inducing failover procedures to a streamlined, confident approach that fundamentally changed how they deliver on their enterprise-grade resilience promises.

Read article →

Jun 2, 2025

Analyze media content using AWS AI services

by Jack Bradham

Organizations managing large audio and video archives face significant challenges in extracting value from their media content. Consider a radio network with thousands of broadcast hours across multiple stations and the challenges they face to efficiently verify ad placements, identify interview segments, and analyze programming patterns. In this post, we demonstrate how you can automatically transform unstructured media files into searchable, analyzable content.

Read article →

Aug 1, 2025

Maximizing Business Value Through Strategic Cloud Optimization

by Ryan Dsouza

As cloud spending continues to surge, organizations must focus on strategic cloud optimization to maximize business value. This blog post explores key insights from MIT Technology Review's publication on cloud optimization, highlighting the importance of viewing optimization as a continuous process that encompasses all six AWS Well-Architected pillars.

Read article →

Jun 26, 2025

Simplifying sustainability reporting using AWS and generative AI in banking

by Sachin Kulkarni

In this post, you learn how you can use generative AI services on Amazon Web Services (AWS) to automate your sustainability reporting requirements, reduce manual effort, and improve accuracy. You do this by implementing an automated solution for extracting, processing, and validating data from corporate reports.

Read article →

Jun 23, 2025

Amazon Bedrock baseline architecture in an AWS landing zone

by Abdel-Rahman Awad

In this post, we explore the Amazon Bedrock baseline architecture and how you can secure and control network access to your various Amazon Bedrock capabilities within AWS network services and tools. We discuss key design considerations, such as using Amazon VPC Lattice auth policies, Amazon Virtual Private Cloud (Amazon VPC) endpoints, and AWS Identity and Access Management (IAM) to restrict and monitor access to your Amazon Bedrock capabilities.

Read article →

Jun 12, 2025

How Stellantis streamlines floating license management with serverless orchestration on AWS

by Göksel SARIKAYA

In this post, we explore a unique scenario where an ISV, unable to provide a floating license option for cloud usage, worked with Stellantis to develop an alternative solution. This approach, implemented with the ISV’s permission, treats named user licenses as if they were floating, automatically assigning and removing them based on the state of user workbench instances.

Read article →

Jul 18, 2025

Implement monitoring for Amazon EKS with managed services

by Aritra Nag

In this post, we show you how to implement comprehensive monitoring for Amazon Elastic Kubernetes Service (Amazon EKS) workloads using AWS managed services. This solution demonstrates building an EKS platform that combines flexible compute options with enterprise-grade observability using AWS native services and OpenTelemetry.

Read article →

Jun 12, 2025

Build a multi-Region AWS PrivateLink backed service with seamless failover

by Madhav Vishnubhatta

This post demonstrates how the Issuer Solutions business of Global Payments, as a service provider, implemented cross-Region failover for an AWS PrivateLink backed service exposed to their customers. Their solution enables failover to a secondary Region without customer coordination, reducing Recovery Time Objective (RTO).

Read article →

Jul 8, 2025

Migrate and modernize VMware workloads with AWS Transform for VMware

by Kiran Reid

AWS Transform for VMware is a service that tackles cloud migration challenges by significantly reducing manual effort and accelerating the migration of critical VMware workloads to AWS Cloud. In this post, we highlight its comprehensive capabilities, including streamlined discovery and assessment, intelligent network conversion, enhanced security and compliance, and orchestrated migration execution.

Read article →

Jul 14, 2025

How Scale to Win uses AWS WAF to block DDoS events

by Ben "Fuzzy" Shonaldmann

In this post, you'll learn how Scale to Win configured their network topology and AWS WAF to protect against DDoS events that reached peaks of over 2 million requests per second during the 2024 US presidential election campaign season. The post details how they implemented comprehensive DDoS protection by segmenting human and machine traffic, using tiered rate limits with CAPTCHA, and preventing CAPTCHA token reuse through AWS WAF Bot Control.

Read article →

Jul 25, 2025

How Zapier runs isolated tasks on AWS Lambda and upgrades functions at scale

by Anton Aleksandrov

In this post, you’ll learn how Zapier has built their serverless architecture focusing on three key aspects: using Lambda functions to build isolated Zaps, operating over a hundred thousand Lambda functions through Zapier's control plane infrastructure, and enhancing security posture while reducing maintenance efforts by introducing automated function upgrades and cleanup workflows into their platform architecture.

Read article →

Aug 14, 2025

Deploy LLMs on Amazon EKS using vLLM Deep Learning Containers

by Vishal Naik

In this post, we demonstrate how to deploy the DeepSeek-R1-Distill-Qwen-32B model using AWS DLCs for vLLMs on Amazon EKS, showcasing how these purpose-built containers simplify deployment of this powerful open source inference engine. This solution can help you solve the complex infrastructure challenges of deploying LLMs while maintaining performance and cost-efficiency.

Read article →

Aug 14, 2025

How Karrot built a feature platform on AWS, Part 2: Feature ingestion

by Hyeonho Kim

This two-part series shows how Karrot developed a new feature platform, which consists of three main components: feature serving, a stream ingestion pipeline, and a batch ingestion pipeline. This post covers the process of collecting features in real-time and batch ingestion into an online store, and the technical approaches for stable operation.

Read article →

Aug 14, 2025

How Karrot built a feature platform on AWS, Part 1: Motivation and feature serving

by Hyeonho Kim

This two-part series shows how Karrot developed a new feature platform, which consists of three main components: feature serving, a stream ingestion pipeline, and a batch ingestion pipeline. This post starts by presenting our motivation, our requirements, and the solution architecture, focusing on feature serving.

Read article →

Aug 21, 2025

Simplify multi-tenant encryption with a cost-conscious AWS KMS key strategy

by Itay Meller

In this post, we explore an efficient approach to managing encryption keys in a multi-tenant SaaS environment through centralization, addressing challenges like key proliferation, rising costs, and operational complexity across multiple AWS accounts and services. We demonstrate how implementing a centralized key management strategy using a single AWS KMS key per tenant can maintain security and compliance while reducing operational overhead as organizations scale.

Read article →

Aug 19, 2025

How CommBank made their CommSec trading platform highly available and operationally resilient

by Kris Severijns

In this post, we explore how CommSec, Australia's leading online broker, transitioned from a multicloud environment to AWS as their sole cloud provider while implementing Amazon Application Recovery Controller (ARC) zonal shift to maintain high availability and operational resilience. The consolidation resulted in significant benefits including 25% base capacity reduction, two times faster deployments, and improved failover capabilities through ARC zonal shift, enabling CommSec to continue serving millions of customers while meeting strict regulatory requirements.

Read article →

Sep 30, 2025

Build resilient generative AI agents

by Yiwen Zhang

Generative AI agents in production environments demand resilience strategies that go beyond traditional software patterns. AI agents make autonomous decisions, consume substantial computational resources, and interact with external systems in unpredictable ways. These characteristics create failure modes that conventional resilience approaches might not address. This post presents a framework for AI agent resilience risk analysis […]

Read article →

Oct 1, 2025

Modernization of real-time payment orchestration on AWS

by Neeraj Kaushik

The global real-time payments market is experiencing significant growth. According to Fortune Business Insights, the market was valued at USD 24.91 billion in 2024 and is projected to grow to USD 284.49 billion by 2032, with a CAGR of 35.4%. Similarly, Grand View Research reports that the global mobile payment market, valued at USD 88.50 […]

Read article →

Sep 22, 2025

A scalable, elastic database and search solution for 1B+ vectors built on LanceDB and Amazon S3

by Audra Devoto

In this post, we explore how Metagenomi built a scalable database and search solution for over 1 billion protein vectors using LanceDB and Amazon S3. The solution enables rapid enzyme discovery by transforming proteins into vector embeddings and implementing a serverless architecture that combines AWS Lambda, AWS Step Functions, and Amazon S3 for efficient nearest neighbor searches.

Read article →

Oct 22, 2025

BASF Digital Farming builds a STAC-based solution on Amazon EKS

by Kevin S. Ridolfi

This post was co-written with Frederic Haase and Julian Blau with BASF Digital Farming GmbH. At xarvio – BASF Digital Farming, our mission is to empower farmers around the world with cutting-edge digital agronomic decision-making tools. Central to this mission is our crop optimization platform, xarvio FIELD MANAGER, which delivers actionable insights through a range […]

Read article →