The Customer Data Analytics Platform (CDAP) reference architecture is fully serverless. Your solution can be deployed with infrastructure as code and automation for fast iteration and consistent deployments. Use Amazon CloudWatch for application and Infrastructure monitoring.
Overview
How it works
This architecture helps you build modern customer data analytics pipelines and derive insights from the data you collect.
Step 1
Well-Architected Pillars
The architecture diagram above is an example of a Solution created with Well-Architected best practices in mind. To be fully Well-Architected, you should follow as many Well-Architected best practices as possible.
Operational Excellence
Security
Use Lake Formation for unified governance to centrally manage security, access control (at the table, row, column security level), and audit trails. It also enables automatic schema discovery and conversion to required formats. API Gateway enforces policies that control security aspects such as authentication, authorization, or traffic management.
Reliability
Serverless architecture enables the solution to be automatically scalable, available, and deployed across all Availability Zones.
Performance Efficiency
By using serverless technologies, you only provision the exact resources you need. To maximize the performance of the CDAP solution, test with multiple instance types. Use API Gateway Edge endpoints for geographically dispersed customers. Use Regional for regional customers (and when using other AWS services within the same Region).
Cost Optimization
By using serverless technologies and automatically scaling, you only pay for the resources you use. Serverless services don’t cost anything while they’re idle.
Sustainability
Minimize your environmental impact. Data lake uses processes to automatically move infrequently accessed data to cold storage with Amazon S3 Lifecycle configurations. By extensively using managed services and dynamic scaling, this architecture minimizes the environmental impact of the backend services.