# Guidance for Deploying a Supply Chain Data Hub on AWS

## Overview

This Guidance demonstrates architectural options for building a supply chain operational data hub. The hub ingests data from thousands of disparate sources, including internal sources about planning and execution and external sources about shipment tracking. The hub then generates a single, harmonized view of data. Visibility into data from various enterprise and execution systems can be used for real-time planning around demand forecasts, inventory, and procurement. The data hub helps supply chain organizations make data-driven decisions that improve delivery times and increase customer satisfaction.

## How it works

This architecture shows options for ingesting data from supply chain planning, execution, and real-time shipment status providers as well as building cross-category scorecards using AWS services to create a supply chain data lake for analysts and planners.

[Download the architecture diagram](https://d1.awsstatic.com/solutions/guidance/architecture-diagrams/deploying-a-supply-chain-data-hub-on-aws.pdf)

![Architecture diagram](/images/solutions/deploying-a-supply-chain-data-hub-on-aws/images/deploying-a-supply-chain-data-hub-on-aws-1.png)

1. **Step 1**: Supply chain data is collected from multiple data sources across the enterprise, including enterprise resource planning (ERP) and customer relationship management (CRM) software-as-a service (SaaS) applications, manufacturing shop-floor edge devices, logs, streaming media, and social media.
1. **Step 2**: Based on the type of data source, AWS Database Migration Service (AWS DMS), AWS DataSync, Amazon Kinesis, Amazon Managed Streaming for Apache Kafka (Amazon MSK), AWS IoT Core, and Amazon AppFlow ingest data into the supply chain data lake hosted on AWS.
1. **Step 3**: AWS Data Exchange integrates third-party data that may be useful in predicting shipment estimated time of arrival (such as weather data) into the supply chain data lake.
1. **Step 4**: AWS Lake Formation helps with building the scalable supply chain data lake.
1. **Step 5**: Amazon Simple Storage Service (Amazon S3) is the foundation for supply chain data lake storage.
1. **Step 6**: AWS Glue extracts, transforms, catalogs, and ingests data across multiple data stores like ERP, planning, and shipment visibility systems.
1. **Step 7**: Amazon Athena is a serverless interactive query service that analyzes data in Amazon S3 using standard SQL.
1. **Step 8**: Amazon QuickSight provides dashboards that help planners analyze data about supply chain planning, execution, and real-time shipment status to make informed business decisions.
1. **Step 9**: Amazon Redshift, a cloud data warehouse, analyzes structured and semi-structured data.
1. **Step 10**: Amazon EMR provides the cloud big data platform for processing vast amounts of data using open source tools.
1. **Step 11**: Amazon SageMaker builds, trains, and deploys ML models, and AWS AI services add intelligence to supply chain applications.
1. **Step 12**: Amazon Neptune graph database optimizes network queries for speed and accuracy.
## Well-Architected Pillars

The architecture diagram above is an example of a Solution created with Well-Architected best practices in mind. To be fully Well-Architected, you should follow as many Well-Architected best practices as possible.

### Operational Excellence

This Guidance is deployed with infrastructure as code (IaC), a DevOps principle that helps you maintain infrastructure through repeatable and reliable processes. Supply chain stakeholders, including business, development, and operations teams, should align on an IaC strategy. [Read the Operational Excellence whitepaper](/wellarchitected/latest/operational-excellence-pillar/welcome.html)


### Security

DataSync uses cross-account access to delegate access to data and resources across different AWS accounts. QuickSight uses fine-grained access control to secure access to dashboards. [Read the Security whitepaper](/wellarchitected/latest/security-pillar/welcome.html)


### Reliability

Services such as Amazon S3, AWS Glue, DataSync, Athena, and QuickSight are highly available, allowing you to scale workloads based on demand. [Read the Reliability whitepaper](/wellarchitected/latest/reliability-pillar/welcome.html)


### Performance Efficiency

Serverless technologies in this architecture allow you to provision the exact resources needed at any given time. [Read the Performance Efficiency whitepaper](/wellarchitected/latest/performance-efficiency-pillar/welcome.html)


### Cost Optimization

Services in this architecture can automatically scale to meet demand, so you only pay for the resources consumed without under or over provisioning. [Read the Cost Optimization whitepaper](/wellarchitected/latest/cost-optimization-pillar/welcome.html)


### Sustainability

Services in this architecture are serverless and scalable, optimizing backend resource consumption to reduce environmental impact. [Read the Sustainability whitepaper](/wellarchitected/latest/sustainability-pillar/sustainability-pillar.html)


[Read usage guidelines](/solutions/guidance-disclaimers/)

