Nobel Prize Data Lake — Medallion Architecture on AWS
A medallion-architecture data lake on AWS S3 with Prefect-orchestrated ETL Lambdas — raw API responses → bronze joins → silver analytics table.
A medallion-architecture data lake on AWS S3 with Prefect-orchestrated ETL Lambdas — raw API responses → bronze joins → silver analytics table.
A distributed ML pipeline processing 17M Amazon reviews with PySpark MLlib on AWS Glue — including S3 medallion storage, feature engineering, and model serialization for batch inference.
An event-driven serverless application on AWS: scheduled Lambda polling a third-party API, S3 event-triggered DynamoDB sync, and API Gateway endpoints for queries.