Welcome to emr-serverless-pyspark-uv-rap-template 🚀
This is the main documentation for the emr-serverless-pyspark-uv-rap-template project, a minimal yet
powerful template for deploying PySpark applications to AWS EMR Serverless.
This project demonstrates a modern Python development workflow using uv for packaging and a built-in CLI for seamless deployments.
🧭 Guides
Explore the guides to get started and learn about best practices for configuration and deployment.
- Infrastructure Setup with Terraform 🏗️: A step-by-step guide to provisioning your AWS infrastructure using Terraform.
- EMR Serverless Guide ⚙️: A complete guide to setting up the necessary IAM roles and AWS resources for EMR Serverless.
- Business Config Workflow 🗂️: Best practices for managing your application's configuration in a versioned and auditable way.
✨ Key Features
- Fast Dependency Management: Uses
uvfor near-instant dependency resolution and packaging. - Automated Deployments: A built-in
deploy-to-emrCLI to build, package, and submit jobs. - Immutable Artifacts: Creates versioned, immutable releases in S3 for full auditability.
- Apache Iceberg Ready: Includes a sample job that writes to an Apache Iceberg table using the AWS Glue Catalog.