AWS Glue Interactive Sessions Now Support Spark Connect for Remote Interactive Workloads

AWS has announced that AWS Glue Interactive Sessions now support Apache Spark Connect. This integration enables developers to build and execute Apache Spark applications from their preferred local or managed environments, such as notebooks in Amazon SageMaker Unified Studio, without requiring a local Spark installation. By decoupling the client application execution from the Spark driver, Spark Connect provides a lightweight and highly portable client-side development experience. The shift to a thin-client architecture means that development environments no longer need to match the complex dependencies, libraries, and runtime versions of the remote Spark cluster. Instead, client-side environments generate DataFrame operations via the Spark Connect protocol, which are then transmitted to and executed directly on AWS Glue serverless Spark engines. This reduces overhead, accelerates initialization times, and ensures consistency between development and production runtimes. This feature is particularly beneficial for teams using Amazon SageMaker Unified Studio or other custom notebook environments, as it streamlines the integration of collaborative data science workflows with AWS Glue scalable data processing capabilities. Engineers can transition from exploratory data analysis to production-scale pipeline execution with minimal configuration changes.
Related tools
Recommended tools for this topic
These picks prioritize high-intent tools relevant to this topic. Some links may include partner or affiliate tracking.
High-value hosting and deployment path for frontend and cloud readers.
View VercelStrong cloud alternative for startups and developer-led infrastructure decisions.
View DigitalOceanA strong security and edge platform match across CDN, Zero Trust, and app protection.
View CloudflareComparison
| Aspect | Before / Alternative | After / This |
|---|---|---|
| Architecture | Thick client requiring a local Spark runtime and environment setup | Thin client decoupled from the Spark driver using a lightweight protocol |
| Dependency Matching | Local client dependencies must strictly match the remote Glue cluster | Only the Spark Connect client library is needed, regardless of remote runtime |
| Notebook Integration | Complex kernel configuration required for third-party notebook environments | Seamless integration via standard Spark Connect APIs in SageMaker Unified Studio |
Action Checklist
- Install the Spark Connect client library in your local or SageMaker notebook environment Ensure the client version aligns with the Spark Connect protocol specifications
- Configure the connection string pointing to your AWS Glue Interactive Session endpoint Verify that IAM roles grant the necessary permissions for AWS Glue session access
- Initialize your SparkSession using the remote Spark Connect URI Replace standard SparkSession builders with the remote connect configuration
- Run interactive test queries to verify execution on the serverless Glue engine Monitor AWS Glue console to verify session state and active driver connections
Source: AWS What's New
This page summarizes the original source. Check the source for full details.


