cloud Priority 4/5 6/18/2026, 11:05:15 AM

AWS Glue Interactive Sessions Now Support Spark Connect for Remote Interactive Workloads

AWS has announced that AWS Glue Interactive Sessions now support Apache Spark Connect. Developers can build and run Apache Spark applications from their preferred local or managed environments, such as notebooks in Amazon SageMaker Unified Studio, without a local Spark installation. Because Spark Connect decouples the client application from the Spark driver, the client side stays lightweight and portable.

With this thin-client architecture, the development environment no longer needs to match the dependencies, libraries, and runtime versions of the remote Spark cluster. Instead, the client describes DataFrame operations and sends them over the Spark Connect protocol to AWS Glue's serverless Spark engines, which execute them. This reduces setup overhead, speeds up initialization, and keeps development and production on the same runtime.

The feature is particularly useful for teams working in Amazon SageMaker Unified Studio or other custom notebook environments, because it connects collaborative data science workflows to AWS Glue's scalable data processing. Engineers can move from exploratory data analysis to production-scale pipeline execution with minimal configuration changes.
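
The thin-client model described above can be illustrated with a minimal PySpark sketch. It assumes a PySpark release with Spark Connect support (3.4 or later) and uses the standard `SparkSession.builder.remote(...)` entry point. The environment variable `SPARK_CONNECT_URI` and the helper names are hypothetical, and the way AWS Glue exposes its Spark Connect endpoint is not described in this summary.

```python
import os


def remote_session(connect_uri):
    """Open a SparkSession whose driver runs on the remote Spark Connect server."""
    # Imported lazily: the client needs only the PySpark Connect client
    # libraries, not a local Spark runtime.
    from pyspark.sql import SparkSession

    return SparkSession.builder.remote(connect_uri).getOrCreate()


def count_even_ids(spark, n):
    """Describe a small DataFrame computation locally and run it remotely."""
    # These calls only build a description of the computation on the client.
    evens = spark.range(n).filter("id % 2 = 0")
    # count() is an action: the plan is sent to the server, which executes it
    # and returns just the resulting number.
    return evens.count()


if __name__ == "__main__":
    # SPARK_CONNECT_URI is a hypothetical variable name used for this sketch.
    uri = os.environ.get("SPARK_CONNECT_URI")
    if uri:
        spark = remote_session(uri)
        print(count_even_ids(spark, 1_000_000))
        spark.stop()
    else:
        print("Set SPARK_CONNECT_URI to run against a remote session.")
```

The DataFrame code in `count_even_ids` is ordinary PySpark; only the way the session is created differs from a local setup.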

#aws #cloud #official

Comparison

| Aspect | Before / Alternative | After / This |
| --- | --- | --- |
| Architecture | Thick client requiring a local Spark runtime and environment setup | Thin client decoupled from the Spark driver using a lightweight protocol |
| Dependency Matching | Local client dependencies must strictly match the remote Glue cluster | Only the Spark Connect client library is needed, regardless of remote runtime |
| Notebook Integration | Complex kernel configuration required for third-party notebook environments | Seamless integration via standard Spark Connect APIs in SageMaker Unified Studio |

Action Checklist

  1. Install the Spark Connect client library in your local or SageMaker notebook environment. Ensure the client version aligns with the Spark Connect protocol specifications.
  2. Configure the connection string pointing to your AWS Glue Interactive Session endpoint. Verify that IAM roles grant the necessary permissions for AWS Glue session access.
  3. Initialize your SparkSession using the remote Spark Connect URI. Replace standard SparkSession builders with the remote connect configuration.
  4. Run interactive test queries to verify execution on the serverless Glue engine. Monitor the AWS Glue console to verify session state and active driver connections.
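
The checklist above could be sketched in Python roughly as follows. This is an illustration under stated assumptions, not AWS's documented procedure: the client is assumed to be installed with `pip install "pyspark[connect]"`, the connection string follows the generic Spark Connect form `sc://host:port/;key=value`, and the environment variables `GLUE_CONNECT_HOST` and `GLUE_CONNECT_TOKEN`, the port 443, and the helper names are all hypothetical. How a Glue Interactive Session endpoint and its credentials are actually obtained is not covered in this summary.

```python
import importlib.util
import os
from urllib.parse import quote


def missing_client_packages(names=("pyspark", "grpc", "pandas", "pyarrow")):
    """Step 1: list client-side packages that cannot be imported."""
    # The default names are an assumption about what the PySpark Connect
    # client needs; adjust them for your client version.
    return [n for n in names if importlib.util.find_spec(n) is None]


def build_connect_uri(host, port=15002, *, use_ssl=True, token=None):
    """Step 2: assemble a Spark Connect connection string."""
    params = []
    if use_ssl:
        params.append("use_ssl=true")
    if token:
        # Percent-encode so ';' or '=' in a token cannot break the parameter list.
        params.append("token=" + quote(token, safe=""))
    uri = f"sc://{host}:{port}/"
    if params:
        uri += ";" + ";".join(params)
    return uri


if __name__ == "__main__":
    host = os.environ.get("GLUE_CONNECT_HOST")  # hypothetical variable name
    missing = missing_client_packages()
    if missing:
        print("Missing client packages:", missing)
    elif host is None:
        print("Set GLUE_CONNECT_HOST to run the remote smoke test.")
    else:
        from pyspark.sql import SparkSession

        # Port 443 and the token variable are placeholders for this sketch.
        uri = build_connect_uri(host, 443, token=os.environ.get("GLUE_CONNECT_TOKEN"))
        # Step 3: create the session from the remote URI instead of a local master.
        spark = SparkSession.builder.remote(uri).getOrCreate()
        # Step 4: a trivial query that can only succeed if the server executes it.
        rows = spark.sql("SELECT 1 AS ok").collect()
        print("remote Spark version:", spark.version, "| result:", rows[0]["ok"])
        spark.stop()
```

Reporting `spark.version` in the smoke test is a quick way to see which runtime the session actually landed on, which helps with the version-alignment check in step 1.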

Source: AWS What's New

This page summarizes the original source. Check the source for full details.
