Questions tagged with AWS Glue

Glue job failing with "Unknown time-zone ID: GMT000" error

Hi, we have a Glue job with these details: version 4, worker_type G.4X, with 20 workers. It runs a Python script. When executing, it fails with the error below:...

Accepted Answer

AWS Glue

Sharif

asked 18 days ago

Issues Connecting AWS SageMaker Glue Notebook to Redshift Serverless

I am trying to connect my AWS Glue notebook in SageMaker Studio to Redshift Serverless, but I keep encountering a connection timeout error. The network mode is: Public internet access. To this mode, I...

Amazon SageMaker, AWS Glue, Amazon Redshift, Amazon SageMaker Studio, Amazon Redshift Serverless

195 views

Harshdeep

asked 19 days ago

Refresh Redshift materialized view using a Glue Job script

I'm trying to refresh a materialized view with a Glue job, connecting to a Redshift cluster using boto3 and authenticating with a database username. The execution times out with no errors in CloudWatch. I'm...

AWS Glue, Amazon Redshift

176 views

NLopeDeBarrios

asked 20 days ago

Unable to crawl S3 folders using Glue Crawler

Hi, I am facing **ERROR : Internal Service Exception** while trying to crawl an S3 bucket folder using the Glue crawler. The crawler target is the Glue catalog tables. Earlier it worked for one crawler...

AWS Glue

ravi_tb

asked 20 days ago

Delta table column mapping support in Glue/Athena

I'm confused by the AWS documentation regarding compatibility with Delta tables. We need to delete a column, which relies on the "column mapping" feature supported in delta-lake 1.2.0, and we do it through Spark...

Amazon Athena, AWS Glue

231 views

Sergii

asked 23 days ago

Copy files from one S3 bucket to another S3 bucket

Hi, I am creating a Glue job to copy files from a source S3 bucket to a target S3 bucket. The source bucket and the Glue job are in the same AWS account, but the target bucket is in a different account. 1. I can read the file...

AWS Glue

267 views

Bharath

asked 24 days ago

AWS Glue Version 4 Jupyter Notebook - how to update to the latest boto3?

I am using a Glue version 4 notebook in Glue Studio. I also tried the script version in the console. Neither recognizes the Lake Formation hybrid opt-in APIs; both throw the error below. "AttributeError:...

Accepted Answer

AWS Glue

223 views

aarts

asked 25 days ago

How to Preserve Directory Structure in PySpark ETL Job on AWS Glue

I have been implementing a small ETL job using PySpark. **Once it is ready, I plan to deploy it to AWS Glue and use an S3 bucket to read and write my files instead of local files.** This ETL...

Amazon Simple Storage Service, AWS Glue, AWS Batch, Extract Transform & Load Data

308 views

Pankesh Patel

asked 25 days ago

Duplicate entries in target Glue Data Catalog table using ETL

I am using an AWS Glue ETL job that fetches data from MongoDB and writes it to an AWS Glue catalog table, but every time the job runs it creates duplicate entries. (If there are 1000...

Amazon Athena, Amazon QuickSight, AWS Glue, Extract Transform & Load Data

353 views

Deepak Puri

asked a month ago

Efficiently Migrating and Archiving Data from RDS Postgres to S3 in Parquet Format

Hello AWS Community, I am currently storing event logs in an RDS Postgres database and am looking for an efficient way to manage the growing size of our tables. Here's what I am aiming to...

Accepted Answer

Amazon Athena, PostgreSQL, AWS Glue

404 views

Alan

asked a month ago

How to manage AWS Glue jobs?

Say I have some data in an S3 bucket and an AWS Glue crawler job that reads it and creates a few AWS Glue catalog tables. I want to read the data in these tables and push it to some other database like...

AWS Glue, Amazon DynamoDB, Extract Transform & Load Data

644 views

clouduser

asked a month ago

Salesforce to Redshift Data Transfer

I have been trying to send data from Salesforce to Redshift using AppFlow. Every time I set up the flow, I get the error 'Connector timed out'. I have tried both serverless and cluster. I am...

Serverless, AWS Glue, Amazon Connect, Extract Transform & Load Data, Amazon Redshift

675 views

Dhananjay

asked a month ago
