Questions tagged with AWS Glue

Content language: English

Select up to 5 tags to filter
Sort by most recent

Browse through the questions and answers listed below or filter and sort to narrow down your results.

Hi, We have a glue job with these details: version 4, worker_type G.4X with 20 number of workers. It runs Python script. When executing, it fails with below error: :...
Accepted AnswerAWS Glue
2
answers
0
votes
91
views
Sharif
asked 18 days ago
I am trying to connect my AWS Glue notebook in Sagemaker Studio to Redshift Serverless, but I keep encountering a connection timeout error. The network mode is: Public internet access. To this mode, I...
1
answers
0
votes
195
views
asked 19 days ago
I'm trying to refresh a materialized view with a glue job, connecting to Redshift cluster using boto3 authenticating with a database username. The execution timeouts with no errors in CloudWatch. I'm...
2
answers
0
votes
176
views
profile picture
asked 20 days ago
Hi I am facing **ERROR : Internal Service Exception** while trying to crawl the S3 bucket folder using the Glue crawler. Carwler Target is the Glue catalog tables. Earlier it worked for one crawler...
1
answers
0
votes
98
views
ravi_tb
asked 20 days ago
I'm confused by AWS documentation regarding compatibility with delta tables. We need to delete a column that is the "column mapping" feature supported in delta-lake 1.2.0 and we do it through spark...
1
answers
0
votes
231
views
Sergii
asked 23 days ago
Hi, I am creating a Glue job to copy files from Source S3 to another target S3. The source S3 and Glue Job are in same AWS account. But the target bucket is different account. 1. I can read the file...
5
answers
0
votes
267
views
Bharath
asked 24 days ago
I am using Glue Version 4 notebook in Glue Studio. Also tried Script version in the console. All of them do not recognize Lake Formation hybrid opt-in APIs. Throws below error. "AttributeError:...
Accepted AnswerAWS Glue
2
answers
0
votes
223
views
AWS
aarts
asked 25 days ago
I have been implementing a small ETL job using Pyspark. **I plan to deploy it to AWS Glue and will use an S3 bucket. to read and write my files instead of local file, once it is ready.** This ETL...
0
answers
1
votes
308
views
profile picture
asked 25 days ago
I am using AWS GLUE ETL job that is fetching data from Mongo DB and putting it to AWS Glue catalog table but the issue is everytime the job runs it is creating the duplicate entries.(If there are 1000...
2
answers
0
votes
353
views
asked a month ago
Hello AWS Community, I am currently storing event logs in an RDS Postgres database and am looking for an efficient way to manage the growing size of our tables. Here's what I am aiming to...
1
answers
0
votes
404
views
Alan
asked a month ago
say, i have some data in s3 bucket and an aws glue crawler job that reads and creates few aws glue catalog tables. I want to read the data in these tables and push it to some other database like...
1
answers
0
votes
644
views
asked a month ago
I have been trying to send data from Salesforce to Redshift using App Flow. Every time when I setup the flow I am getting an error 'Connector timed out'. I have tried both serverless and cluster. I am...
2
answers
0
votes
675
views
asked a month ago