Questions tagged with Amazon EMR

Is there a way to use s3-dist-cp to copy files from a bucket that uses Requester Pays?
2 answers | 0 votes | 362 views | asked 7 months ago
After upgrading from EMR 6.11 to 6.12 (I even tried 7.0.0), I'm seeing these errors on the exact same job with the same resources. Has something changed in how EMRFS is implemented? What is...
1 answer | 0 votes | 1401 views | Dev | asked 7 months ago
Good morning. A vulnerability in Resource Manager was recently exploited, so we are worried and want to confirm the impact with you....
2 answers | 0 votes | 307 views | Hx | asked 7 months ago
I am trying to install the happybase package (or, for that matter, any package) in a Zeppelin notebook. How do I run pip install from a Zeppelin cell? Neither %pip nor !pip is recognized.
2 answers | 0 votes | 299 views | asked 7 months ago
Is there a way to check the integrity of files copied with S3DistCp at the end of the copy, like DistCp checksum?
1 answer | 0 votes | 321 views | asked 7 months ago
The EMR cluster had 1 primary, 1 core, and 5 task nodes. All 3 node groups were On-Demand (including the task group). I didn't use Spot purchasing for the task group, to avoid unexpected termination. But still EMR...
1 answer | 0 votes | 629 views | asked 7 months ago
In AWS EMR, I encountered the following error message when running a PySpark job, which ran successfully on my local machine: "[System Error] Fail to delete the temp folder". Is there a way to...
Accepted Answer | Amazon EMR
1 answer | 0 votes | 296 views | asked 7 months ago
When using EMR 7.0.0 in EMR Serverless (I have not tried EKS or EC2), after connecting to the application through an EMR Studio workspace, the PySpark kernel doesn't work in a notebook. It stays in...
1 answer | 0 votes | 418 views | tomups | asked 7 months ago
Hi, we have an EMR cluster on which multiple concurrent steps run seamlessly. We're not sure exactly what happened, but since yesterday the step logs and application logs have not been published to S3....
Accepted Answer | Amazon EMR
3 answers | 0 votes | 539 views | Scott M | asked 7 months ago
I am trying to use the aws emr-serverless get-dashboard-for-job-run CLI command to pull information from EMR Serverless, but I am stumped. This command returns a URL and auth token. If I go to the URL, it...
0 answers | 0 votes | 155 views | ebethj | asked 7 months ago
Hi, after EMR 7.0.0 was released last week, we wanted to start using it. Problem: We have shell-script EMR steps that are executed during cluster startup. These EMR steps never...
Accepted Answer | Amazon EC2 | Amazon EMR
2 answers | 0 votes | 459 views | EGeist | asked 7 months ago
Hi, one of my dev team members is asking me to share the S3 location of the EMR Spark artifacts for building a Java application. I referred to this doc...
Accepted Answer | Amazon EMR
1 answer | 1 vote | 254 views | Vaas | asked 7 months ago