Questions tagged with Amazon EMR

Content language: English

Select up to 5 tags to filter

Sort by most recent

Filter Questions by

AllAnsweredUnansweredNo Answer

Browse through the questions and answers listed below or filter and sort to narrow down your results.

Reading Hbase data directly from S3 after migrating Hbase to S3 without EMR

Hello As part of Cloud Migration and Modernization approach using using AWS, the requirement is to migrate Hbase data directly to S3 then read the data from S3 using Java Microservices. (EMR would not...

Accepted AnswerAmazon Simple Storage Service AWS Fargate Amazon EMR Microservices

answers

votes

279

views

bnaha

asked 7 days ago

S3 Metadata Query across date folder or hundred thousand folders

I have a use case where I need to run Batch EMR job on schedule (daily). I can make folders on date basis for my data coming from IoT. Or I can make folders for each device sending IoT data and put...

Amazon Simple Storage Service Analytics Amazon EMR Storage S3 Select

answers

votes

272

views

Ashutosh

asked 21 days ago

AWS EMR parallel tasks and performance issue

Trying to load data of 200GB into dynamo using spark EMR but facing performance issues. """ Copy paste the following code in your Lambda function. Make sure to change the following key parameters for...

AWS Lambda Amazon EMR Amazon DynamoDB

answers

votes

611

views

Chitranshu

asked a month ago

EMR: HBase cluster WAL writer failure

I'm trying to create a EMR 7.1.0 cluster with HBase enabled for full S3 backup (including WAL) via the web console. However, no AWSServiceRoleForEMRWAL role is automatically being created and thus my...

Amazon EMR

answers

votes

258

views

donaldthecat

asked a month ago

Can you use OSS Trino on EMR with Lake Formation access controls

I'm trying to find out if Trino on EMR supports access controls maintained in Lake Formation. My catalog is AWS Glue. I couldn't find any documentation on Lake Formation or EMR side that would talk...

Amazon EMR AWS Glue AWS Lake Formation

answers

votes

329

views

Saawgr

asked a month ago

Service: EmrServerlessResourceManager; Status Code: 403; Error Code: AccessDeniedException

Hello, Can we get solution for this error `Service: EmrServerlessResourceManager; Status Code: 403; Error Code: AccessDeniedException` while running spark submit jobs at EMR Serverless. Below is...

Serverless Amazon EMR Containers Amazon EMR Serverless

answers

votes

541

views

Ashwath

asked a month ago

Default EMR Spark python environment contains two different versions of the dateutil package

I noticed that when you create a new EMR cluster using Spark, the default Python environment includes two different packages that both provide the "dateutil"...

Amazon EMR

answers

votes

467

views

dgibson

asked 2 months ago

EMR HDFS data restore

Hello Experts, Technically speaking, EBS volumes assigned to the EMR core nodes are persistent storage and I have specifically created them to not delete on cluster termination. Then, I have attached...

Accepted AnswerAmazon EMR

answers

votes

448

views

Scott M

asked 2 months ago

Issues running PySpark on AWS Lambda

I know the recommended strategy is to use EMR Serverless or EMR. However, I have a particular use case where I only need to run a fairly small PySpark job and need quick results. I've already gotten...

AWS Lambda Amazon EMR Amazon EMR Serverless

answers

votes

677

views

zzzz8888

asked 2 months ago

Amazon EMR sg for Master and Core nodes

Why does Amazon EMR creates inbound rule entries for master and core security groups? ![Core SG](/media/postImages/original/IM6Mggxg_vTQSTJFNCM0FRPA) ![Master...

Accepted AnswerAWS CloudFormation Amazon EC2 Amazon EMR

answers

votes

569

views

Ricardo Estrada

asked 3 months ago

Get the Last Execution Code block time on EMR notebook/workspace

I have an EMR workspace under which I have 4 Jupyter notebooks created on which PySpark code blocks are run. I want to get the last execution code block time across all 4 notebooks to determine the...

Accepted AnswerAmazon EMR Amazon EMR Studio

answers

votes

553

views

Sukrit

asked 3 months ago

How can I change default s3 storage class of Hive connector of EMR Trino?

I want to change the default s3 storage class to INTELLIGENT_TIERING of Hive connector of EMR Trino 426 (EMR 6.15.0). I found the [hive.s3.storage-class option in the Trino 426 official...

Accepted AnswerAmazon EMR

answers

votes

622

views

rePost-User-3418860

asked 4 months ago

1
2
3
4
5
•••
26
12 / page

Questions tagged with Amazon EMR

Reading Hbase data directly from S3 after migrating Hbase to S3 without EMRlg...

S3 Metadata Query across date folder or hundred thousand folderslg...

AWS EMR parallel tasks and performance issuelg...

EMR: HBase cluster WAL writer failurelg...

Can you use OSS Trino on EMR with Lake Formation access controlslg...

Service: EmrServerlessResourceManager; Status Code: 403; Error Code: AccessDeniedExceptionlg...

Default EMR Spark python environment contains two different versions of the dateutil packagelg...

EMR HDFS data restorelg...

Issues running PySpark on AWS Lambdalg...

Amazon EMR sg for Master and Core nodeslg...

Get the Last Execution Code block time on EMR notebook/workspacelg...

How can I change default s3 storage class of Hive connector of EMR Trino?lg...

Reading Hbase data directly from S3 after migrating Hbase to S3 without EMR

S3 Metadata Query across date folder or hundred thousand folders

AWS EMR parallel tasks and performance issue

EMR: HBase cluster WAL writer failure

Can you use OSS Trino on EMR with Lake Formation access controls

Service: EmrServerlessResourceManager; Status Code: 403; Error Code: AccessDeniedException

Default EMR Spark python environment contains two different versions of the dateutil package

EMR HDFS data restore

Issues running PySpark on AWS Lambda

Amazon EMR sg for Master and Core nodes

Get the Last Execution Code block time on EMR notebook/workspace

How can I change default s3 storage class of Hive connector of EMR Trino?