hadoop - Knowing usage of mapper and reducer -

- May 15, 2010

I'm running a dip Latin script on 550 GB of data. I reddor is default 1. It takes approximately 38 minutes to generate the result Are there. I want to know that if the number of resellers who execute the script faster, then

Any help would be appreciated.

In addition to this, I had to know the concept behind changing mapers and reducers.

The increase in the number of reducers will definitely be helpful (if the operation you have done is aggregation), as real aggregation is decreasing, running many reducers will increase the performance.

You can use the word 'parallel' to determine the number of reducers in the pig. Ex: A = Load 'myfile' AS (T, U, V); B = Group A By T PARALLEL 18;

The number of mappers is decided by using the input size and the map. The number of mappers is usually equal to the number of input divisions.

Search This Blog

BAVO

hadoop - Knowing usage of mapper and reducer -

Comments

Post a Comment

Popular posts from this blog

c++ - Cmake produces file extensions in static library archives -

c# - Roxy file manager in MVC doesn't accept session path -

c# - Check if a NumericUpDown has no value -