time - measure the duration of the whole map phase per node in Hadoop -


At the moment, I know that JobTracker can show you the time that happens according to each map job, but what I want Not every map works, but in the first cluster, until the last one till the last one till one node , it takes time, for example: 1 map takes 2 seconds, but when you work 100 maps How to measure time nodes, and all in parallel Can not be executed?

Is it possible to know the entire map phase (all map operations) to execute per node?

You can use user defined Java counters to get time taken by all the mappers per node Are there.

In the mapper implementation, you need to do the following: 1. Override the setup and record start time long startTime = System.currentTimeMillis (); 2. Cleanup method override

long expiration time = System.currentTimeMillis (); String hostname = java.net.InetAddress.getLocalHost () GetHostName (). Context.getCounters (hostname, "time consuming") Increment (end time - start time);

Comments

Popular posts from this blog

Pass DB Connection parameters to a Kettle a.k.a PDI table Input step dynamically from Excel -

multithreading - PhantomJS-Node in a for Loop -

c++ - MATLAB .m file to .mex file using Matlab Compiler -