time - measure the duration of the whole map phase per node in Hadoop -
At the moment, I know that JobTracker can show you the time that happens according to each map job, but what I want Not every map works, but in the first cluster, until the last one till the last one till one node , it takes time, for example: 1 map takes 2 seconds, but when you work 100 maps How to measure time nodes, and all in parallel Can not be executed? Is it possible to know the entire map phase (all map operations) to execute per node? You can use user defined Java counters to get time taken by all the mappers per node Are there. In the mapper implementation, you need to do the following: 1. Override the setup and record start time long startTime = System.currentTimeMillis (); 2. Cleanup method override
long expiration time = System.currentTimeMillis (); String hostname = java.net.InetAddress.getLocalHost () GetHostName (). Context.getCounters (hostname, "time consuming") Increment (end time - start time);
Comments
Post a Comment