Hadoop DP Notes: How can we control the parallel copying of RDBMS tables into hadoop ?

How can we control the parallel copying of RDBMS tables into hadoop ?

We can control/increase/decrease speed of copying by configuring the number of map tasks to be run for each sqoop copying process. We can do this by providing argument -m 10 or –num-mappers 10 argument to sqoop import command. If we specify -m 10 then it will submit 10 map tasks parallel at a time. Based on our requirement we can increase/decrease this number to control the copy speed.

Hadoop DP Notes

Search

How can we control the parallel copying of RDBMS tables into hadoop ?

Visitors