Hadoop MapReduce flow with a custom InputFormat and RecordReader [closed]























I'm new to Hadoop and am currently learning MapReduce design patterns from the book MapReduce Design Patterns by Donald Miner and Adam Shook. The book's Cartesian Product join pattern creates a custom InputFormat and RecordReader, and I'm confused about the flow. I have two questions about the code:




  1. Where does the InputFormat and RecordReader code actually run? Does the InputFormat run in the AppMaster and the RecordReader on a NodeManager?

  2. Why does CartesianRecordReader create another InputFormat instance? In every MapReduce flow I have found, the InputFormat runs before the RecordReader does.


Here is the source code:
https://github.com/adamjshook/mapreducepatterns/blob/master/MRDP/src/main/java/mrdp/ch5/CartesianProduct.java
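
To make the second question concrete, here is a minimal plain-Java sketch of the shape being asked about: a reader that holds its own format instance. It is not the book's code and uses no Hadoop classes; `Split`, `Format`, `ListFormat`, `PairSplit`, `planSplits` and `readPairs` are all hypothetical stand-ins. The assumption it illustrates is that the inner format is used only to open readers on the two underlying splits, while split planning has already happened in a separate, earlier step (in Hadoop, as far as I know, splits are normally computed at job submission and readers are opened inside each map task).

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

// Plain-Java sketch; none of these types are real Hadoop classes.
class CartesianSketch {

    // Stand-in for an input split: a slice of records held in memory.
    static final class Split {
        final List<String> records;
        Split(List<String> records) { this.records = records; }
    }

    // Stand-in for an input format: plans splits, and later opens a reader on one split.
    interface Format {
        List<Split> getSplits(List<String> data, int splitSize);
        Iterator<String> createReader(Split split);
    }

    static final class ListFormat implements Format {
        public List<Split> getSplits(List<String> data, int splitSize) {
            List<Split> splits = new ArrayList<>();
            for (int i = 0; i < data.size(); i += splitSize) {
                int end = Math.min(i + splitSize, data.size());
                splits.add(new Split(new ArrayList<>(data.subList(i, end))));
            }
            return splits;
        }

        public Iterator<String> createReader(Split split) {
            return split.records.iterator();
        }
    }

    // Stand-in for a composite split: one left split paired with one right split.
    static final class PairSplit {
        final Split left;
        final Split right;
        PairSplit(Split left, Split right) { this.left = left; this.right = right; }
    }

    // Planning phase: runs once, before any reader exists.
    static List<PairSplit> planSplits(Format format, List<String> left,
                                      List<String> right, int splitSize) {
        List<PairSplit> pairs = new ArrayList<>();
        for (Split l : format.getSplits(left, splitSize)) {
            for (Split r : format.getSplits(right, splitSize)) {
                pairs.add(new PairSplit(l, r));
            }
        }
        return pairs;
    }

    // Reading phase: called once per PairSplit. It builds its own Format instance
    // only to open readers on the two underlying splits; no splits are planned here.
    static List<String> readPairs(PairSplit split) {
        Format inner = new ListFormat();
        List<String> out = new ArrayList<>();
        Iterator<String> leftReader = inner.createReader(split.left);
        while (leftReader.hasNext()) {
            String l = leftReader.next();
            // Re-open the right reader for every left record.
            Iterator<String> rightReader = inner.createReader(split.right);
            while (rightReader.hasNext()) {
                out.add(l + "," + rightReader.next());
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<PairSplit> plan = planSplits(new ListFormat(),
                Arrays.asList("a", "b", "c"), Arrays.asList("x", "y"), 2);
        for (PairSplit p : plan) {
            System.out.println(readPairs(p));
        }
    }
}
```

In this sketch the format object plays two separate roles: `getSplits` is used only while planning, and `createReader` only while reading, so a second instance inside the reader does not mean the planning step runs again.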



That's all, thanks in advance :)

























closed as too broad by cricket_007, GBlodgett, Graham, Shiladitya, Matthew L Daniel Nov 11 at 5:00


Please edit the question to limit it to a specific problem with enough detail to identify an adequate answer. Avoid asking multiple distinct questions at once. See the How to Ask page for help clarifying this question. If this question can be reworded to fit the rules in the help center, please edit the question.



















      java hadoop design-patterns mapreduce














      edited Nov 13 at 16:14

























      asked Nov 9 at 16:07









      Irvan




































