Thursday, 2 October 2014

XML Parsing using Map Reduce or Pig in Hortonworks



I have a sample file..I want to parse XML file in MR(Map Reduce) or Pig Latin..Can any once have solution on the below Example..and in my requirement i have mentioned the output also... I have a large volume of data...One student info i have listed below...XML file below..(Also added as a comment)


Input XML: Please dont mention any nick names or alias names Discount applied only if paid full fees at a time Pass with Distinction Atleast one number is mandatory Only Regular students information is available not for distance or summer course students


Output:(in Text or CSV file & that file i am giving it as input to my Hive table) Sname Gender Sid Cname Cid Branch totalfees Totalmarks textinmarks Mobile TextinCollegeinfo textinFees


No comments:

Post a Comment