Load XML File in Hive

We can load XML data in HIVE Table very easily just like simple delimited file.
The only difference between loading Delimited File and XML File is we have to use Hive provided xpath UDF in order to extract the data residing within the tags.

All the steps that I have used are committed in following file on GitHub GIST.
You can find below screenshots depicting the execution scenarios of those commands.

Loading XML into Hive
Process Screenshot
VIEW records
Incremental Records via VIEW

Thank you for having a read.
Kindly revert back if you have any doubts or need more clarifications.

Published by milindjagre

I founded my blog www.milindjagre.co four years ago and am currently working as a Data Scientist Analyst at the Ford Motor Company. I graduated from the University of Connecticut pursuing Master of Science in Business Analytics and Project Management. I am working hard and learning a lot of new things in the field of Data Science. I am a strong believer of constant and directional efforts keeping the teamwork at the highest priority. Please reach out to me at milindjagre@gmail.com for further information. Cheers!

One thought on “Load XML File in Hive

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

This site uses Akismet to reduce spam. Learn how your comment data is processed.

%d bloggers like this: