Introduction to Hive:
Hive is a data warehousing tool in the Hadoop ecosystem. It has its own query language, "HQL" (Hive Query Language), which resembles the traditional query language "SQL" but has its own syntax.
Hive is mainly used to process structured and semi-structured data. Hive stores its data in HDFS. It presents data in terms of tables and columns, but ultimately a Hive table is just one or more files in HDFS. Hive has two main components: the warehouse and the metadata.
Hive stores the data itself in the warehouse, and all schema-related information (table names, columns, data types, file locations) is stored in the metadata, which Hive keeps in its metastore.
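To make the warehouse/metadata split a little more concrete, here is a small toy model in Python. It is not Hive code and does not use any Hive API: the names `create_table`, `load_rows`, `scan` and the `employees` table are made up for illustration, and a local temporary folder stands in for HDFS. It only shows the idea that the data sits in plain files while the schema is kept separately and applied when the data is read.

```python
import csv
import tempfile
from pathlib import Path

# Toy stand-ins only (hypothetical names, not Hive APIs):
#   warehouse -> a directory tree holding plain data files
#   metadata  -> a dict holding each table's schema and file location


def create_table(warehouse, metadata, name, columns):
    """Register a table: make its directory and record its schema."""
    table_dir = Path(warehouse) / name
    table_dir.mkdir(parents=True, exist_ok=True)
    metadata[name] = {"location": table_dir, "columns": columns}


def load_rows(metadata, name, lines):
    """Append raw comma-delimited lines to a data file in the table directory."""
    data_file = metadata[name]["location"] / "part-00000"
    with data_file.open("a", newline="") as f:
        for line in lines:
            f.write(line + "\n")


def scan(metadata, name):
    """Read the table's files and apply the stored schema at read time."""
    entry = metadata[name]
    rows = []
    for data_file in sorted(entry["location"].iterdir()):
        with data_file.open(newline="") as f:
            for raw in csv.reader(f):
                rows.append(
                    {col: cast(val) for (col, cast), val in zip(entry["columns"], raw)}
                )
    return rows


def demo():
    """Build a one-table warehouse in a temporary folder and read it back."""
    with tempfile.TemporaryDirectory() as tmp:
        metadata = {}
        create_table(
            Path(tmp) / "warehouse",
            metadata,
            "employees",
            [("id", int), ("name", str), ("salary", float)],
        )
        load_rows(metadata, "employees", ["1,Asha,52000.0", "2,Ravi,48000.0"])
        return scan(metadata, "employees")


if __name__ == "__main__":
    for row in demo():
        print(row)
```

Notice that the data file itself contains no column names or types. If the entry in `metadata` were lost, the file would still be there, but nothing would say how to interpret it. That is roughly why Hive keeps the schema separately from the data files.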
***This page is still under construction***