- What kind of data warehouse application is suitable for Hive? What are the types of tables in Hive?
- Explain the SMB Join in Hive?
- How HIVE is different from RDBMS?
- What are the types of database does Hive support ?
- In Hive, how can you enable buckets?
- Is Hive suitable to be used for OLTP systems? Why?
- What is the Object Inspector functionality is in Hive?
- What are limitations of Hive?
- What are the different Modes in the Hive?
- What is Hive Bucketing?
- What is the difference between partition and bucketing?
- Where does the data of a Hive table gets stored?
- How data transfer happens from HDFS to Hive?
- What does the Hive query processor do?
- Explain about SORT BY, ORDER BY, DISTRIBUTE BY and CLUSTER BY in Hive.
- What is the difference between local and remote metastore?
- Which classes are used in Hive to Read and Write HDFS Files?
- Explain the functionality of ObjectInspector.
- What is ObjectInspector functionality in Hive?
- How does bucketing help in the faster execution of queries?
- Why will mapreduce not run if you run select * from table in hive?
- What is Hive MetaStore?
- What are the three different modes in which hive can be run?
- How can you prevent a large job from running for a long time?
- When do we use explode in Hive?
- What are the different components of a Hive architecture?
- How can you connect an application, if you run Hive as a server?
- Can we LOAD data into a view?
- Is it possible to add 100 nodes when we already have 100 nodes in Hive? If yes, how?
- Can Hive process any type of data formats?
- How can you stop a partition form being queried?
- What is a Hive variable? What do we use it for?
- What is SerDe in Apache Hive?
- Whenever we run a Hive query, a new metastore_db is created. Why?
- Can we change the data type of a column in a hive table?
- Why does Hive not store metadata information in HDFS?
- How does Hive deserialize and serialize the data?
- What is RegexSerDe?
- While loading data into a hive table using the LOAD DATA clause, how do you specify it is a hdfs file and not a local file ?
- Explain about the different types of partitioning in Hive?
- What is the significance of ‘IF EXISTS” clause while dropping a table?
- How can Hive avoid mapreduce?
- What is the relationship between MapReduce and Hive? or How Mapreduce jobs submits on the cluster?
- What is ObjectInspector functionality?
- Suppose that I want to monitor all the open and aborted transactions in the system along with the transaction id and the transaction state. Can this be achieved using Apache Hive?
- Can a partition be archived? What are the advantages and disadvantages?
- does the archiving of Hive tables save space in HDFS?
- does Hive support record level Insert, delete or update?
- What are the default record and field delimiter used for hive text files?
- What is difference between static and dynamic partition of a table?
- Why do we perform partitioning in Hive?
- How does partitioning help in the faster execution of queries?
- Can you list few commonly used Hive services?
- What is the default maximum dynamic partition that can be created by a mapper/reducer? How can you change it?
- Why do we need buckets?
- Can we name view the same as the name of a Hive table?
- What Options are Available When It Comes to Attaching Applications to the Hive Server?
- When should we use SORT BY instead of ORDER BY?
- What are the uses of Hive Explode?
- Can we run UNIX shell commands from Hive? Can Hive queries be executed from script files? If yes, how?
- How is ORC file format optimised for data storage and analysis?
- What is the difference between Internal and External Table?
- Explain the different types of join in Hive.
- What is a metastore in Hive?
- What is the functionality of Query Processor in Apache Hive?
- What is the utilization of Hcatalog?
- How will you optimize Hive performance?
- In case of embedded Hive, can the same metastore be used by multiple users?
- When to use Map reduce mode?
- What is the importance of Thrift server & client, JDBC and ODBC driver in Hive?
- The property set to run hive in local mode as true so that it runs without creating a mapreduce job is
- When a partition is archived in Hive it
- A user creates a UDF which accepts arguments of different data types, each time it is run. It is an example of
- While querying a hive table for a Array type column, if the array index is nonexistent then
- A GenericUDF is a Function that
- Which of the following scenarios are not prevented by enabling strict mode in Hive?
- In hive, what happens when the schema does not match the file content?
- The DISTRIBUTED BY clause in hive
- In ______ mode HiveServer2 only accepts valid Thrift calls.
- The disadvantage of compressing files in HDFS is
- The partitioning of a table in Hive creates more
- The Property that decides what is the maximum number of files that can be sampled during the use of the LIMIT clause is
- For optimizing join of three tables, the largest sized tables should be placed as
- The drawback of managed tables in hive is
- Which of the following command sets the value of a particular configuration variable (key)?
- The below expression in the where clause RLIKE '.*(Chicago|Ontario).*'; gives the result which match
- What is the disadvantage of using too many partitions in Hive tables?
- By default when a database is dropped in Hive:
- Explode in Hive is used to convert complex data types into desired table formats.
- Point out the correct statement.
- Point out the correct statement
- Point out the wrong statement:
- Hive converts queries to all except
- The thrift service component in hive is used for