HBase Online Tutorials

HBase is an open-source non-relational distributed database modeled after Google's Bigtable and written in Java. It is developed as part of Apache Software Foundation's Apache Hadoop project and runs on top of HDFS (Hadoop Distributed File System) or Alluxio, providing Bigtable-like capabilities for Hadoop. That is, it provides a fault-tolerant way of storing large quantities of sparse data (small amounts of information caught within a large collection of empty or unimportant data, such as finding the 50 largest items in a group of 2 billion records, or finding the non-zero items representing less than 0.1% of a huge collection).
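To make the idea of sparse storage concrete, here is a minimal sketch in plain Java. It is not the HBase client API: `SparseTable` and its methods are hypothetical names, and the nested sorted maps only approximate the Bigtable-style model, in which a row stores just the cells that were actually written and rows are kept in key order.

```java
import java.util.NavigableMap;
import java.util.TreeMap;

// Toy in-memory model of a sparse, sorted wide-column table.
// Hypothetical illustration only; this is NOT the HBase client API.
class SparseTable {
    // row key -> (column name -> value); both levels are kept in sorted order.
    private final NavigableMap<String, NavigableMap<String, String>> rows = new TreeMap<>();

    // Store one cell. Nothing is allocated for columns a row does not use.
    void put(String row, String column, String value) {
        rows.computeIfAbsent(row, r -> new TreeMap<>()).put(column, value);
    }

    // Return the cell value, or null when the cell was never written.
    String get(String row, String column) {
        NavigableMap<String, String> cells = rows.get(row);
        return cells == null ? null : cells.get(column);
    }

    // Count only the cells that actually exist; empty cells cost nothing.
    int cellCount() {
        int n = 0;
        for (NavigableMap<String, String> cells : rows.values()) {
            n += cells.size();
        }
        return n;
    }

    // View of the rows whose keys fall in [startRow, stopRow), in key order.
    NavigableMap<String, NavigableMap<String, String>> scan(String startRow, String stopRow) {
        return rows.subMap(startRow, true, stopRow, false);
    }
}
```

Three rows that each use a different column occupy three cells in this model, not nine, which is the property that makes very wide, mostly empty tables affordable.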

Apache HBase
Original author(s): Powerset
Developer(s): Apache Software Foundation
Initial release: 28 March 2008
Stable release:
  2.4.x: 2.4.14 / 29 August 2022
  2.5.x: 2.5.3 / 5 February 2023
Preview release: 3.0.0-alpha-3 / 27 June 2022
Repository: GitHub Repository, Gitbox Repository
Written in: Java
Operating system: Cross-platform
Type: Distributed database
License: Apache License 2.0
Website: hbase.apache.org

HBase features compression, in-memory operation, and Bloom filters on a per-column basis, as outlined in the original Bigtable paper. Tables in HBase can serve as the input and output for MapReduce jobs run in Hadoop, and may be accessed through the Java API as well as through REST, Avro or Thrift gateway APIs. HBase is a wide-column store and has been widely adopted because of its lineage with Hadoop and HDFS. It is well-suited for fast read and write operations on large datasets, with high throughput and low input/output latency.
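The role of a Bloom filter can be illustrated with a short, generic sketch. This is not HBase's own implementation: `SimpleBloomFilter`, its sizing parameters, and the hashing scheme (FNV-1a combined with `String.hashCode` via double hashing) are assumptions chosen for brevity. The point is only that a "no" answer is definite, so a read can skip data that cannot contain the requested key, while a "yes" answer may occasionally be a false positive.

```java
import java.nio.charset.StandardCharsets;
import java.util.BitSet;

// Generic Bloom filter sketch; hypothetical class, not HBase's implementation.
class SimpleBloomFilter {
    private final BitSet bits;
    private final int size;       // number of bits in the filter
    private final int hashCount;  // number of bit positions set per key

    SimpleBloomFilter(int size, int hashCount) {
        if (size <= 0 || hashCount <= 0) {
            throw new IllegalArgumentException("size and hashCount must be positive");
        }
        this.bits = new BitSet(size);
        this.size = size;
        this.hashCount = hashCount;
    }

    // Derive the i-th bit position from two base hashes (double hashing).
    private int index(String key, int i) {
        int h1 = 0x811C9DC5; // FNV-1a offset basis
        for (byte b : key.getBytes(StandardCharsets.UTF_8)) {
            h1 ^= (b & 0xff);
            h1 *= 16777619;  // FNV-1a prime; int overflow is intentional
        }
        int h2 = key.hashCode() | 1; // forced odd so the step is never zero
        return Math.floorMod(h1 + i * h2, size);
    }

    // Record a key by setting all of its bit positions.
    void add(String key) {
        for (int i = 0; i < hashCount; i++) {
            bits.set(index(key, i));
        }
    }

    // false means the key was definitely never added;
    // true means it was probably added (false positives are possible).
    boolean mightContain(String key) {
        for (int i = 0; i < hashCount; i++) {
            if (!bits.get(index(key, i))) {
                return false;
            }
        }
        return true;
    }
}
```

A storage engine would typically consult such a filter before opening a file: if `mightContain` returns false, the file is skipped without any disk read.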

HBase is not a direct replacement for a classic SQL database; however, the Apache Phoenix project provides a SQL layer for HBase as well as a JDBC driver that can be integrated with various analytics and business intelligence applications. The Apache Trafodion project provides a SQL query engine with ODBC and JDBC drivers, and distributed ACID transaction protection across multiple statements, tables and rows, using HBase as a storage engine.

HBase serves several data-driven websites, although Facebook's messaging platform migrated from HBase to MyRocks in 2018. Unlike relational and traditional databases, HBase does not support SQL scripting; the equivalent logic is instead written in Java, in a style similar to a MapReduce application.

In the parlance of Eric Brewer's CAP Theorem, HBase is a CP type system.

History

Apache HBase began as a project by the company Powerset, out of a need to process massive amounts of data for natural-language search. It has been a top-level Apache project since 2010.

Facebook elected to implement its new messaging platform using HBase in November 2010, but migrated away from HBase in 2018.

The 2.4.x series is the current stable release line; it supersedes earlier release lines.

Use cases & production deployments

Enterprises that use HBase

The following is a list of notable enterprises that have used or are using HBase:

  • 23andMe
  • Adobe
  • Airbnb uses HBase as part of its AirStream realtime stream computation framework
  • Alibaba Group
  • Amadeus IT Group, as its main long-term storage DB.
  • Bloomberg, for time series data storage
  • Facebook used HBase for its messaging platform between 2010 and 2018
  • Flipkart uses HBase for its search index and user insights.[9]
  • Flurry
  • HubSpot
  • Imgur uses HBase to power its notifications system[10][11]
  • Kakao[12]
  • Netflix[13]
  • Pinterest[14]
  • Quicken Loans
  • Rocket Fuel
  • Salesforce.com[15]
  • Sears
  • Sophos, for some of their back-end systems.
  • Spotify uses HBase as a base for Hadoop and machine learning jobs.[16]
  • Twitter
  • Tuenti uses HBase for its messaging platform.[17][18]
  • Xiaomi
  • Yahoo!

See also

  • NoSQL
  • Wide column store
  • Bigtable
  • Apache Cassandra
  • Oracle NoSQL
  • Hypertable
  • Apache Accumulo
  • MongoDB
  • Project Voldemort
  • Riak
  • Sqoop
  • Elasticsearch
  • Apache Phoenix

References

  1. "Apache HBase – Apache HBase Downloads". Retrieved 27 September 2022.
  2. Chang, et al. (2006). "Bigtable: A Distributed Storage System for Structured Data".
  3. "Apache HBase – Powered By Apache HBase". hbase.apache.org. Retrieved 8 April 2018.
  4. "Migrating Messenger storage to optimize performance". www.facebook.com. 26 June 2018. Retrieved 5 July 2018.
  5. "Facebook: Why our 'next-gen' comms ditched MySQL". Retrieved 17 December 2010.
  6. "Consistency Tradeoffs in Modern Distributed Database System Design" (PDF). February 2012. Retrieved 23 October 2024.
  7. HBaseCon (2 August 2016). "Apache HBase at Airbnb". slideshare.net. Retrieved 8 April 2018.
  8. "Near Real Time Search Indexing". 4 January 2018.
  9. "Is data locality always out of the box in Hadoop?". 10 March 2018.
