At its core, the HPE Vertica Analytics Platform from Hewlett Packard Enterprise is a column-oriented, relational database system built specifically to handle modern analytic workloads. The platform uses a clustered approach to storing big data, offering high-performance query and analytics functionality.
The HPE Vertica Analytics Platform delivers advanced SQL analytics as a standards-based relational database management system with full support for SQL, Java Database Connectivity (JDBC) and Open Database Connectivity (ODBC). This enables companies to preserve their investments and years of training in these technologies because all SQL programming tools and languages work seamlessly together.
The platform is a relational columnar store that groups data on disk by column, rather than by row. Columnar databases make it easier to access columns of data for grouping and aggregation because the next piece of data read off disk is the next value in the same column, not the next field of a row. Because only the columns referenced by a query are read, disk I/O can be reduced and query processing improved.
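To make the row-versus-column distinction concrete, here is a minimal Python sketch. It is a toy in-memory model, not Vertica's storage format; the sample table and the helper names `to_columns` and `sum_by_group` are hypothetical and chosen only for illustration.

```python
# Toy illustration of row-oriented vs. column-oriented layout.
# This models the idea only; it is not how any real database stores data.

# Row-oriented: every record carries all of its fields together.
rows = [
    {"order_id": 1, "region": "east", "amount": 120.0},
    {"order_id": 2, "region": "west", "amount": 75.5},
    {"order_id": 3, "region": "east", "amount": 30.0},
]


def to_columns(rows):
    """Pivot a list of row dicts into one list of values per column."""
    columns = {}
    for row in rows:
        for name, value in row.items():
            columns.setdefault(name, []).append(value)
    return columns


def sum_by_group(columns, group_col, value_col):
    """Aggregate while touching only the two columns the query references."""
    totals = {}
    for key, value in zip(columns[group_col], columns[value_col]):
        totals[key] = totals.get(key, 0.0) + value
    return totals


columns = to_columns(rows)
# Equivalent in spirit to: SELECT region, SUM(amount) ... GROUP BY region
totals = sum_by_group(columns, "region", "amount")
print(totals)
```

In this sketch the aggregation never reads the `order_id` values, which is the in-memory analogue of skipping unreferenced columns on disk.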
The HPE Vertica Analytics Platform is available on premises, in the cloud or deployed on a Hadoop cluster.
HPE Vertica Analytics features and highlights
HPE Vertica Analytics Platform 7.2 delivers enhanced support for Hadoop. SQL queries can be executed directly on ORC files, which can provide a significant boost in performance over traditional Hadoop access.
This version also provides machine data log text search, which helps users collect and index large log file data sets such as those created by networked systems. Log mining can help IT organizations with data security practices by identifying and predicting failures and cyberattacks, as well as investigating unauthorized access.
The improved SQL analytics functions of HPE Vertica 7.2 further enhance the large library of built-in analytical functions, which range from relational online analytical processing to geospatial and sentiment analysis, for deriving insight from varying forms of data.
HPE Vertica 7.2 also delivers improved manageability, with more granular backup and recovery and dynamic alerts, and enhanced security, with improved Lightweight Directory Access Protocol synchronization, role-based access and automated maintenance of users and user groups. Third-party cloud interoperability is also improved for data warehouse as a service, with a wizard that simplifies loading data from Amazon S3 into Vertica.
Data compression is also more efficient in a columnar store because similar data is grouped together on disk by column, so many repetitive or similar values can be aggressively compressed. The HPE Vertica Analytics Platform features a library of compression algorithms that it applies automatically based on data type. Up to a 90% reduction in disk space can be achieved when compressing data on the HPE Vertica Analytics Platform, lowering storage costs and potentially speeding up queries.
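As a rough illustration of why grouping similar values helps, the following Python sketch applies run-length encoding to a single column. Run-length encoding is used here only as one widely known technique; the article does not say which algorithms the platform's compression library contains, and the function names and sample data are hypothetical.

```python
from itertools import groupby


def rle_encode(values):
    """Collapse each run of identical adjacent values into (value, count)."""
    return [(value, sum(1 for _ in run)) for value, run in groupby(values)]


def rle_decode(pairs):
    """Expand (value, count) pairs back into the original sequence."""
    out = []
    for value, count in pairs:
        out.extend([value] * count)
    return out


# A column whose values arrive in long runs, as sorted or low-cardinality
# columns often do. The data is made up for this example.
region_column = ["east"] * 4 + ["west"] * 3 + ["north"] * 2

encoded = rle_encode(region_column)
print(encoded)
print(len(region_column), "values stored as", len(encoded), "pairs")
```

The same values interleaved with other fields in a row layout would form no long runs, so this kind of encoding would save far less.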
The HPE Vertica Analytics Platform is a shared-nothing, distributed database designed to work on clusters of inexpensive, off-the-shelf servers, and its performance scales as new servers are added to the cluster. The platform's grid architecture reduces hardware and scaling costs substantially when compared to traditional databases that require large servers with many CPUs and storage area networks. Clustering can also improve performance by parallelizing querying and loading across the nodes in the cluster for higher throughput.
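The shared-nothing idea can be sketched in a few lines of Python: each node owns a disjoint slice of the rows, computes a partial result on its own data, and a final step merges the partials. This is a simplified model under stated assumptions, not Vertica's implementation. Hash-based placement, the thread pool standing in for separate servers, and all function names are illustrative choices.

```python
import zlib
from concurrent.futures import ThreadPoolExecutor


def node_for(key, num_nodes):
    """Pick a node for a key with a deterministic hash (assumed scheme)."""
    return zlib.crc32(key.encode("utf-8")) % num_nodes


def segment(rows, num_nodes):
    """Split rows into disjoint per-node lists; no row lives on two nodes."""
    nodes = [[] for _ in range(num_nodes)]
    for key, amount in rows:
        nodes[node_for(key, num_nodes)].append((key, amount))
    return nodes


def local_sum(node_rows):
    """Work done independently on one node, using only its own rows."""
    totals = {}
    for key, amount in node_rows:
        totals[key] = totals.get(key, 0) + amount
    return totals


def cluster_sum(rows, num_nodes):
    """Run local_sum on every node in parallel, then merge the partials."""
    nodes = segment(rows, num_nodes)
    with ThreadPoolExecutor(max_workers=num_nodes) as pool:
        partials = list(pool.map(local_sum, nodes))
    merged = {}
    for partial in partials:
        for key, total in partial.items():
            merged[key] = merged.get(key, 0) + total
    return merged


sales = [("east", 120), ("west", 75), ("east", 30), ("north", 10)]
result = cluster_sum(sales, num_nodes=3)
print(result)
```

Changing `num_nodes` changes how the rows are spread out but not the merged answer, which is what lets a cluster of this kind grow by adding servers.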
The HPE Vertica Analytics Platform is supported on a variety of Linux distributions, including:
- Red Hat Enterprise Linux 6.x and 7.x
- SUSE Linux Enterprise Server 11.0 through 11.0 SP3
- Debian Linux 7.0 through 7.7
- CentOS 6.x and 7.x
- Ubuntu 12.04 LTS and 14.04 LTS
Client drivers are offered for JDBC, ODBC and ADO.NET. Users can download them individually or as client packages. HPE also provides a module specific to Hadoop for HPE Vertica client machines.
HPE Vertica licensing, pricing and support
On-premises. The community edition is free for up to 1 terabyte (TB) of data. A fully licensed version with full support starts at a minimum of 1 TB, and users are charged per terabyte when installing on the Vertica Platform.
In the cloud. Users can choose either storage-centric or query-centric pricing. For example, users could buy 1 TB and 100,000 queries for $999. If more queries are needed, users can buy a query pack for $999 to get an additional 500,000 queries and 250 GB of storage. A free 14-day trial of HPE Vertica OnDemand is available.
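The arithmetic behind the cloud example can be sketched as follows. The prices and quantities come from the example above; everything else is an assumption made for illustration, including that query packs stack linearly, that 1 TB is counted as 1,000 GB, and the function name `estimate_plan`. The article does not state the billing period, so none is modeled.

```python
# Figures taken from the article's example.
BASE_PRICE = 999          # dollars
BASE_QUERIES = 100_000
BASE_STORAGE_GB = 1_000   # assumption: 1 TB counted as 1,000 GB
PACK_PRICE = 999          # dollars per additional query pack
PACK_QUERIES = 500_000
PACK_STORAGE_GB = 250


def estimate_plan(queries_needed):
    """Estimate cost and capacity, assuming packs can be stacked freely."""
    extra = max(0, queries_needed - BASE_QUERIES)
    packs = -(-extra // PACK_QUERIES)  # integer ceiling division
    return {
        "packs": packs,
        "cost": BASE_PRICE + packs * PACK_PRICE,
        "queries": BASE_QUERIES + packs * PACK_QUERIES,
        "storage_gb": BASE_STORAGE_GB + packs * PACK_STORAGE_GB,
    }


plan = estimate_plan(600_000)
print(plan)
```

Under these assumptions, a workload of 600,000 queries needs the base purchase plus one query pack.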
For customers wishing to use Vertica for SQL on Hadoop, licensing is by node. Users pay per node with no other restrictions on queries, concurrent users, storage used, etc.
Support and maintenance are negotiated as part of the initial purchase.