Vertica is an Analytics database management software company.[ Network World staff: "New database company raises funds, nabs ex-Oracle bigwigs", [1] LinuxWorld, February 14, 2007][Brodkin, J: "10 enterprise software companies to watch", [2] Network World, April 11, 2007] Vertica was founded in 2005 by the database researcher Michael Stonebraker with Andrew Palmer as the founding CEO. Ralph Breslauer and Christopher P. Lynch served as CEOs later on.
Lynch joined as chairman and CEO in 2010 and was responsible for Vertica's acquisition by Hewlett Packard in March 2011.[ HP News Release: "HP to Acquire Vertica: Customers Can Analyze Massive Amounts of Big Data at Speed and Scale" Feb. 2011][
HP News Release: "HP Completes Acquisition of Vertica Systems, Inc." March 22, 2011.] The acquisition expanded the HP Software portfolio for enterprise companies and the public sector group.[ ComputerWorld.com: "Update: HP to buy Vertica for analytics." Kanaracus. Feb. 2011.] As part of the merger of Micro Focus and the Software division of Hewlett Packard Enterprise, Vertica joined Micro Focus in September 2017.[ SiliconAngle: "Vertica survives software industry turmoil to emerge as key cloud and big data player" Albertson.] As part of OpenText acquisition of Micro Focus, Vertica joined OpenText in January 2023.
Products
The column-oriented Vertica Analytics Database was designed to manage large, fast-growing volumes of data and with fast query performance for
and other query-intensive applications. The product claims to greatly improve query performance over traditional
RDBMS, and to provide high availability and
exabyte scalability on commodity enterprise servers. Vertica runs on multiple
cloud computing as well as on
Apache Hadoop nodes. Vertica's Eon Mode separates compute from storage, using S3 object storage and dynamic allocation of compute notes.
[ Press Release: "Micro Focus Announces Vertica in Eon Mode for Pure Storage" Sept 17, 2019]
Vertica's design features include:
-
Column-oriented storage organization, which increases performance of sequential record access at the expense of common transactional operations such as single record retrieval, updates, and deletes.
[Monash, C: "Are row-oriented RDBMS obsolete?" [8] DBMS2, January 22, 2007]
-
Massively parallel processing (MPP) architecture to distribute queries on independent nodes and scale performance linearly.
-
Standard SQL interface with many analytics capabilities built-in, such as time series gap filling/interpolation, event-based windowing and sessionization, pattern matching, event series joins, statistical computation (e.g., regression analysis), and geospatial analysis.
-
In-database machine learning including categorization, fitting and prediction without down-sampling and data movement. Vertica offers a variety of in-database algorithms, including linear regression, logistic regression, k-means clustering, Naive Bayes classification, Random Forest, XGBoost, and support vector machine regression and classification. It also allows deployment of ML models to multiple clusters.
-
Data compression, possible because columns of homogeneous datatype are stored together and because updates to the main store are batched.
[Monash, C: "Mike Stonebraker on database compression – comments",[9] DBMS2, March 24, 2007]
-
Automated workload management, data replication, server recovery, query optimization, and storage optimization.
-
Native integration with open source big data technologies like Apache Kafka and Apache Spark.
-
Support for standard programming interfaces, including ODBC, JDBC, ADO.NET, and OLEDB.
-
High-performance and parallel data transfer to statistical tools and built-in machine learning algorithms.
Vertica's specialized approach aims to significantly increase query performance in data warehouses, while reducing hardware costs.[ One Size Fits All? Part 2: Benchmarking Results (sect. 3.1)]
Since 2011, Vertica has offered a limited-capacity community edition for free.
In July, 2021, Vertica announced an SaaS offering, Vertica Accelerator, running on Amazon AWS.[ PR Newswire: "Vertica Announces Early Access of Vertica Accelerator" Micro Focus. June 15, 2021.]
Optimizations
Vertica originated as the
C-Store column-oriented database, an
open source research project at MIT and other universities, published in 2005.
Vertica runs on Computer cluster of commodity servers or on commercial clouds. It integrates with Hadoop, using HDFS.
In 2018, Vertica introduced Vertica in Eon Mode, a separation of compute and storage architecture. The Eon architecture allows for elastic increase and decrease in compute capability as needed for workload elasticity. It also allows instantiation of multiple isolated sub-clusters dedicated to different workloads while maintaining a single shared data repository. It operates on shared object storage in the cloud, and also runs on object storage compatible hardware on-premises for private cloud implementations.
Version 10.1.1 of Vertica introduced Docker and Kubernetes support.[ Vertica Blog: "Vertica 10.1.1 Goes Beyond Analytics with Support for Azure Cloud, Kubernetes, and Containers" Healey. April 30,2021]
Many BI, data visualization, and ETL tools work with Vertica Analytics Platform. Vertica supports Kafka for streaming data ingestion.
In 2021, Vertica released a connector for Spark.
Vertica also integrates with Grafana, Helm, Go, and Distributed R.
Company events
In January 2008,
Sybase filed a patent-infringement lawsuit against Vertica.
In January 2010, Vertica prevailed in a preliminary hearing,
[Monash, C: "Vertica slaughters Sybase in patent litigation",[13] DBMS2, January 14, 2010] and in June, 2010, Sybase and Vertica resolved the suit, with the court dismissing all infringement claims.
[Vertica Press Release, "Vertica Resolves Sybase Patent Lawsuits" http://www.vertica.com/news/press/vertica-resolves-sybase-patent-lawsuits/]
Since 2013, Vertica has held an annual user conference, now called Vertica Unify.
External links