As Incorta loads data and metadata via one of its 240+ connectors, it builds an open data lake that can feed both analytics and data science use cases. The lake uses standard storage formats such as Parquet, which provide both archival data durability and analytics performance without the risk of vendor lock-in. Customers can choose between Amazon S3, Google Cloud Storage, and Microsoft Azure Data Lake Storage Gen2 to get the security, availability, and low cost that best fit their needs.
The result is a single repository of data and a single source of truth for multiple workloads, including business intelligence, analytics, and machine learning. It also means fast performance whether you are exploring data for insights or developing solutions in Python, R, or Scala inside a Spark notebook.
From an infrastructure perspective, Incorta is flexible. It can run on-premises, be hosted with your own cloud provider, or be delivered as a fully managed cloud service. The platform can operate as a complete standalone data and analytics pipeline or as a component within a larger analytics and BI technology portfolio.