Dremio Debuts Self-Service Data Platform

July 21, 2017

Dremio has entered the data analytics market with the immediate availability of the Dremio Self-Service Data Platform, a fundamentally new approach to data analytics. Working with existing data sources and business intelligence tools, Dremio’s solution eliminates the need for traditional ETL, data warehouses, cubes, and aggregation tables, as well as the infrastructure, copies of data, and effort these systems entail. Dremio combines consumer-grade ease of use with enterprise-grade security and governance, and includes execution and data acceleration technologies that dramatically accelerate analytical processing. Dremio has been released as a new open source project under the Apache license and is available for download today.

Founded in 2015 by a team of big data experts, Dremio has raised more than $15 million. The company’s software is being used by leading organizations in the US, Europe, Asia, and Australia, such as Daimler, a leading producer of premium cars and the world’s largest manufacturer of commercial vehicles, and OVH, Europe’s leading cloud provider. Additionally, technology providers including Microsoft, Tableau, and Qlik, as well as open source communities like Python Pandas and R, are collaborating with Dremio to deliver end-to-end self-service for data analytics.

Despite promises of software designed to unlock the value of data, analysts and data scientists continue to struggle to harness data for business intelligence and data science. Dremio accelerates time to insight by empowering analysts and data scientists to be independent and self-directed in their use of data, from any source and at any scale, while preserving governance and security.

“In our personal lives, most people expect to get answers to questions in just a few seconds. But in the workplace, it can take months to answer a question,” said Tomer Shiran, co-founder and CEO of Dremio. “We believe there is an enormous opportunity to improve the data experience for people in the workplace, by connecting popular BI and data science tools to the diverse data stores of the modern enterprise, eliminating the need for ETL and data warehouses. Dremio empowers analysts and data scientists to discover, explore, share, and accelerate any data at any time, no matter where it is or how big it is.”

“Dremio is a new breed of data analytics platform that doesn’t require ETL, cubes, data warehouses, or even data virtualization tools to deliver self-service analytics to data analysts,” said Wayne Eckerson, founder and principal consultant, Eckerson Group. “The big data platform, designed from the ground up for the cloud and Hadoop, works with any BI product or data science tool, sits between users and data sources, eliminating the need for data movement. This speeds deployments and provides agile access to data.”

Dremio provides a future-proof strategy for data, allowing customers to choose the best tools for analysts, and the right database technologies for applications, without compromising on the ability to leverage data to power the business.

Key capabilities include:

• Apache Arrow Execution Engine. Dremio is the first Apache Arrow-based distributed query execution engine. This represents a breakthrough in performance for analytical workloads as it enables extreme hardware efficiency and minimizes serialization and deserialization of in-memory data buffers between Dremio and client technologies like Python, R, Spark, and other analytical tools. Arrow is also designed for GPU and FPGA hardware acceleration, making it a powerful paradigm for machine learning workloads.

• Native Query Push Downs. Instead of performing full table scans for all queries, Dremio optimizes processing into underlying data sources, maximizing efficiency and minimizing demands on operational systems. Dremio rewrites SQL in the native query language of each data source, such as Elasticsearch, MongoDB, and HBase, and optimizes processing for file systems such as Amazon S3 and HDFS.

• Dremio Reflections™. Dremio accelerates processing and isolates operational systems from analytical workloads by physically optimizing data for specific query patterns, including columnarizing, compressing, aggregating, sorting, partitioning, and co-locating data. Dremio maintains multiple reflections of datasets, optimized for heterogeneous workloads that are fully transparent to users. Dremio’s query planner automatically selects the best reflections to provide maximum efficiency, providing a breakthrough in performance that accelerates processing by up to a factor of 1000.

• Comprehensive Data Lineage. Dremio's Data Graph preserves a complete view of the end to end flow of data for analytical processing. Companies have full visibility into how data is accessed, transformed, joined, and shared across all sources and all analytical environments. This transparency facilitates data governance, security, knowledge management, and remediation activities.

• Self-Service Model. Dremio was designed with analysts and data scientists in mind, providing a powerful and intuitive interface for users to easily discover, curate, accelerate, and share data for specific needs, without being dependent on IT. Users can also launch their favorite tools from Dremio directly, including Tableau, Qlik, Power BI, and Jupyter Notebooks.

• Built for the Cloud. Dremio was designed for modern cloud infrastructure, and is able to take advantage of elastic compute resources as well as object storage such as Amazon S3 for its Reflection Store. In addition, Dremio can analyze data from a wide variety of cloud-native and cloud-deployed data sources.
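
To make the Native Query Push Downs item above more concrete, here is a minimal sketch of the general idea: rewriting simple SQL comparison predicates as a MongoDB-style filter document so the source does the filtering instead of returning a full scan. It is an illustration only and does not show Dremio's planner or API; the function name `to_mongo_filter`, the predicate tuples, and the sample columns are all hypothetical. Only the MongoDB operators themselves (`$and`, `$eq`, `$ne`, `$gt`, `$gte`, `$lt`, `$lte`) are real.

```python
# Hypothetical illustration of predicate push-down; not Dremio's code or API.
from typing import Any

# SQL comparison operators mapped to the equivalent MongoDB query operators.
_MONGO_OPS = {
    "=": "$eq", "!=": "$ne",
    ">": "$gt", ">=": "$gte",
    "<": "$lt", "<=": "$lte",
}

Predicate = tuple[str, str, Any]  # (column, SQL operator, literal value)


def to_mongo_filter(predicates: list[Predicate]) -> dict[str, Any]:
    """Translate AND-ed SQL comparisons into a MongoDB filter document."""
    clauses = []
    for column, op, value in predicates:
        if op not in _MONGO_OPS:
            # Anything we cannot translate must be evaluated by the engine.
            raise ValueError(f"cannot push down operator: {op!r}")
        clauses.append({column: {_MONGO_OPS[op]: value}})
    # An empty filter document matches every document in MongoDB.
    return {"$and": clauses} if clauses else {}


# WHERE status = 'shipped' AND total >= 100 AND total < 500
where = [("status", "=", "shipped"), ("total", ">=", 100), ("total", "<", 500)]
print(to_mongo_filter(where))
```

A real engine would also have to decide which parts of a query a given source can evaluate and run the remainder itself; this sketch simply rejects operators it cannot translate.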

Customer Applications of Dremio

Because Dremio can be run in the cloud, on-premises, or as a service provisioned and managed in a Hadoop cluster, customers can easily deploy it to meet their needs at any scale. Popular use cases include BI on Modern Data, such as Elasticsearch, S3, and MongoDB; Data Acceleration, making even the largest datasets interactive in speed; Self-Service Data, making consumers of data more independent and less reliant on IT; and Data Lineage, tracking the full lineage of data through all analytical jobs across tools and users.

“With over 1 million customers and 270,000 servers across our 20 data centers, telemetry data about our infrastructure is a critical asset we use to remain competitive while providing a great experience to our customers,” said Vincent Terrasi, head of data, analytics, and CRM for OVH. “Dremio helps our data managers and analysts work with our data, independently and effectively. We are proud to be a part of this important open source community.”

“At Quantium, our ability to generate and operationalize analytics is key to delivering value to our clients,” said Alex Shaw, head of big data platforms, Quantium. “Embedding intelligence at scale is a critical differentiator for our services, as we combine our industry expertise and market leading tools with our customer's data assets and our own data ecosystem. Working with Dremio, we were quickly able to achieve a 5x improvement while testing our key analysis workloads.”

Dremio Partner Ecosystem

By working closely with partners, Dremio looks to change the current approach to data analytics by expanding the big data, business intelligence, and analytics ecosystem for the enterprise.

“The goal of Microsoft Power BI is to democratize data analysis and make it available to all users in an enterprise,” said Miguel Martinez, senior product marketing manager, Microsoft Power BI, Microsoft Corp. “Dremio’s ability to accelerate data from any source makes users of Power BI more productive. We are proud to be working with Dremio to make this product available to our customers.”

“Qlik is a pioneer in self-service BI and visual analytics,” said Hjalmar Gislason, VP of data at Qlik. “Dremio shares our vision of making analysts and data scientists increasingly independent and productive. I have been waiting for a solution like Dremio to emerge in the rapidly evolving landscape of modern data sources, and am excited about the benefits it will bring to our more than 40,000 customers.”

“Python is an essential language for data science,” said Wes McKinney, creator of Pandas and software architect at Two Sigma Investments. “Dremio is solving an important piece of the data analytics stack, by providing Apache Arrow-based query execution across the different systems that store data. Fast and easy access to data improves computational efficiency and makes data scientists more productive.”

“With more than 100,000 curated datasets, Enigma is the leading provider of analysis-ready public data,” said Hicham Oudghiri, CEO of Enigma Technologies. “Customers rely on our open source intelligence to enrich their enterprise data to drive smarter decision making. Dremio’s approach for self-service data analytics can drive immense productivity in all types of organizations. We are excited to partner with this innovative open source company.”

Dremio is distributed as a Community Edition, which is open source and free for anyone, as well as an Enterprise Edition, which is available as part of an annual subscription with support, a commercial license, and enterprise features.
