Databricks nudges closer to bridging the gap between data lakes and warehouses

by usscmc
November 14, 2020

Continuing its quest to make freeform data lakes a viable alternative to highly structured data warehouses, Databricks Inc. today debuted an engine that enables many workloads previously targeted at data warehouses to be executed on data lakes instead.

SQL Analytics is said to combine data warehousing performance with data lake economics, enabling SQL queries on a data lake to run up to nine times faster than they would on a data warehouse. It’s another building block in the construction of what the company calls a “lakehouse,” an architecture that supports both types of workloads without requiring extract databases to be created for warehousing purposes.

Data warehouses are highly structured databases that combine information from multiple sources in a single repository that can be queried to discover new relationships between data elements. Data lakes are centralized repositories that combine structured, unstructured and semistructured data and are commonly used for machine learning and data science applications.

Conventional wisdom holds that the two architectures are fundamentally incompatible, but Databricks believes it can find common ground.

“This is about providing not just a first-class data science platform for machine learning and data science but also for queries in a way that is highly performant, low latency and at high user concurrency,” said Joel Minnick, Databricks’ vice president of marketing. “We believe the data lake is the center of gravity because it’s so good at handling the unstructured information that data science and machine learning innovation comes from. Data warehouses weren’t built for that.”

A lakehouse supports both kinds of workloads with a single architecture. SQL Analytics is built on Delta Lake, a table storage layer created by Databricks and released to open source a year ago. It provides some of the data reliability and quality features that data lakes typically lack.

“Delta Lake provides reliability by making data lakes operate in a transactional way,” Minnick said. It does that by adding a transaction log to the data lake that supersedes the data itself.

“So now I’m querying transaction logs to get the single source of truth regardless of the data in the lake itself,” Minnick said. “By running SQL workloads directly on the data lake, I can substantially reduce the number of ETL [extract/transform/load] pipelines I have to maintain.” That translates into fewer copies of data and less risk of conflict.
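The transaction-log idea Minnick describes can be illustrated with a toy model. The sketch below is not Delta Lake's actual API or file format; the `Action` class, the `live_files` function and the file names are hypothetical, chosen only to show how replaying a log of "add" and "remove" actions can decide which files count as part of a table, regardless of what else is sitting in storage.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Action:
    """One entry in a commit (hypothetical shape, for illustration only)."""
    op: str    # "add" or "remove"
    path: str  # data file the action refers to


def live_files(log):
    """Replay commits in order and return the files currently in the table."""
    files = set()
    for commit in log:
        for action in commit:
            if action.op == "add":
                files.add(action.path)
            elif action.op == "remove":
                files.discard(action.path)
            else:
                raise ValueError(f"unknown action: {action.op}")
    return files


# Two committed transactions: the second rewrites part-000 into part-002.
log = [
    [Action("add", "part-000.parquet"), Action("add", "part-001.parquet")],
    [Action("remove", "part-000.parquet"), Action("add", "part-002.parquet")],
]

# Storage may still hold stale files (part-000) or files from a write
# that never committed (part-999); the log decides what readers see.
files_in_storage = {
    "part-000.parquet",
    "part-001.parquet",
    "part-002.parquet",
    "part-999.parquet",
}

readable = sorted(live_files(log) & files_in_storage)
print(readable)  # ['part-001.parquet', 'part-002.parquet']
```

In this simplified picture, a reader never lists the storage directory to decide what the table contains. It consults the log, which is why stale or half-written files do not affect query results.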

Data quality matters

Minnick said SQL Analytics does not eliminate the need to massage data for consistency or to conduct ETL if moving the data elsewhere. “If the data’s messy, then the data’s messy, but not having to push it around is an advantage for a lot of our customers,” he said. “By doing the transformations on the data lake, everyone is using the same data set and there’s one source of truth.”

Although various tools have long enabled SQL queries to be performed on data lakes, performance has typically been a weak point. Databricks said it has come up with two ways to improve responsiveness. The first is auto-scaling endpoints that keep query latency consistently low under high user load. The second is Delta Engine, which it said can complete queries quickly against data sets of any size.

“With Delta Engine we were able to solve the throughput issue,” Minnick said. “With SQL Analytics, customers can create SQL-tuned clusters that stand up or spin down based on the number of users querying that data lake.” That means customers can get the concurrency benefits of a data warehouse without leaving the data lake environment.
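As a rough illustration of clusters that "stand up or spin down" with demand, the sketch below sizes a pool of clusters from the number of concurrent queries. It is not Databricks' scaling policy, which the article does not describe; the function name `clusters_needed` and every number in it (queries per cluster, minimum and maximum cluster counts) are assumptions made up for the example.

```python
import math


def clusters_needed(concurrent_queries, queries_per_cluster=10,
                    min_clusters=1, max_clusters=8):
    """Return how many clusters to run for the current load.

    All defaults are hypothetical; a real policy would also consider
    queue time, cluster start-up delay and cost limits.
    """
    if concurrent_queries < 0:
        raise ValueError("concurrent_queries must be non-negative")
    wanted = math.ceil(concurrent_queries / queries_per_cluster)
    # Clamp to the configured bounds so the pool never drops to zero
    # and never grows past the budgeted maximum.
    return max(min_clusters, min(max_clusters, wanted))


for load in (0, 7, 35, 500):
    print(load, clusters_needed(load))
# 0 1
# 7 1
# 35 4
# 500 8
```

The clamp is the design choice worth noting: a floor keeps latency low for the first user to arrive, and a ceiling caps spend when load spikes.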

Databricks said SQL Analytics doesn’t obviate the need for a data warehouse but can handle most warehouse-like applications that don’t require writing updates or driving operational processes. “Right now we’re focusing mainly on business intelligence analytics and reporting,” rather than write-intensive processes, Minnick said.

Databricks, which is privately held, said it achieved a greater than $350 million revenue run rate in the third quarter of 2020, up from $200 million in the same quarter last year.

SQL Analytics will be available for public preview on Nov. 18.

Photo: Pok_Rie/Pixabay
