Dremio - What is an SQL Lakehouse and Why Would You Need One?
Common Sense Virtual Roundtable:

What is an SQL Lakehouse Platform and Why Would You Need One?

November 10th, Successfully held

Presented by

Dremio

Here’s what we discussed:

An SQL lakehouse platform brings together the best of both the data warehouse and the data lake, and removes the limitations of traditional data warehouses and closed data architectures.

In this virtual roundtable session, we discussed how this is achievable and got answers to the following questions:

  • Why do you need an SQL lakehouse platform?
  • Can it meet your latency and throughput requirements?
  • Will it improve the efficiency of overburdened data engineers?
  • Does it support data portability with open APIs, data formats, and source code?
  • Perhaps most importantly, can enterprise teams govern data consumption on the SQL lakehouse platform?

The session was moderated by Scott Gay from Dremio.

Solution Expert 

Scott Gay
Solution Architect at Dremio

Here is what we learned:

Mark Perrault, Technical Solution Architect (Data and Special Projects), Sallie Mae

• Mark said Sallie Mae has a lake that’s more like a pond. It’s disconnected, and they are trying to figure out how it should be set up.
• Mark’s instant reaction when he hears data lake or lakehouse or anything with lake in it is that it’s a buzzword. It sounds cool and neat, and it allows you to see the S3 pipeline quickly. How to govern it is a struggle because the user may not know that it is raw data. How do you control the dataflow?
• It took Sallie Mae 6 to 9 months to build their data lake, and it was quickly used to build reports. Mark and his team needed to make sure that when it’s used in raw format, the end user understands that. How the data gets there is the least of their concerns, while what data gets there is their biggest concern.
• The technology is hyper-fast, and the technical team needs to know how to slow its flow.
• Mark said he is cautiously optimistic. As they move out of the data warehouse, they want to stay somewhat agnostic, but have a tool layered on top that will scale it.
• The value of the lake is that the data is in its raw format.
• Mark asked what keeps it mobile.

Request detailed session notes

We are careful about who we send this key document to. Session notes are sent on request, provided your profile matches our qualification criteria.

About Dremio:

Dremio is a high-performance SQL (data) lakehouse platform built on an open data architecture that helps to accelerate BI and Analytics directly on cloud data lake storage. Created by veterans of open source and big data technologies, and the creators of Apache Arrow, Dremio is a fundamentally new approach to data analytics that helps companies get more value from their data, faster. Dremio makes data engineering teams more productive, and data consumers more self-sufficient.

Participation in the Virtual Roundtable is free of charge to qualified attendees. Once you’ve completed the registration, we’ll confirm your invitation and send you a calendar invite with a link to the meeting.

If you don’t qualify, we’ll suggest other learning events that may be a better fit for you.

Talk to Us

Mitch Speers

+1 917-750-7287

Ash Prasad

+1 218-389-7816



