Characteristic	Data lake	Data warehouse
Supported data types	Data lakes can handle a combination of structured, semistructured and unstructured data, which is commonly stored in its native format to make the full sets of raw data available for analysis.	Data warehouses typically store structured data from transaction processing systems and other business applications. In most cases, the data is cleansed and curated before going into a data warehouse.
Analytics uses	Data lakes are primarily used for data science applications that involve machine learning, predictive modeling and other advanced analytics techniques. Analytics goals aren't always predefined.	Data warehouses support less-complex BI, ad hoc analysis, reporting and data visualization applications, usually with a predefined purpose for analyzing business operations and tracking KPIs.
Users	Data scientists and lower-level data analysts are the primary users of data lakes. They're often supported by data engineers, who build data pipelines and help prepare data for analysis as needed.	Business analysts, executives and operational workers use data warehouses through self-service BI tools. Alternatively, BI analysts and developers run queries in data warehouses for business users.
Data processing methods	Data lakes support traditional extract, transform and load (ETL) processes, but they're more likely to use extract, load and transform, or ELT, in which data is loaded as is and transformed for specific uses.	ETL processes are common for data integration and preparation in data warehouses. The data structure is finalized before data sets are loaded to support the planned BI and analytics applications.
Schema approach	The schema for data sets can be defined after they're stored in a data lake, using a schema-on-read approach.	Schemas in data warehouses are defined before data sets are loaded, following schema-on-write practices.
Data storage	Data is typically stored in platforms other than relational databases, such as the Hadoop Distributed File System, cloud object storage services or NoSQL databases.	Most commonly, data is stored in relational databases using conventional disk storage. Data warehouses can also be built on columnar databases, which likewise use disk storage.
Costs	Hardware costs can be lower because data lakes use lower-cost servers and storage. Data management might cost less, too. But the large size of some data lakes can erase the cost advantages.	In general, the large servers and disk storage systems required for data warehouses make them more expensive to deploy than data lakes. Managing a data warehouse can also be more costly.
Business benefits	Data lakes enable data science teams to analyze diverse sets of structured and unstructured data and create analytical models that provide insights for strategic planning and business decision-making.	Data warehouses provide a centralized repository of consolidated and curated data sets that can be easily accessed and used to analyze business performance and support operational decisions.
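
The schema approach row in the table can be illustrated with a short Python sketch. It is a minimal illustration under stated assumptions, not how any particular platform works: the ORDER_SCHEMA fields, the sample records and the function names (apply_schema, warehouse_load, lake_load, lake_read) are all hypothetical. The warehouse-style loader validates every record before storing it (schema-on-write), while the lake-style loader keeps the raw lines as is and applies the schema only when they are read (schema-on-read). The same ordering loosely mirrors the ETL vs. ELT distinction in the data processing row.

```python
import json

# Hypothetical schema for illustration: field name -> type constructor.
ORDER_SCHEMA = {"order_id": int, "amount": float}


def apply_schema(record, schema):
    """Coerce a raw record to the schema; raises if a field is missing or invalid."""
    return {name: cast(record[name]) for name, cast in schema.items()}


def warehouse_load(raw_lines, schema):
    """Schema-on-write: every record must fit the schema before it is stored."""
    return [apply_schema(json.loads(line), schema) for line in raw_lines]


def lake_load(raw_lines):
    """Schema-on-read, step 1: store the raw lines untouched."""
    return list(raw_lines)


def lake_read(stored_lines, schema):
    """Schema-on-read, step 2: apply the schema at query time.

    Records that do not fit this particular schema are skipped here,
    but they remain in storage for other uses.
    """
    rows = []
    for line in stored_lines:
        try:
            rows.append(apply_schema(json.loads(line), schema))
        except (KeyError, ValueError, TypeError):
            continue
    return rows


# Hypothetical raw input: the third record has no "amount" field.
raw = [
    '{"order_id": "1", "amount": "19.99"}',
    '{"order_id": "2", "amount": "5", "note": "gift"}',
    '{"order_id": "3"}',
]

stored = lake_load(raw)                  # all three raw records are kept
rows = lake_read(stored, ORDER_SCHEMA)   # two records fit the schema on read
```

In this sketch, calling warehouse_load(raw, ORDER_SCHEMA) raises a KeyError on the third record, so nothing is stored until the data is cleansed. That is the trade-off the table describes: the warehouse path enforces structure up front, while the lake path accepts everything and defers the structuring work to each analysis.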
