> For the complete documentation index, see [llms.txt](https://help.whaly.io/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://help.whaly.io/core-concepts/getting-started/data-stack-architecture.md).

# Data stack architecture

A Data Stack is a set of "layers" that are processing your data through the entire pipeline from extraction to consumption. This stack is needed in order to collect, store, transform, query and offer data to stakeholders so that they can extract useful information and make better decision.

The vendors you select for the different layer of your stack will evolve based on your business needs (number of stakeholders, current data adoption, overall trust and data skills of your team) and on your data (volume, origin, quality). Don't see your stack as being a "frozen/static" setup but more an ever evolving organism.

What is important to understand is that the layers that compose the stack are independent of the set of vendors that compose your stack at a given moment.

Here is a simplified view of a common data stack.

<img src="/files/qGnjKE7GcIIAjUGYplg1" alt="A common Data stack" class="gitbook-drawing">

### Why so many layers?

The rationale behind a data stack is the following:

a. Data needs to be extracted and loaded into a "**Storage layer**" that is designed for Data blending and analysis, called a "Data Warehouse" is needed to consolidate all your data data.

b. To feed this central storage layer, an "**Extraction layer**" is needed to connect to the siloed apps of your business that are producing the data (CRM, Finance, Database).

c. The data coming from your Business Apps needs to be cleaned and abstracted before being served to end users.  This is what is achieved in the "**Transform Layer**"

d. Your stakeholders (C-Levels/Managers/Data Analysts/Data Scientists...) need the proper tools to consume the cleaned and consolidated data. As they all have different jobs, responsibilities and skills, they need different set of tools to get access to their data. This is the "**Consumption Layer**".

## Where is Whaly positioned in the Data Stack?

Whaly is your **Business Intelligence**. It is part of the **Consumption layer**. The Business Intelligence is the main data access point for your whole company. It contains a set of curated Dashboards / Questions and Explorations that are easy to use interfaces to answer quickly to most questions asked in your Business.

The **Business Intelligence** is the most adopted tool of the Consumption Layer as everyone can use it across your organization. Hence it is the cornerstone of your data consumption layer where C-Levels, Engineers, Data Practitioners and regular users meet. The other tools of the consumption layer will be specialized tool for dedicated teams (Data Scientist, Engineers, ...).

#### About Whaly integrated modelling layer and connectors

In order to help you speed up your **Business Intelligence** projects deployment, Whaly offers "enablers" in the form of a **modelling layer** and **integrated connectors**. Those "enablers" can have an overlap with some existing vendors/components of your data stack, and can be replaced when/if you need to.


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://help.whaly.io/core-concepts/getting-started/data-stack-architecture.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.