SAP Datasphere Tutorial: Build Your First Data Model from Scratch

A hands-on SAP Datasphere tutorial covering spaces, connections, data flows, replication flows, analytic models, and consumption through SAP Analytics Cloud and SQL.

Updated June 14, 2026

SAP Datasphere Tutorial

SAP Datasphere is SAP's cloud data warehouse. This SAP Datasphere tutorial walks you through the practical path from a fresh tenant to a working analytic model that business users can consume in SAP Analytics Cloud, Excel, or any SQL client. If you are new to Datasphere and want a hands-on, step-by-step walkthrough instead of a feature list, start here.

By the end of this tutorial you will have created a space, connected a data source, brought data in, built an analytic model, and queried it. The concepts you pick up along the way (spaces, connections, data flows vs replication flows, analytic models) are the ones you will use in every real Datasphere project.

Prerequisites

To follow this tutorial you need:

An SAP BTP account with an entitlement for SAP Datasphere. SAP offers a free tier on some hyperscaler regions, which is enough for this walkthrough.
The DW Administrator or DW Modeler role collection assigned to your user in the BTP subaccount.
A data source. Easiest options: an SAP S/4HANA system you can reach, a public CSV file in cloud storage, or one of SAP's open sample datasets (the SAP Goldref sample data and the sap-samples/datasphere GitHub repo both work well).
Basic SQL knowledge. You do not need to write raw SQL to complete the tutorial, but understanding SELECT, JOIN, and GROUP BY will make the modeling steps feel natural.

Step 1: Access Datasphere

If Datasphere is not yet subscribed in your subaccount:

Open the BTP cockpit for your subaccount.
Go to Service Marketplace and search for SAP Datasphere.
Click Create to subscribe, accepting the default plan.
Assign the DW Administrator role collection to your user under Security > Users.
From the subscription in Service Marketplace, click Go to Application.

The link opens the Datasphere home page, which is the entry point for everything: spaces, connections, the Data Builder, the Business Builder, and the data marketplace. Bookmark this URL; it is the one you will use daily.

Step 2: Create a Space

A space is the unit of isolation in Datasphere. Each space has its own storage, its own compute, and its own membership. You typically create one space per environment (dev, test, prod) or per business domain (sales, finance, HR).

To create your first space:

From the left navigation, open System > Spaces.
Click New Space and give it a name and a space ID (for example, SALES).
Set Storage to the default HANA Cloud database assigned to your tenant.
Assign yourself and any collaborators as members with the Modeler role.
Save.

The space is now the target for everything you build next. Database users, data access controls, and storage quotas are all scoped to the space you are working in. When you later transport content from dev to prod, the transport carries objects between spaces.

Step 3: Connect a Data Source

A connection is Datasphere's link to an external system. Datasphere ships with more than 40 connection types: S/4HANA, BW/4HANA, HANA Cloud, SuccessFactors, ABAP CDS views, cloud databases (Snowflake, Databricks, Redshift, BigQuery), object storage, generic OData, and partner connectors.

To connect an S/4HANA source (the most common case):

Open Data Builder > Sources (or Connections depending on tenant version).
Click New Connection and pick SAP S/4HANA or SAP ABAP.
Fill in the connection details: host, client, system number, and credentials. If the source is on premise, the connection also needs SAP Cloud Connector and, for some features such as remote tables, a Data Provisioning Agent installed in your network.
Test the connection, then save.

For a zero-setup alternative, use a CSV file connection or upload a local file directly into a local table. This is a great way to try the modeling steps even if you have no SAP system available.
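If you go the file route, a small sample file is enough to exercise the later steps. The sketch below writes a hypothetical sales order CSV using only Python's standard library. The column names mirror the ones used in the SQL later in this tutorial; the file name and the rows are invented for illustration.

```python
import csv
import io

# Column names match the SQL example in Step 4; the rows are made-up sample data.
FIELDS = ["OrderID", "CustomerID", "NetAmount", "Status"]
ROWS = [
    {"OrderID": "SO-1001", "CustomerID": "C001", "NetAmount": "1200.00", "Status": "Completed"},
    {"OrderID": "SO-1002", "CustomerID": "C001", "NetAmount": "350.50", "Status": "Completed"},
    {"OrderID": "SO-1003", "CustomerID": "C002", "NetAmount": "980.00", "Status": "Cancelled"},
]


def build_csv(rows, fields=FIELDS):
    """Return the rows as CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fields, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def write_csv(path, rows):
    """Write the CSV text to disk as UTF-8."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        handle.write(build_csv(rows))


if __name__ == "__main__":
    # Hypothetical file name; upload the result through the file import dialog.
    write_csv("sales_order_header.csv", ROWS)
```

Keeping amounts as plain decimal strings with a dot separator tends to avoid locale surprises when the file is parsed on import.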

After the connection is saved, the source's tables and ABAP CDS views become available as remote tables you can use in data flows, replication flows, and views.

Step 4: Create a Data Flow or Replication Flow

Datasphere gives you two main ways to bring data in, and the choice matters.

A replication flow copies data from a source into a Datasphere local table, either on a schedule or in near real time, with minimal transformation. Use it when you want the data physically inside Datasphere for performance and reliability.
A data flow is an ETL pipeline. You read from a source, apply a chain of transformations (joins, projections, aggregations, calculated columns, filters), and write to a local table. Use it when the source data needs shaping before it is useful.

To create a replication flow for an S/4HANA sales order table:

Open Data Builder and create a new Replication Flow.
Add a source (your S/4HANA connection) and pick the table or CDS view to replicate.
Set the target to your space and choose the load type: Initial Only for a one-off snapshot, or Initial and Delta for ongoing near-real-time replication.
Set a schedule or enable streaming.
Save, then Run the flow once to load the data.
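To make the two load types concrete, here is a conceptual sketch in plain Python. It is not Datasphere's actual replication mechanism: it assumes a hypothetical change record format (an operation code plus a row keyed by OrderID) and shows how an initial snapshot followed by a stream of deltas keeps a target table in step with the source.

```python
from typing import Dict, Iterable, Tuple

Row = Dict[str, object]
# Hypothetical change record: "I" insert, "U" update, "D" delete, plus the row.
Change = Tuple[str, Row]


def initial_load(source_rows: Iterable[Row], key: str) -> Dict[object, Row]:
    """Copy every source row into a fresh target keyed by the primary key."""
    return {row[key]: dict(row) for row in source_rows}


def apply_delta(target: Dict[object, Row], changes: Iterable[Change], key: str) -> None:
    """Apply inserts, updates, and deletes to the target in arrival order."""
    for op, row in changes:
        if op in ("I", "U"):
            target[row[key]] = dict(row)      # upsert the full row image
        elif op == "D":
            target.pop(row[key], None)        # tolerate keys that are already gone
        else:
            raise ValueError(f"unknown operation: {op!r}")


source = [
    {"OrderID": "SO-1001", "Status": "Open"},
    {"OrderID": "SO-1002", "Status": "Completed"},
]
target = initial_load(source, key="OrderID")

changes = [
    ("U", {"OrderID": "SO-1001", "Status": "Completed"}),
    ("I", {"OrderID": "SO-1003", "Status": "Open"}),
    ("D", {"OrderID": "SO-1002"}),
]
apply_delta(target, changes, key="OrderID")
print(sorted(target))  # ['SO-1001', 'SO-1003']
```

In these terms, Initial Only stops after `initial_load`, while Initial and Delta keeps applying changes as they arrive.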

To create a data flow that transforms the replicated data:

In Data Builder, create a new Data Flow.
Drag the replicated local table onto the canvas as the source.
Add transforms: a Projection to pick columns, an Aggregation to compute totals by customer, a Join to enrich with a customer dimension.
Add a Local Table target and map the output columns.
Save and run the data flow.

The result is a clean, modeled local table that is ready to feed an analytic model.

The SQL underneath a typical data flow would look roughly like this if you wrote it by hand (Datasphere generates it for you):

-- Build one row per customer with total revenue and order count.
CREATE COLUMN TABLE "SALES"."CUSTOMER_REVENUE" AS (
  SELECT
    h."CustomerID",
    c."CustomerName",
    c."Region",
    SUM(h."NetAmount")          AS "TotalRevenue",
    COUNT(DISTINCT h."OrderID") AS "OrderCount"
  FROM "SALES"."SALES_ORDER_HEADER" h
  -- Inner join: orders without a matching customer record are dropped.
  JOIN "SALES"."CUSTOMER" c
    ON h."CustomerID" = c."CustomerID"
  -- Only completed orders count toward revenue.
  WHERE h."Status" = 'Completed'
  GROUP BY h."CustomerID", c."CustomerName", c."Region"
);
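To experiment with this logic before building the flow, here is a minimal sketch that runs the same join and aggregation in SQLite through Python's built-in sqlite3 module. SQLite stands in for the Datasphere runtime purely for illustration: it has no "SALES" schema or column tables, so the table names are unqualified, and the sample rows are invented.

```python
import sqlite3


def build_customer_revenue(conn):
    """Create sample source tables, derive CUSTOMER_REVENUE, and return its rows."""
    conn.executescript("""
        CREATE TABLE SALES_ORDER_HEADER (
            "OrderID" TEXT PRIMARY KEY,
            "CustomerID" TEXT,
            "NetAmount" REAL,
            "Status" TEXT
        );
        CREATE TABLE CUSTOMER (
            "CustomerID" TEXT PRIMARY KEY,
            "CustomerName" TEXT,
            "Region" TEXT
        );
        INSERT INTO CUSTOMER VALUES
            ('C001', 'Acme', 'EMEA'),
            ('C002', 'Globex', 'APJ');
        INSERT INTO SALES_ORDER_HEADER VALUES
            ('SO-1', 'C001', 100.0, 'Completed'),
            ('SO-2', 'C001', 250.0, 'Completed'),
            ('SO-3', 'C001', 999.0, 'Cancelled'),
            ('SO-4', 'C002', 400.0, 'Completed');

        -- Same shape as the hand-written SQL: filter, join, aggregate.
        CREATE TABLE CUSTOMER_REVENUE AS
        SELECT
            h."CustomerID"              AS "CustomerID",
            c."CustomerName"            AS "CustomerName",
            c."Region"                  AS "Region",
            SUM(h."NetAmount")          AS "TotalRevenue",
            COUNT(DISTINCT h."OrderID") AS "OrderCount"
        FROM SALES_ORDER_HEADER h
        JOIN CUSTOMER c
          ON h."CustomerID" = c."CustomerID"
        WHERE h."Status" = 'Completed'
        GROUP BY h."CustomerID", c."CustomerName", c."Region";
    """)
    return conn.execute(
        'SELECT "CustomerID", "TotalRevenue", "OrderCount" '
        'FROM CUSTOMER_REVENUE ORDER BY "CustomerID"'
    ).fetchall()


conn = sqlite3.connect(":memory:")
rows = build_customer_revenue(conn)
print(rows)  # [('C001', 350.0, 2), ('C002', 400.0, 1)]
```

The cancelled order SO-3 is excluded by the status filter, which is why C001 ends up with 350.0 across two orders.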

Step 5: Build an Analytic Model

Data flows and local tables are technical objects. To make the data consumable by business tools, you build an analytic model on top of them. In current Datasphere releases the analytic model is created in the Data Builder; the Business Builder is a separate area for business entities, fact models, and consumption models. The analytic model is Datasphere's semantic object: it defines measures, dimensions, hierarchies, associations, and the default behavior consumers see.

To build an analytic model:

Open the Data Builder and choose New Analytic Model.
Pick the fact source (the local table or view from Step 4).
Add dimensions (for example, Customer and Region) by associating to dimension views.
Define measures: the default measures are the numeric columns, but you can add calculated measures such as Revenue per Order = TotalRevenue / OrderCount.
Add hierarchies where they make sense (Region > Country > City for geography).
Configure variables for runtime filtering (a date range, a region picker).
Save and Deploy the model.

Once deployed, the analytic model is published to the catalog and is available as a live data source to SAP Analytics Cloud, Excel, and any SQL consumer with access.
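One detail worth checking for a ratio measure such as Revenue per Order is the order of operations: the ratio should be computed from the aggregated totals, not averaged across rows. The sketch below uses invented figures for two customers in the same region to show how far apart the two results can be. It is plain Python arithmetic for illustration, not a description of how Datasphere evaluates measures internally.

```python
# Invented sample rows: one high-volume customer, one single large order.
rows = [
    {"Region": "EMEA", "TotalRevenue": 1000.0, "OrderCount": 10},  # 100 per order
    {"Region": "EMEA", "TotalRevenue": 900.0, "OrderCount": 1},    # 900 per order
]


def revenue_per_order(rows):
    """Ratio of the aggregated totals: SUM(TotalRevenue) / SUM(OrderCount)."""
    total_revenue = sum(r["TotalRevenue"] for r in rows)
    total_orders = sum(r["OrderCount"] for r in rows)
    return total_revenue / total_orders if total_orders else None


def average_of_row_ratios(rows):
    """Average of per-row ratios, which overweights small customers."""
    ratios = [r["TotalRevenue"] / r["OrderCount"] for r in rows if r["OrderCount"]]
    return sum(ratios) / len(ratios) if ratios else None


print(round(revenue_per_order(rows), 2))   # 172.73
print(average_of_row_ratios(rows))         # 500.0
```

The first figure (1900 in revenue over 11 orders) is the one most readers expect from a measure named Revenue per Order, so it is worth confirming that the calculated measure in your model aggregates before dividing.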

Step 6: Consume the Model

The analytic model is the contract between IT and the business. From here, you consume it through whichever tool fits the audience:

SAP Analytics Cloud is the primary consumer. Create a connection of type SAP Datasphere in SAC, select the analytic model, and build a story or dashboard on top of it with live data.
Microsoft Excel connects through the SAP Analytics Cloud add-in or the Datasphere ODBC driver, letting analysts build pivot tables directly on the model.
SQL clients (DBeaver, DataGrip, the Datasphere SQL console) query the model through a standard SQL interface. Datasphere exposes an OpenSQL schema for the space.

A SQL query against the deployed model looks like standard SQL:

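-- Assumes the deployed model also exposes a "FiscalYear" column;
-- the CUSTOMER_REVENUE table from Step 4 does not contain one.
SELECT
  "Region",
  SUM("TotalRevenue") AS "Revenue",
  SUM("OrderCount")   AS "Orders"
FROM "SALES"."CUSTOMER_REVENUE_MODEL"
WHERE "FiscalYear" = '2026'
GROUP BY "Region"
ORDER BY "Revenue" DESC;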

Because the semantic definitions (currency handling, hierarchies, security) live in the analytic model, every consumer sees consistent results.
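The same query shape can be tried locally. This is a minimal sketch using Python's sqlite3 module as a stand-in for a SQL client. The unqualified table name, the FiscalYear column, and the rows are assumptions made for the example; a real connection to Datasphere would go through a database user and an SAP HANA-compatible client instead.

```python
import sqlite3


def revenue_by_region(conn, fiscal_year):
    """Return (Region, Revenue, Orders) for one fiscal year, highest revenue first."""
    query = """
        SELECT
            "Region",
            SUM("TotalRevenue") AS "Revenue",
            SUM("OrderCount")   AS "Orders"
        FROM CUSTOMER_REVENUE_MODEL
        WHERE "FiscalYear" = ?
        GROUP BY "Region"
        ORDER BY "Revenue" DESC
    """
    # Bind the year as a parameter instead of formatting it into the SQL text.
    return conn.execute(query, (fiscal_year,)).fetchall()


conn = sqlite3.connect(":memory:")
conn.execute(
    'CREATE TABLE CUSTOMER_REVENUE_MODEL ('
    '"Region" TEXT, "FiscalYear" TEXT, "TotalRevenue" REAL, "OrderCount" INTEGER)'
)
conn.executemany(
    "INSERT INTO CUSTOMER_REVENUE_MODEL VALUES (?, ?, ?, ?)",
    [
        ("EMEA", "2026", 350.0, 2),
        ("APJ", "2026", 400.0, 1),
        ("EMEA", "2026", 150.0, 1),
        ("APJ", "2025", 999.0, 3),   # filtered out by the fiscal year predicate
    ],
)

print(revenue_by_region(conn, "2026"))  # [('EMEA', 500.0, 3), ('APJ', 400.0, 1)]
```

Passing the fiscal year as a bound parameter keeps the query text constant and avoids quoting mistakes when the value comes from user input.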

Datasphere Concepts Reference

The terminology is the steepest part of the learning curve. Keep this reference handy:

Concept	What it is	Where it lives
Space	Isolated unit with its own storage, compute, and members	System configuration
Connection	Link to an external source (S/4HANA, cloud DB, file, OData)	Connections area
Remote Table	A table or view exposed through a connection, queried in place	Data Builder
Local Table	A table physically stored in the Datasphere space	Data Builder
Replication Flow	Scheduled or streaming copy of source data into a local table	Data Builder
Data Flow	ETL pipeline with transforms, writes to a local table	Data Builder
Intelligent Lookup	A join object with business semantics and auto-matching	Data Builder
View (graphical/SQL)	A modeled view on top of tables and other views	Data Builder
Association	A relationship between two entities, used by models and lookups	Data Builder / Business Builder
Analytic Model	The semantic object consumed by SAC, Excel, and SQL	Data Builder
Task Chain	Orchestration that sequences flows on a schedule	Data Builder

Datasphere with AI Coding Assistants

Datasphere has a dense, specific vocabulary. A general-purpose AI assistant will routinely confuse data flows with replication flows, suggest BW concepts that do not apply, or generate SQL that uses syntax the Datasphere OpenSQL layer rejects. The YAML and JSON formats for connections, agents, and content transport are Datasphere-specific and unforgiving.

The sap-datasphere skill gives an AI assistant the correct terminology, object model, and configuration patterns so it can help you design spaces, draft data flow logic, generate content transport definitions, and produce SQL that actually runs on the Datasphere SQL layer. Install it to bring accurate Datasphere context into your editor:

npx skills add secondsky/sap-skills --skill sap-datasphere

Pair it with sap-sac-planning or sap-sac-scripting if your consumption layer is SAP Analytics Cloud, and with sap-abap-cds if you are federating ABAP CDS views from S/4HANA. With the right skills loaded, an assistant can help you pick the right object type for each requirement, draft the modeling steps, and generate CLI commands for automating content transport between dev and prod tenants.

Note: SAP Skills is a community-maintained, open-source collection of plugins for AI coding assistants. It is not an official SAP product and is not affiliated with or endorsed by SAP SE. The skills encode publicly documented SAP knowledge to help AI assistants produce more accurate SAP code.

Related Skills

Data & Analytics

sap-bw-query

Use when automating SAP BW query inspection, InfoProvider metadata reads (characteristics, key figures), metadata-verified specification review, unsaved draft preparation, or human-confirmed query draft population through Eclipse or HANA Studio with BW Modeling Tools.

Includes: 5 commands, MCP

sap-datasphere

SAP Datasphere development skill with 3 specialized agents, 5 slash commands, and validation hooks. Use when building data warehouses on SAP BTP, creating analytic models, configuring data flows and replication flows, setting up connections, managing spaces and users, implementing data access controls, using the datasphere CLI, or inspecting authenticated Datasphere browser UI state with Microsoft Edge CDP. Covers Data Builder, Business Builder, analytic models, 40+ connection types, real-time replication, task chains, content transport, and data marketplace.

Includes: 3 agents, 5 commands, MCP, hooks

sap-sac-custom-widget

SAP Analytics Cloud (SAC) Custom Widget development. Use when building custom visualizations, extending SAC with Web Components, or creating Widget Add-Ons. Covers JSON metadata, JavaScript Web Components, lifecycle functions, data binding with feeds, styling/builder panels, property/event/method definitions, third-party library integration, hosting, security, performance, and debugging. Includes Widget Add-On feature (QRC Q4 2023+) and templates for widgets, charts, and KPI cards.

Includes: 3 agents, 3 commands, hooks

sap-sac-planning

SAP Analytics Cloud (SAC) planning guidance for planning models, planning-enabled stories, data actions, multi actions, version management, data locking, calendar/input workflows, allocations, value driver trees, BPC live planning, and Seamless Planning with SAP Datasphere. Use this for planning design, planning APIs, data action debugging, planning performance reviews, and authenticated SAC planning story triage in Microsoft Edge via CDP; use sap-sac-scripting for non-planning SAC scripts and sap-datasphere for Datasphere modeling.

Includes: 3 agents, 3 commands, hooks

sap-sac-scripting

Comprehensive SAC scripting skill for SAP Analytics Cloud Analytics Designer and Optimized Story Experience. This skill should be used when the user asks to "create SAC script", "debug Analytics Designer", "optimize SAC performance", "planning operations in SAC", "filter data in SAC", "use DataSource API", "chart scripting", "table manipulation", "SAC event handlers", "version management", "data locking", "Optimized Story Experience API", "OSE scripting", "OSE widget API", "OSE DataSource", "story scripting API", "OSE planning API", "OSE method", "optimized story", "SAC story scripting", "story script", "SAC scripting", "debug SAC runtime in Microsoft Edge via CDP", or works with SAC widgets, planning models, or analytics applications.

Includes: 4 agents, 4 commands, MCP, hooks

sap-sac-test-automation

SAP Analytics Cloud (SAC) automated testing skill for designing capability-gated browser discovery and deterministic Playwright test suites for SAC stories, dashboards, reports, planning workflows, comments, permissions, visual regression, and reusable QA automation. This skill should be used when building SAC end-to-end tests, onboarding SAC dashboards into Playwright, creating dashboard profiles or scenario YAML, using Microsoft Edge/CDP, Chrome DevTools MCP, Vercel Labs agent-browser, or manual discovery for SAC components, testing SAC optimized stories, configuring SAC auth storage state, managing visual/data baselines, testing comments, planning writeback, data actions, multi actions, role-based views, restricted Windows/company environments, or creating SAC failure triage artifacts.

Includes: agents, commands

Frequently Asked Questions

How do I access SAP Datasphere?

From the SAP BTP cockpit, open Service Marketplace, subscribe to SAP Datasphere, and assign the relevant role collections (such as DW Administrator or DW Modeler) to your user. The subscription URL opens the Datasphere home page where you manage spaces, connections, and models.

What is a space in Datasphere?

A space is a self-contained unit inside a Datasphere tenant with its own storage, compute, and user membership. Spaces are how you separate dev/test/prod, business domains, or partner projects. Database users and data access controls are scoped to a space.

How do I create a data model in Datasphere?

Use the Data Builder to create local tables or SQL/graphical views on top of your connections, then create an analytic model that defines measures, dimensions, and associations. The analytic model is what consumption tools like SAP Analytics Cloud query.

Can I use SAP Analytics Cloud with Datasphere?

Yes. SAP Analytics Cloud has a native live data connection to SAP Datasphere analytic models. Create a connection of type SAP Datasphere in SAC, point it at your tenant, and build stories and dashboards directly on top of the analytic models you publish.

What is a replication flow?

A replication flow copies data from a source (an S/4HANA table or ABAP CDS view, a cloud database, or a file) into a Datasphere local table on a schedule or in near real time. It is the lightweight alternative to a data flow when you need the data in Datasphere with minimal transformation.

How is Datasphere different from BW?

Datasphere is the cloud-native, federation-first data warehouse. BW/4HANA is the traditional, ETL-heavy data warehouse that replicates and persists large volumes. The two are often integrated, with Datasphere acting as the consumption and federation layer over BW and S/4HANA data.