This document describes the full installation process for deploying Pulse in a Dataiku environment.
It assumes familiarity with Dataiku administration concepts such as plugins, code environments, Code Studios, and API keys.
To install Pulse, you must be a Dataiku Platform Admin on the primary (core) Dataiku instance.
This is required because Pulse installation involves:
- Installing a Dataiku plugin
- Creating a Code Environment
- Creating a Code Studio template
- Creating and managing Dataiku projects
- Running administrative macros
In practice, full admin access on the core instance is required.
Pulse can collect metadata and usage data from multiple Dataiku instances.
For each instance you want to connect:
- You must be a full admin
OR - An admin must generate an API key with sufficient permissions
API keys must be able to:
- Read metadata
- Read audit and usage logs
These API keys are configured during Pulse setup.
-
Log in to Dataiku as an administrative user
-
Navigate to Waffle → Plugins
-
Install the plugin from Git:
https://github.com/dataiku/dss-plugin-dataiku-pulse.git -
Build the plugin code environment
- No containers are required
After installation, open the Plugin Settings page.
Create a single parameter set named:
primary
Populate the following sections:
- Repository URL
- Branch (typically
main)
- Dashboard Project Key
DATAIKU_PULSE_DASHBOARD - Dashboard Host URL (hostname or IP:port)
- Dashboard Host API Key (admin-level)
- Blob Storage Connection (AWS, Azure, or GCS)
- Worker Project Key
DATAIKU_PULSE_WORKER - One entry per Dataiku instance:
- Hostname or IP:port
- Admin-level API key
- User to own and run scenarios
- Optional settings:
- Ignore certificates
- Project data parallelization
- Core count
You may create additional parameter sets for host-specific overrides:
- Custom user
- Certificate behavior
- Parallelism / core settings
-
Navigate to Code Studios
-
Create a template named:
dataiku_pulse_dashboard⚠️ This name is required -
Configure the template to:
- Run on Kubernetes
- Use the Dataiku Pulse (Streamlit Custom) block
- Disable user permissions
-
Build the template
-
Create the Pulse Dashboard Project
-
Navigate to Macros
-
Filter on:
Dataiku Pulse: Initialize -
Run:
- Initialize Dashboard
- Initialize Workers
-
Navigate to Code → Code Studios
-
Publish the application as a Web Application
- Auto-start enabled
Pulse stores all collected data in external blob storage and loads final tables into DuckDB.
- AWS and Azure work out of the box using standard credentials
- GCS requires additional IAM configuration
See docs/gcs-auth.md for GCS-specific setup and validation.
Once the initial collection cycle completes, the Pulse dashboard will begin displaying insights and usage metrics.
At this point:
- Core installation is complete
- Additional customization and extensions can be added as needed