General-Purpose Data Agent

Your data context, in every tool you use.

Pulse-Agent is a general-purpose autonomous AI agent for Data Engineers, Data Analysts, and Data Scientists. Plug it into the tools you already use — it carries your data context across them, plans, acts, and verifies. Local-first.

Local-first 8 LLM providers 34 skills
Why it's different

One agent. Every tool you already use.

Other data agents are tied to a single product. Pulse-Agent is a portable context layer — it plugs into the orchestrator, warehouse, transform engine, BI tool, and notebook you already run, and carries your data context across them.

Orchestration
F-PulseAirflowPrefectDagsterKestra
Warehouses
SnowflakeBigQueryRedshiftDatabricksPostgresDuckDB
Transforms
dbtSQLMeshF-Pulse
Catalog & Lineage
D-PulseDataHubMarquezOpenLineage
BI
TableauLookerSupersetMetabasePower BI
Notebooks & IDEs
JupyterVS CodeCursor

First-party Hybridyn integrations · everything else via open connectors

Three Roles · One Agent

Real personality differentiation.

Not three configs of the same bot — three agents with different verbs, different fears, different toolboxes.

Data Engineer

verb: build

Diagnoses pipeline failures across any orchestrator, patches broken transforms, profiles new sources, and proposes layered models.

fears: Silent data corruption5 role skills

Data Analyst

verb: answer

Builds dashboards from a sentence, writes SQL against any warehouse, and validates results before they reach a stakeholder.

fears: Wrong number to a stakeholder4 role skills

Data Scientist

verb: investigate

Profiles features, runs experiments in a sandboxed Docker, compares model variants, and flags leakage before training.

fears: Overfit shipped to prod3 role skills
The Loop

Plan → Act → Verify.

Every task runs the same loop. You see the plan before any action. You approve every WRITE. The agent verifies its own output before reporting done.

Plan

Decompose into steps. Classify each as READ / WRITE / DESTRUCTIVE. Show the user.

Act

Execute one step at a time. Halt at any WRITE for approval. Halt at DESTRUCTIVE for double-confirm.

Verify

Check output against expected shape. Re-run sanity checks. Surface anything unexpected.

10 Guardrails · Hard Limits

Autonomous, not unsupervised.

Every guardrail is enforced in code, not in a system prompt. The agent literally cannot bypass them.

Token budget breaker
$0.50/task default circuit breaker
Iteration cap
10 steps max per task
Kill switch
Stop the agent at any moment
Action classifier
READ / WRITE / DESTRUCTIVE on every step
SQL re-classifier
Re-checks raw SQL before execution
Approval gate
Every WRITE waits for human OK
Double-confirm
DESTRUCTIVE actions require two confirmations
Append-only audit log
Every action recorded, never edited
Per-connector scopes
Permissions narrow to what the task needs
Output verifier
Results checked before they reach you
See It Work

Four flows. Real tasks.

Fix the broken pipeline

DE

Orchestrator failure → diagnose → patch → rerun

Build a churn dashboard

DA

From one sentence to a working chart

Why is this query slow?

DA

Profile → rewrite → benchmark

Onboard this CSV

DE

Profile → infer schema → propose layered model

34 Skills Out of the Box

Role + Industry packs.

Skills are Markdown files with YAML frontmatter. Add your own — Pulse-Agent picks them up automatically.

Role Skills · 10
Data Engineer5 skills
Data Analyst4 skills
Data Scientist3 skills
Industry Packs · 24
Retail8 skills
BFSI8 skills
Healthcare8 skills
Local-First · Provider Agnostic

Runs fully local. Zero config.

Pulse-Agent auto-detects a local Ollama on localhost:11434. No API key, no cloud, no data leaving the machine. Bring an API key for any of 8 providers when you want a stronger model.

Auto-detects Ollama — works offline
Untrusted code runs in --network=none Docker sandbox
Append-only audit log on disk
Open source skill format — fork and own
pulse-agent providers
$ pulse-agent providers
Claude
OpenAI
Gemini
Azure
Ollama
DeepSeek
Mistral
Groq
> detected:
✓ ollama @ localhost:11434
· using llama3:8b
ready · no API key required
Ambient Mode

It watches so you don't have to.

A background watcher polls your connected tools — orchestrator failures, dbt test failures, dashboard alerts, data quality drops. When something needs attention, Pulse-Agent pings you with a one-line summary and a proposed plan. Rate-limited to 3 pings/hour by default.

Built On
Python 3.11Tauri 2FastAPIDuckDBscikit-learnDocker sandboxPyPIGitHub Releases
Use Cases

A day in the life with Pulse-Agent

Real workflows where the agent saves the most time, across all three data roles.

Incident Response

3am pipeline failure

Airflow DAG fails. Agent pulls the task log, traces the upstream Snowflake table, finds the schema drift, and posts a root cause to Slack with a one-line fix. You wake up to a draft PR.

Lineage

Impact analysis before a drop

You want to drop a column. Agent walks the lineage through dbt, the warehouse, and Tableau, then lists every model, dashboard, and downstream consumer that breaks. Decide before you act.

Authoring

Draft a dbt model

Ask for a monthly active users model. Agent reads your events schema, follows your existing project conventions, drafts the SQL, runs it in dev, shows you the row count, and stops for review.

Discovery

Profile a new source

Point the agent at a new Postgres database. Within a minute it inventories tables, profiles row counts and null rates, flags PII candidates, and proposes a Bronze landing schema for ingest.

Optimization

Explain a slow query

Paste a Snowflake query that takes 90 seconds. Agent reads the query plan, identifies the broadcast join causing the spill, suggests the rewrite, and tells you what the new estimated cost is.

Compliance

Compliance answer in two minutes

Auditor asks 'where does customer_email live and who can see it.' Agent queries the catalog, walks lineage, lists every table, every consumer, and every role with access. One screen, one answer.

FAQ

Common questions

Is Pulse-Agent really free?

Yes, fully free. There is no paid tier of Pulse-Agent. No telemetry-on-by-default, no usage caps, no feature gating, no signup wall. The desktop app and CLI are both free to download and run forever.

Do I need an API key?

No. Pulse-Agent ships with Ollama as the default provider — runs entirely on your machine, no API key, no data leaving your laptop. If you want to use Claude, OpenAI, or another hosted provider for higher quality, you bring your own key. Pulse-Agent has no hosted backend and never proxies your requests.

Which tools does it integrate with?

Orchestrators: Airflow, Prefect, Dagster, Kestra, F-Pulse. Warehouses: Snowflake, BigQuery, Redshift, Databricks, Postgres, DuckDB. Transforms: dbt, SQLMesh. Catalogs: DataHub, Marquez, OpenLineage, D-Pulse. BI: Tableau, Looker, Superset, Metabase, Power BI. Notebooks/IDEs: Jupyter, VS Code, Cursor.

How is it different from a chat-with-your-data tool?

Most chat-with-your-data tools are read-only. They answer questions but can't act. Pulse-Agent has a Plan → Act → Verify loop with ten guardrails — it can actually execute work (run queries, edit files, trigger pipelines) under explicit, auditable controls. It's an agent, not a chatbot.

Will it touch my production data without asking?

No. Read-only steps run automatically. Any write step (DDL, DML, file edit, pipeline trigger) stops and asks unless you've explicitly authorized that exact operation in advance. Connections are tagged read-only or writable, and writes against unmarked connections are blocked at the runtime.

What are the ten guardrails?

No destructive SQL without per-call approval; no writes to production warehouses unless tagged writable; no credentials in LLM context; no silent file edits; no expensive queries without cost preview; no schema changes without impact preview; no cross-environment moves without confirmation; no calls outside the configured allowlist; no result retention beyond the session unless requested; no background actions in ambient mode without opt-in.

Does it work without F-Pulse or D-Pulse?

Yes. F-Pulse and D-Pulse are listed as first-party integrations, not requirements. If you never use a Hybridyn product, Pulse-Agent still works against Airflow, Snowflake, dbt, Tableau, Jupyter, and the rest. The breadth is the entire point.

What is ambient mode?

An opt-in background mode where Pulse-Agent watches your data systems for failures, anomalies, and slow queries, and surfaces them as quiet notifications. You can ignore them or click in to find the agent already halfway through a root-cause analysis.

One agent. Every tool you already use.

Free. Local-first. Plugs into the data tools you already run.