Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.datagenie.ai/llms.txt

Use this file to discover all available pages before exploring further.

What this covers

Once GO has context on what you’re expecting — from your problem statement, raw data, and dashboard snapshot — it begins the technical heavy lifting. This walkthrough covers every step from data transformation to a finalized Blueprint ready to generate autonomous insights.
The Blueprint is your dataset’s operational definition — the complete encoding of what DataGenie monitors, how it aggregates, and which dimensions it tracks. Every downstream feature (Top Stories, Wisdom, Explorer, Dashboards) reads from this single source of truth.

The complete GO workflow

Data Transformation

GO suggests specific Spark queries to transform your raw data. It also recommends bucketed dimensions — like grouping ages or price brackets — to make your analysis more powerful. Once you approve the transformation, GO validates the results with a live Schema and Data Preview.

Review the Onboarding Blueprint

Click the Onboarding Blueprint icon in the top right to open the sidebar. This is the source of truth for your new dataset — every decision GO and you have made together is captured here.

Define the Time Anchor

Conversationally define your Time Anchor — the column that tells DataGenie how to track your performance over hours, days, or months. This is critical for accurate time-series analysis.

Set the Granularity

GO asks for the granularity at which autonomous stories and metrics should be generated — daily, weekly, or monthly.

Verify KPIs

GO validates the SQL expressions for each KPI to ensure they are mathematically sound and in context. Every metric is verified before moving forward.

Confirm Dimensions

GO suggests the specific segments — dimensions — to track so you can uncover the “why” behind your numbers. Review and confirm the dimension list.

Finalize and Upload the Blueprint

Review the finalized Blueprint — a complete package of your data DNA. Once satisfied, upload it to start autonomous insights generation.

The Onboarding Blueprint

The Blueprint is the output of the entire GO session — a complete, reusable configuration that encodes your KPIs, dimensions, time anchor, granularity, and transformation logic. It’s what DataGenie uses to begin generating Top Stories and anomaly detection from day one.

Data Transformation

Spark queries and bucketed dimensions that shape your raw data into an analytics-ready format.

KPI Definitions

SQL-validated metric expressions mapped to your business language.

Dimension Assignments

The slicing attributes GO recommends for uncovering the why behind your numbers.

Time Anchor

The column that anchors all time-series analysis and period comparisons.

Granularity

The resolution at which DataGenie generates stories — daily, weekly, or monthly.

Mock Anomaly Preview

Even before your data is live, GO generates mock anomaly previews — showing exactly how DataGenie will flag deviations based on your specific KPIs and dimensions.

Async and iterative by design

Since GO is an asynchronous process, you can jump back and refine any step until it’s perfect. Nothing is locked until you finalize and upload the Blueprint.
You’ve successfully turned a conversation into a configuration — the Blueprint is your data DNA, ready to power autonomous insights in DataGenie.

What to do next

Top Stories

Start reviewing the most impactful metric changes surfaced by DataGenie.

Datasets

Fine-tune your KPI configuration, dimensions, and anomaly detection after GO completes.

Anomaly Detection

Configure the detection models that power your stories.