Anton Gorshkov

Head of Engineering
LinkedIn
Anton is a long-time engineering leader and systems thinker with over two decades of experience building mission-critical platforms in financial services. As a former Managing Director at Goldman Sachs, he led initiatives across Data Strategy for Asset Management, Portfolio Management and Trading Systems, Alternative Investments, and Quantitative Infrastructure — including a successful GS Accelerate incubator venture. Anton is known for combining hands-on technical depth with a talent for building high-impact teams. His leadership is grounded in deep engineering intuition, a passion for elegant system design, and a relentless focus on solving real business problems. At Genesis Computing, Anton brings that same energy to architecting next-generation agentic systems that augment data teams and accelerate enterprise intelligence.
November 6, 2025

How Hard Could It Be? A Tale of Building an Enterprise Agentic Data Engineering Platform

Anton Gorshkov
Head of Engineering
Stay in the Fast Lane
News and product updates in Agentic AI for enterprise data teams.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Over the good part of the year, I’ve been wrangling LLMs and wrangling is the right word here.  They do remind me of wild horses sometimes - majestic and beautiful, but also very dangerous if you’re not careful.

After 25 years of building systems that mostly worked and occasionally didn't catch fire, I'm here to tell you a story. A story about how we built Genesis, our enterprise-ready agentic data engineering platform. But more importantly, a story about how YOU could build one too!

Because really, how hard could it be?

Chapter 1: The Deceptively Simple Beginning

Picture this: It's a Tuesday morning, your third coffee is kicking in, and your CTO walks over with that look. You know the one.

"Hey, I just saw this demo where someone connected ChatGPT to their database and it wrote all their SQL queries! Can we build something like that? But, you know, enterprise-ready?"

You smile. You've been around the block. You've survived the XML years. You've implemented microservices before it was cool (and after it wasn't). This? This is just connecting an LLM to a database.

Python
def build_data_agent():
llm = SuperSmartLLM()  
db = Database()         

question = "Show me last quarter's revenue"   
sql = llm.generate_sql(question)              

return db.execute(sql)


Done! Ship it! 🚀
Oh, sweet summer child...

Chapter 2: The First Wrinkle - "Wait, Which Database?"

Your prototype works beautifully with that one PostgreSQL database. Then Karen from Finance mentions they need it to work with Snowflake. And Tom from Operations casually drops that all their data is in Databricks. Oh, and Legal has some "minor concerns" about that SQLite database full of contracts.

Challenge Discovered: Universal Data Connectivity

Suddenly, your elegant one-liner needs to handle a dozen different SQL dialects, authentication methods, and connection protocols.

In Genesis, we solved this by building what I like to call the "United Nations of Databases" - a connector architecture where each database gets its own specialized adapter. Each connector speaks the native language of its database while presenting a unified interface to the agents.

Python
# What started as db.execute(sql) became...
connector = ConnectorFactory.create(
    type="snowflake",
    auth_method="oauth",  # or "keypair" or "password" or...
    warehouse_size="XSMALL",  # because money
    role="DATA_SCIENTIST",    # because permissions
    timeout_seconds=30,       # because patience
    retry_policy=ExponentialBackoff(),  # because reality
)


But hey, still manageable, right?

Chapter 3: The Plot Thickens - "Can It Do More Than Just Query?"

Your data agent is happily running queries when someone asks, "Can it create a dbt model?"

"Sure!" you say, adding a create_file function.

"Can it commit to Git?"

"I... suppose?"

"Can it download files from our data vendor's API?"

"..."

"Can it send the results to Slack?"

"Hold on..."

Challenge Discovered: The Tool Ecosystem Explosion

What started as a simple query bot now needs to be a Swiss Army knife of data operations. Each tool seems simple in isolation. File operations? Easy. Git operations? No problem. Web scraping? Been there. But when your agent needs all of these, AND needs to use them safely, AND needs to know when to use which tool...

In Genesis, we built a tool framework where each capability is a self-contained, permission-controlled module. Our agents don't just have tools; they have a hardware store with a very strict security guard:

SQL
@gc_tool(
    required_permissions=["file_write", "git_commit"],
    rate_limit="10/minute",  # Because wisdom
    audit_log=True,          # Because compliance
)
def create_dbt_model(model_name: str, sql_query: str):
    # What could possibly go wrong?


Current tool count in Genesis: 107. And counting.

Chapter 4: The Surprise Party Nobody Wanted - "Is It Doing Something?"

Your agent is now happily running along, writing queries, creating files, making commits. Then your manager walks by:

"What's it doing?"

"Running queries!"

"Which queries?"

"Good... queries?"

"On which tables?"

"Important... tables?"

"Show me."

Nervous sweating intensifies

Challenge Discovered: Real-time Observability

It turns out that when you give an AI agent the keys to your data kingdom, people want to know what it's doing. In real-time. With full audit trails. And the ability to stop it if it starts doing something creative.

Genesis solved this with what I call "Panopticon-as-a-Service" - WebSocket streaming, OpenTelemetry tracing, and enough logging to make the NSA jealous.  We don’t want a black-box, we want the most transparent, the most magnified glass box possible with enough tooling to zoom in and out as needed.

Chapter 5: The Scaling Surprise - "Can It Handle Our Production Workload?"

Your agent works beautifully in dev. It's answering questions, creating pipelines, making everyone happy. Then someone decides to point it at the production data warehouse with 50,000 tables and asks it to "document everything."

Your laptop fan sounds like a jet engine. The office lights dim. Somewhere, a circuit breaker trips.

Challenge Discovered: Scale Without Sacrifice

Genesis handles this with process isolation and resource management that would make a container orchestration platform proud. That bulk operation? It's running in its own process with memory limits, CPU throttling, and a stern talking-to about playing nice with others.

Chapter 6: The Security Awakening - "Who Gave It Permission to Drop Tables?"

Everything's running smoothly until you get that call. You know the one. It starts with "So, an interesting thing happened..."

Turns out your agent interpreted "clean up the test data" rather liberally.

Challenge Discovered: Enterprise-Grade Security / Guardrails

In Genesis, we implemented what I call "Defense in Depth, Paranoia in Practice":

  • Authentication: "Who are you?" (OAuth, SAML, certificates, blood samples*)
  • Authorization: "What can you do?" (Role-based, attribute-based, mood-based*)
  • Caller Rights: "The agent has YOUR permissions, not God mode"
  • Audit Everything: "Yes, everything. Even this log entry about logging."

Moreover, if you played enough with LLM, you know that no matter how many times you put:

“IMPORTANT! DO NOT DROP TABLES - EVER!”

In the Agent’s instruction set, sometimes… rarely… the LLM will ignore those instructions.  It will be nice about it, apologetic even, but that means you can’t rely on instruction following alone, you’ll need to come up with additional guardrails implemented in code.

Chapter 7: The Collaboration Conundrum - "Can We Have Multiple Agents?"

Success! Your agent is so useful that every team wants their own. Marketing wants a "Campaign Performance Agent." Sales wants a "Pipeline Analysis Agent." Engineering wants a "Why Is Production Down Agent."

Now they all need to work together. What could go wrong?

Challenge Discovered: Multi-Agent Orchestration

Genesis solved this with our Mission system - think of it as air traffic control for agents:

mission: analyze_customer_churn

agents:

SQL
mission: analyze_customer_churn
agents:
  - DataExtractionAgent: "Get the raw data"
  - AnalysisAgent: "Find the patterns"  
  - VisualizationAgent: "Make pretty charts"
  - EmailAgent: "Send results to executives"
coordination:
  - sequential: [DataExtraction, Analysis]
  - parallel: [Visualization, Email]
  - retry_policy: "until_success_or_heat_death_of_universe"

Now you have a whole workflow that not only resembles what actually happens at a company, but allows you to institutionalize that process in a way that makes it repeatable and documentable.

Chapter 8: The Interface Intervention - "My CEO Wants to Use It"

Your beautifully functional command-line interface is working perfectly. Then you get the email: "The CEO wants to try the data agent."

The CEO's relationship with command lines ended with DOS 3.1.

Challenge Discovered: Human-Friendly Interfaces

Genesis provides multiple interfaces because we learned that one size fits none:

  • A modern React dashboard for the "I need it pretty" crowd
  • APIs for the "I'll build my own UI with blackjack" crowd
  • CLI for those of us who think GUIs peaked with ASCII art

Chapter 9: The Deployment Dance - "It Works on My Machine"

Time to deploy! Should be simple, right? Your local setup works perfectly.

"We need it in AWS," says the Cloud Team. "Actually, on-premise," says Security. "Inside Snowflake," says the Data Team. "All of the above," says the Enterprise Architect.

Eye twitches

Challenge Discovered: Deploy Anywhere Architecture

Genesis handles this with more deployment options than a Swiss Army knife has blades:

SQL
# Dockerfile for cloud
FROM ubuntu:latest AS cloud-deployment
# 500 lines of config

# Dockerfile for on-premise  
FROM redhat:enterprise AS paranoid-deployment
# 1000 lines of security hardening

# Snowflake Native App
CREATE APPLICATION PACKAGE genesis_in_snowflake AS
-- SQL pretending to be infrastructure

Epilogue: The Truth Revealed

So there you have it. Building an enterprise-ready agentic data engineering platform is totally straightforward! You just need to:

  1. Build universal database connectivity (Chapter 2)
  2. Create a comprehensive tool ecosystem (Chapter 3)
  3. Implement real-time observability (Chapter 4)
  4. Design for massive scale (Chapter 5)
  5. Lock down security tighter than Fort Knox (Chapter 6)
  6. Orchestrate multiple agents like a symphony conductor (Chapter 7)
  7. Build interfaces for humans of all technical levels (Chapter 8)
  8. Support every deployment scenario ever conceived (Chapter 9)
  9. Ensure AI doesn't write code that makes developers cry (Chapter 10)

And about 67 other things we didn't have space to cover.

Easy peasy! 🎉

The Real Moral of the Story

After 25 years in this industry, I've learned that the difference between a demo and a production system is like the difference between a paper airplane and a Boeing 747. Both fly, technically.

Genesis exists because we've solved these challenges so you don't have to. But if you do decide to build your own... well, you definitely CAN!

Remember: Every complex system started with someone saying "How hard could it be?"

The answer, dear reader, is always "Harder than you think, but not impossible."

Now if you'll excuse me, I need to stop an agent that interpreted "optimize the database" a bit too enthusiastically.

Want to learn more? Get in touch!

Experience what Genesis can do for your team.
Request a Demo
Stay in the Fast Lane
News and product updates in Agentic AI for enterprise data teams.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Keep Reading

Launching the Genesis App through the Snowflake Marketplace
Genesis Walkthrough #7: Exploring Mission Results
Genesis Walkthrough #6: Mission document flow
Genesis Walkthrough #4: Genesis Mission prompt for required information
View All Articles
April 27, 2026
From "Something's Broken" to Root Cause in 5 Minutes
No items found.
No items found.
April 23, 2026
40 Minutes to Reverse-Engineer a Legacy Data Warehouse (Including the Ghost Artifacts Nobody Knew Existed)
No items found.
No items found.
April 22, 2026
From Raw Claims Data to a Live Analytics Dashboard in 7 Minutes
No items found.
No items found.
April 20, 2026
Meet Genesis Twin: The Digital Twin That Ends the Monday Morning Data Fire Drill
No items found.
No items found.
April 9, 2026
Super Data Science: ML & AI Podcast with Jon Krohn
Matt Glickman
April 8, 2026
Connecting Data Sources in Genesis
Todd Beauchene
Promotional banner for Genesis Computing
March 31, 2026
How Genesis Automates Synthetic Data Generation for Databricks Dev Environments in Under 34 Minutes
Todd Beauchene
March 19, 2026
The Death of Traditional BI - Part 1
Genesis Computing
March 11, 2026
AI Agent Builds dbt Analytics Schema in 30 Minutes
Todd Beauchene
February 26, 2026
Genesis Bronze, Silver, Gold Agentic Data Engineering: From Dashboard Sketch to Production Pipeline
Genesis Computing
February 19, 2026
How Genesis Automates Data Pipeline Development in Hours
Genesis Computing
February 12, 2026
3 cortex Codes Running in Parallel?
Justin Langseth
February 10, 2026
Powering Up Cortex Code with Genesis Superpowers
Matt Glickman
February 2, 2026
Automate Dashboard Creation with Genesis
Justin Langseth
January 27, 2026
Using AI Agents to Generate Synthetic Data
Justin Langseth
January 12, 2026
The Junior Data Engineer is Now an AI Agent
Matt Glickman
December 22, 2025
From Requirements to Production Pipelines With Genesis Missions
Genesis Computing
December 4, 2025
20 Years at Goldman Taught Me How to Manage People. Turns Out, Managing AI Agents Isn't That Different.
Anton Gorshkov
December 2, 2025
A CEO's Perspective on the Shift to AI Agents
Genesis Computing
December 2, 2025
Genesis Walkthrough #1: Exploring an S3 Bucket with Genesis Agents
Todd Beauchene
December 2, 2025
Genesis Walkthrough #2: Loading data from S3 into Snowflake with Genesis
Todd Beauchene
December 2, 2025
Genesis Walkthrough #3: Using a Blueprint to launch a mission
Todd Beauchene
December 2, 2025
Genesis Walkthrough #4: Genesis Mission prompt for required information
Todd Beauchene
December 2, 2025
Genesis Walkthrough #5: Checking in on a running mission
Todd Beauchene
December 2, 2025
Genesis Walkthrough #6: Mission document flow
Todd Beauchene
December 2, 2025
Genesis Walkthrough #7: Exploring Mission Results
Todd Beauchene
December 2, 2025
Genesis Walkthrough #8: DBT Engineering Blueprint
Todd Beauchene
November 7, 2025
Exploring Genesis UI: Agents & Their Tool
Todd Beauchene
November 7, 2025
Launching the Genesis App through the Snowflake Marketplace
Todd Beauchene
November 7, 2025
Exploring Mission Features in Genesis UI
Todd Beauchene
November 6, 2025
How Hard Could It Be? A Tale of Building an Enterprise Agentic Data Engineering Platform
Anton Gorshkov
November 4, 2025
Better Together: Genesis and Snowflake Cortex Agents API Integration
Genesis Computing
October 31, 2025
Exploring Genesis UI: Agent Workflows
Todd Beauchene
October 27, 2025
Agent Server [1/3]: Where Enterprise AI Agents Live, Work, and Scale
Justin Langseth
October 27, 2025
Agent Server [2/3]: Where Should Your Agent Server Run?
Justin Langseth
October 27, 2025
Agent Server [3/3]: Agent Access Control Explained: RBAC, Caller Limits, and Safer A2A
Justin Langseth
October 26, 2025
Delivering on agentic potential: how can financial services firms develop agents to add real value?
No items found.
No items found.
October 20, 2025
Blueprints: How We Teach Agents to Work the Way Data Engineers Do
Justin Langseth
October 20, 2025
Context Management: The Hardest Problem in Long-Running Agents
Justin Langseth
October 20, 2025
Progressive Tool Use
Genesis Computing
August 22, 2025
Your Data Backlog Isn't Just a List — It's a Risk Ledger
Genesis Computing
August 14, 2025
The Future of Data Engineering: From Months to Hours with Agentic AI
Genesis Computing
Matt Glickman gives an interview at Snowflake Summit 2025
June 27, 2025
Ex-Snowflake execs launch Genesis Computing to ease data pipeline burnout with AI agents
No items found.
No items found.
June 25, 2025
GXS Uses Autonomous AI Agents to Speed Data Engineering from Months to Hours
No items found.
No items found.
June 5, 2025
Enterprise AI Data Agents: Automating Bronze Layer to Snowflake dbt Pipelines
No items found.
No items found.
June 4, 2025
Stefan Williams, Snowflake & Matt Glickman, Genesis Computing | Snowflake Summit 2025
No items found.
No items found.
The Evolution of Data Work: Introducing Agentic Data Engineering
Matt Glickman
Justin Langseth