About the role

We're looking for a founding engineer to own Airweave's data and infrastructure layer, the systems that make our distributed search and data pipelines scalable, reliable and observable.

At Airweave, you'll build and operate the platform that thousands of AI agents depend on. That means distributed sync pipelines pulling data from dozens of sources, vector databases powering LLM search, and the orchestration layer that keeps it all running. You'll work closely with the product team, but your focus is on the foundation: making sure data flows reliably at scale, LLM inference stays fast, and the whole system holds up under real production load.

This is early-stage infrastructure work. The architecture is still being shaped, and your decisions will define how we scale.

What you'll work on

Design and scale distributed data pipelines that sync hundreds of millions of documents from dozens sources into advanced search indexes
Build and improve Temporal workflows for parallel sync orchestration: retries, backpressure, and failure recovery across workers
Own our Kubernetes deployments with Helm charts: autoscaling, and resource management for bursty search, sync and LLM workloads
Scale PostgreSQL for high-throughput; connection pooling, read replicas, partitioning (we ask a lot from this database)
Manage vector database (Vespa) infrastructure: sharding, replication, backup strategies for large-scale agentic search
Orchestrate and optimize LLM inference pipelines: batching, caching, provider failover
Build monitoring and alerting with Prometheus, Grafana, and custom instrumentation for cluster health
Infrastructure as code for the base with Terraform

Founding Software Engineer, Data Infrastructure

About the role

What you'll work on

You might be a fit if

What we offer

About Airweave

Required skills

Other roles at Airweave

Job details

Company

Funding

Founders

What happens next.

Confirm the fit

I pitch you to the company

A meeting lands on your calendar