Data Infrastructure Engineer in San Mateo
Energy Jobline is the largest and fastest growing global Energy Job Board and Energy Hub. We have an audience reach of over 7 million energy professionals, 400,000+ monthly advertised global energy and engineering jobs, and work with the leading energy companies worldwide.
We focus on the Oil & Gas, Renewables, Engineering, Power, and Nuclear markets as well as emerging technologies in EV, Battery, and Fusion. We are committed to ensuring that we offer the most exciting career opportunities from around the world for our jobseekers.
Job DescriptionJob DescriptionAbout zaimler
AI agents can't reason over data they don't understand. Enterprise data today is fragmented across dozens of systems with no shared context, meaning, or structure, and that's why most enterprise AI is failing. The shift from copilots to autonomous agents is creating an entirely new infrastructure layer, and we're building it.
zaimler is the context infrastructure for the agentic era: a platform that automatically discovers domain knowledge, maps relationships, and gives AI agents the semantic understanding to operate with precision at scale. Imagine knowledge graphs that support real-time inference, built for systems that need to reason, not just retrieve.
zaimler was founded by Biswajit Das (ex-VP Engineering, Truera), a Data Infra veteran and former Chief Architect at Visa, and Sofus Macskassy (ex-Director of Engineering, LinkedIn), who built one of the largest knowledge graphs in production in the industry at LinkedIn. We're a small, senior team at the seed stage, deploying with major enterprises across insurance, travel, and technology. If you want to build infrastructure that the next decade of AI runs on, we'd love to talk.
The Role
We’re looking for a Data Infrastructure Engineer to build the foundational distributed data layer that feeds our semantic platform. You’ll design, build, and scale systems for high-throughput data ingestion, transformation, and real-time processing.
What You’ll Do
- Build and operate large-scale data pipelines on Spark, Kafka, and Ray.
- Design fault-tolerant streaming and batch systems that move terabytes reliably.
- Optimize data workflows for performance, cost, and latency.
- Collaborate with ML and product engineers to ensure data is discoverable, structured, and queryable.
- Automate deployments with Kubernetes, Terraform, and CI/CD pipelines.
- Monitor, debug, and improve distributed jobs in production.
What We’re Looking For
- Deep experience with distributed data systems (Spark, Kafka, Flink, Ray).
- Strong programming skills (Python, Scala, or Java).
- Comfort with Kubernetes and cloud environments (AWS/GCP/Azure).
- Solid understanding of streaming vs. batch tradeoffs, state management, and scaling patterns.
- Ability to collaborate across data, infra, and ML teams.
Why Join
- A rare chance to be a founding engineer shaping both company and product direction.
- Competitive salary, benefits, and meaningful equity.
- Work alongside engineers and researchers from LinkedIn, Visa, Meta, and Branch.
- Onsite culture in San Mateo, designed for deep collaboration and high-velocity building.
- Full benefits package (Medical, Dental, Vision, 401k).
- We transfer H-1B visas and assist with immigration processes.
We value builders over résumés. If this role excites you but you don't check every box, we still want to hear from you. zaimler is an equal opportunity employer.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
If you are interested in applying for this job please press the Apply Button and follow the application process. Energy Jobline wishes you the very best of luck in your next career move.