
Infrastructure Architect in Edinburgh

Energy Jobline is the largest and fastest-growing global Energy Job Board and Energy Hub. We have an audience reach of over 7 million energy professionals, advertise 400,000+ global energy and engineering jobs each month, and work with the leading energy companies worldwide.

We focus on the Oil & Gas, Renewables, Engineering, Power, and Nuclear markets as well as emerging technologies in EV, Battery, and Fusion. We are committed to ensuring that we offer the most exciting career opportunities from around the world for our jobseekers.

Job Description

Job Title: AI Infrastructure Architect

Location: Edinburgh, Scotland

Type: Permanent

On-Site Working Required, No Sponsorship Provided

Responsibilities:

Design a unified AI Infra & Serving architecture platform for composite AI workloads such as LLM Training & Inference, RLHF, Agent, and Multimodal processing. This platform will integrate inference, orchestration, and state management, defining the technical evolution path for Serverless AI + Agentic Serving.

Design a heterogeneous execution framework across CPU/GPU/NPU for agent memory, tool invocation, and long-running multi-turn conversations and tasks. Build an efficient memory/KV-cache/vector store/logging and state-management subsystem to support agent retrieval, planning, and persistent memory.

Build a high-performance Runtime/Framework that defines the next-generation Serverless AI foundation through elastic scaling, cold start optimization, batch processing, function-based inference, request orchestration, dynamic decoupled deployment, and other features, supporting demanding performance scenarios such as multiple models, multi-tenancy, and high concurrency.

Key Requirements:

  • Strong foundational knowledge in system or computer architecture, operating systems, and runtime environments
  • Hands-on experience with Serverless architectures and cloud-native optimization technologies such as containers, Kubernetes, service orchestration, and autoscaling
  • Experience with inference and serving frameworks (e.g., vLLM, SGLang, Ray Serve); understanding of common optimization concepts such as continuous batching, KV-Cache reuse, parallelism, and compression/quantization/distillation
  • Proficient in using Profiling/Tracing tools; experienced in analyzing and optimizing system-level bottlenecks regarding GPU utilization, memory/bandwidth, Interconnect Fabric, and network/storage paths
  • Proficient in at least one system-level language (e.g., C/C++, Go, Rust) and one scripting language (e.g., Python)


If you're interested in applying, please reach out to daniel@microtech-global.com

If you are interested in applying for this job please press the Apply Button and follow the application process. Energy Jobline wishes you the very best of luck in your next career move.


Edinburgh, UK
Full time

Published on 02/21/2026
