Skip to main content

Cloud Operations Engineer - Unified Cloud & Disaster Recovery

Job Description

Job Title: Cloud Operations Engineer – Unified Cloud & Disaster Recovery

Location: New Jersey City, NJ (Work from Office)


Overview -

Vibhathi Labs are seeking a skilled and motivated Cloud Operations Engineer with expertise in Unified Cloud and Disaster Recovery to join our team in New Jersey City. The ideal candidate will be responsible for ensuring the reliability, availability, security, and performance of our cloud infrastructure while driving operational excellence and mentoring junior team members.


Key Responsibilities -

  • Monitor system health and proactively troubleshoot incidents to ensure high service availability and reliability.
  • Deploy and manage cloud infrastructure leveraging automation and DevOps best practices.
  • Maintain and optimize CI/CD pipelines, configuration management tools, and version control systems.
  • Partner with development teams for seamless code and environment deployments across various environments.
  • Perform detailed root cause analysis and lead incident resolution to minimize downtime.
  • Continuously optimize system performance while ensuring strong security and compliance standards are enforced.
  • Lead capacity planning, scalability initiatives, and cloud cost optimization strategies.
  • Drive disaster recovery (DR) planning and execution, ensuring business continuity.
  • Mentor and guide junior team members, instilling adherence to operational standards and best practices.
  • Develop and maintain documentation, SOPs, dashboards, and service management reports.
  • Act as a key contributor to operational excellence by improving reliability, automation, and service delivery.


Qualifications -

  • Bachelor’s degree in computer science, Information Technology, or related field (or equivalent professional experience).
  • Strong hands-on experience with Unified Cloud platforms (AWS & Azure).
  • Proven expertise in Disaster Recovery design, testing, and ongoing management.
  • Solid knowledge of DevOps tools: CI/CD (Jenkins, GitLab, Azure DevOps, etc.), configuration management (Ansible, Puppet, Chef), and version control (Git).
  • Familiarity with scripting/automation (Python, Shell, PowerShell, etc.).
  • Knowledge of monitoring tools, observability platforms, and incident management best practices.
  • Excellent problem-solving, debugging, and root cause analysis skills.
  • Strong communication and leadership skills with the ability to mentor and support team members.

Cloud Operations Engineer - Unified Cloud & Disaster Recovery

Jersey City, NJ
Full time

Published on 09/26/2025

Share this job now