IBM webMethods – Senior Integration Engineer
Key Responsibilities:
Production Support & Incident Management:
- Serve as the primary escalation point for all IBM webMethods production incidents, ensuring timely triage, root cause analysis and resolution.
- Monitor integration flows, services and messaging channels across IS, API Gateway and UM environments for faults, failures and performance degradation.
- Manage and resolve P1/P2 critical incidents within defined SLAs, coordinating with internal teams and vendors as required.
- Perform log analysis, thread dump analysis and diagnostic investigations to identify and remediate integration failures.
- Maintain incident, problem and change records in ITSM tools (e.g. ServiceNow, Jira) with thorough documentation and closure reports.
- Conduct post-incident reviews and implement preventive measures to reduce recurrence.
Platform Administration:
- Administer and maintain IBM webMethods Integration Server (IS), API Gateway, Universal Messaging (UM) and related platform components across all environments (Dev, QA, UAT, Production).
- Manage server configurations, package deployments, user access controls, security policies and SSL/TLS certificates.
- Perform capacity planning, performance tuning and resource optimisation across the webMethods platform.
- Coordinate and execute platform upgrades, patch applications and hotfix deployments with minimal service disruption.
- Manage API Gateway configurations, including policy enforcement, rate limiting, OAuth2/JWT setup and developer portal administration.
- Configure and maintain Universal Messaging channels, queues, topics, durable subscriptions and cluster settings.
- Ensure platform high availability through clustering, failover configuration, and disaster recovery planning and testing.
Monitoring & Observability:
- Design and maintain end-to-end monitoring dashboards and alerting frameworks for integration services, APIs and messaging infrastructure.
- Utilise monitoring tools such as webMethods Optimize, ELK Stack, Splunk or equivalent to track service health, throughput, latency and error rates.
- Define and manage key operational metrics, SLA thresholds and automated alerting rules.
- Conduct proactive health checks and scheduled reviews of platform components to identify potential issues before they impact production.
- Analyse trends in platform performance data and provide recommendations for optimisation and capacity planning.
Documentation & Knowledge Management:
- Develop and maintain comprehensive technical documentation, including runbooks, standard operating procedures (SOPs), architecture diagrams and integration design documents.
- Document incident resolutions, known errors and workarounds in a centralised knowledge base to enable faster future resolution.
- Maintain up-to-date API catalogues, data flow diagrams and interface inventories for all active integrations.
- Produce regular operational reports covering platform health, incident trends, SLA adherence and change activity for stakeholders and management.
- Ensure all change records, deployment guides and rollback procedures are documented and reviewed prior to implementation.
Integration Development & Enhancement:
- Design and develop integration flows, services and adapters on IBM webMethods IS to support new business requirements and enhancements.
- Build and expose REST, SOAP and GraphQL web services, ensuring adherence to enterprise API standards and security policies.
- Review and guide the development work of junior and mid-level integration developers, providing technical mentorship and code review.
- Support CI/CD pipeline integration for automated testing, packaging and deployment of webMethods services across environments.
- Contribute to continuous improvement of integration frameworks, reusable components and delivery standards.
Required Skills & Qualifications:
- 10-12 years of hands-on experience with the IBM webMethods platform, spanning development, support and administration.
- Deep expertise in IBM webMethods Integration Server (IS), API Gateway and Universal Messaging (UM) across multi-environment enterprise setups.
- Proven experience in production support, incident management and SLA-driven operations for integration platforms.
- Strong understanding of REST, SOAP and GraphQL web services, and experience troubleshooting service failures across protocols.
- Expertise in platform administration tasks, including server configuration, package management, user access, clustering and disaster recovery.
- Experience with ITSM tools (ServiceNow, Jira or equivalent) for incident, problem and change management.
- Proficiency in log analysis, diagnostic tooling and performance investigation techniques for webMethods environments.
- Experience designing and maintaining monitoring dashboards and alerting frameworks using tools such as Splunk, ELK or webMethods Optimize.
- Strong technical documentation skills, capable of producing runbooks, SOPs, architecture documents and operational reports.
- Experience working in agile or hybrid delivery environments, collaborating with cross-functional and cross-organisational teams.
- Ability to manage multiple priorities, incidents and stakeholder communications simultaneously and under pressure.
Employment Type: Full Time
Experience: years
Vacancy: 1