# Senior Consultant Specialist

**Company**: HSBC Software Development (GuangDong) Limited
**Location**: Guangzhou
**Work arrangement**: onsite
**Experience**: senior
**Job type**: full-time
**Category**: Engineering
**Industry**: Finance

**Apply**: https://portal.careers.hsbc.com/careers/job/563774611335128?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply
**Canonical**: https://yubhub.co/jobs/job_1eed2df6-72a

## Description

Some careers have more impact than others. If you’re looking for a career where you can make a real impression, join HSBC and discover how valued you’ll be. We are currently seeking an experienced professional to join our team in the role of Senior Consultant Specialist.

As a Senior Consultant Specialist, you will be responsible for: end-to-end service ownership; observability, monitoring, and alerting; API gateway and traffic management; authentication, authorisation, and security controls; resilience engineering and disaster recovery (DR); incident management and problem management; CI/CD, release reliability, and environment management; database and dependency reliability; vulnerability management (CVE) and patching; and leadership and stakeholder management.

Key responsibilities include:

- **End-to-end service ownership (AWS API platform)**: Own reliability and operational readiness for API services deployed on AWS across multiple environments (dev/test/stage/prod).

- **Observability, monitoring, and alerting**: Design and implement monitoring and service status visibility (dashboards, service health views, dependency mapping).

- **API gateway and traffic management**: Operate and optimise Kong Gateway (or equivalent) for routing, rate limiting, throttling, authentication integration, and policy enforcement.

- **Authentication, authorisation, and security controls**: Work with IAM/security teams to ensure strong authentication/authorisation controls (e.g., OAuth2/OIDC, mTLS, token validation, secrets management).

- **Resilience engineering and DR**: Define and implement resilience patterns: multi-AZ design, failover strategies, graceful degradation, and dependency resilience.

- **Incident management and problem management**: Lead major incident response (triage, coordination, communications, recovery).

- **CI/CD, release reliability, and environment management**: Partner with engineering teams to improve CI/CD pipelines and release safety (progressive delivery, canary/blue-green, automated rollback).

- **Database and dependency reliability**: Provide reliability guidance for databases and stateful components (performance, backup/restore, replication, patching, capacity).

- **Vulnerability management (CVE) and patching**: Own/drive operational response to CVE and vulnerability findings: triage, risk assessment, patch planning, and verification.

Requirements include:

- 10+ years’ experience in fintech or regulated financial services operating customer-facing digital platforms.

- Proven experience leading SRE/production operations for cloud-based services, ideally AWS.

- Strong hands-on experience with: API gateways (Kong preferred) and API platform operations; observability tooling (AppDynamics, Splunk); incident management, root cause analysis (RCA), and operational governance; CI/CD pipelines and release engineering practices; security controls (IAM, secrets management, secure configuration, vulnerability/CVE remediation); and database operations and performance troubleshooting (SQL/NoSQL exposure beneficial).

- Strong understanding of reliability engineering concepts: SLI/SLO, error budgets, capacity planning, resilience patterns, DR.

Technical skills include Infrastructure as Code (Terraform/CloudFormation), containers and orchestration (Docker/Kubernetes/EKS), scripting/automation (Python, Bash), and AWS services commonly used in API platforms.

Soft skills include calm, structured leadership during high-severity incidents, strong stakeholder communication, a bias for automation and continuous improvement, and a collaborative mindset across engineering, security, and operations.

## Skills

### Required
- API gateways
- API platform operations
- Observability tooling
- Incident management
- RCA
- Operational governance
- CI/CD pipelines
- Release engineering practices
- Security controls
- Database operations
- Performance troubleshooting
- Reliability engineering concepts
- Infrastructure as Code
- Containers and orchestration
- Scripting/automation
- AWS services

---

Source: [Apply at portal.careers.hsbc.com](https://portal.careers.hsbc.com/careers/job/563774611335128?utm_source=yubhub.co&utm_medium=jobs_feed&utm_campaign=apply)
