Job Description
We are seeking a highly skilled and experienced Site Reliability Engineer (SRE) to join our technology team in the banking sector. The ideal candidate will have a strong background in infrastructure reliability, monitoring (especially with Splunk), automation, and performance optimization, with a proven track record in financial services environments.
Key Responsibilities:
Design, implement, and maintain scalable, resilient, and secure infrastructure systems.
Develop and maintain monitoring and alerting systems using Splunk and other observability tools.
Collaborate with development and operations teams to ensure high availability and performance of critical banking applications.
Automate repetitive tasks and improve system reliability through Infrastructure as Code (IaC) and CI/CD pipelines.
Conduct root cause analysis and post-mortems for incidents and outages.
Ensure compliance with banking regulations and internal security policies.
Participate in on-call rotations and incident response.
Desired Candidate Profile
Any Nationality
Any Graduation
Any
Required Skills & Qualifications:
Bachelor’s degree in Computer Science, Engineering, or related field.
6+ years of experience in SRE, DevOps, or related roles.
Strong experience with Splunk for monitoring, alerting, and log analysis.
Proficiency in scripting languages (Python, Bash, etc.).
Hands-on experience with cloud platforms (AWS, Azure, or GCP).
Familiarity with containerization and orchestration tools (Docker, Kubernetes).
Experience in the banking or financial services industry is mandatory.
Strong understanding of networking, security, and system architecture.
Excellent problem-solving and communication skills.
Preferred Qualifications:
Certifications in cloud platforms (e.g., AWS Certified DevOps Engineer).
Experience with ITIL processes and regulatory compliance in banking.
Knowledge of database systems (SQL, NoSQL).