Role Overview
We are seeking a highly skilled Senior Reliability & Cloud Operations Engineer to ensure the reliability, security, and scalability of a large-scale SaaS platform hosted in the cloud. You will play a key role in maintaining 24/7 uptime, optimizing infrastructure, automating deployments, and resolving complex issues. The role requires deep cloud expertise, strong DevOps practices, and close collaboration with support and engineering teams. This is a permanent, full-time position in a fast-paced and dynamic environment.
Role type: Permanent
Location: Dublin or Galway, Ireland (once a week onsite)
Key Responsibilities
Manage and optimize cloud-based infrastructure (AWS expertise required) to ensure high availability, scalability, and cost efficiency.
Automate deployments and provisioning using modern CI/CD and Infrastructure as Code approaches.
Implement robust monitoring, logging, and alerting systems to improve visibility and reduce downtime.
Enforce security and compliance standards across systems and environments.
Lead incident response for critical outages, conduct root cause analysis, and maintain disaster recovery plans.
Partner with engineering teams to enhance system reliability, performance, and operational maturity.
Mentor junior engineers, contribute documentation, and share best practices.
Continuously evaluate and improve automation, tooling, and infrastructure design.
Required Skills & Experience
5+ years in cloud infrastructure (AWS) or reliability engineering roles.
Strong background in systems administration (Linux/Windows), networking, and scripting/automation (e.g., Python, Bash).
Hands-on experience with virtualization, container orchestration, and serverless platforms.
Proficiency with IaC, CI/CD pipelines, and configuration management tools.
Familiarity with observability platforms for monitoring, logging, and performance tracking.
Understanding of security principles, compliance frameworks, and vulnerability management.
Relevant cloud certifications are advantageous.
Strong problem-solving and troubleshooting abilities.
Comfortable working in a fast-paced, always-on SaaS environment.
Clear communicator, effective in cross-functional collaboration.
Proactive, detail-oriented, and able to work independently as well as in a team.
Nice to Haves
Experience in SaaS, B2B, or enterprise software environments.
Exposure to multi-region or large-scale distributed system architectures.
***No Sponsorship Available***
***Stamp 1G Dependent/Stamp 4/EU Citizens can apply***
The Next Step for you: Should this position be of interest to you, please forward your CV to Pankaj Sharma at GCS Recruitment specialists at [email protected] or call +353-46901-1902.
GCS is acting as an Employment Agency in relation to this vacancy.