Cloud Operations Engineer
Porter Airlines
- Toronto, ON
- Permanent
- Full-time
- Design, install, configure, and maintain AWS cloud infrastructure, including server and serverless architectures.
- Architect and design improvements using new native solutions in AWS and/or alternative cloud environments.
- Apply good knowledge of core cloud services such as VMs, containers, app services, virtual networks, ASGs, application gateways, load balancers, and S3 storage.
- Regularly evaluate cloud applications, designs, and best practices.
- Implement secure networking solutions such as VPCs, subnets, routing, and security groups.
- Manage security groups, IAM roles, and policies for secure access control.
- Create and maintain automated solutions for repetitive tasks, including infrastructure provisioning, monitoring, and patching.
- Optimize cloud resources for cost management and performance.
- Implement auto-scaling and elasticity solutions to ensure infrastructure reliability and efficiency.
- Use Infrastructure as Code tools to provision, manage, and version AWS resources.
- Implement and maintain configuration management tools to standardize and automate configurations.
- Configure and maintain monitoring and alerting systems to ensure the health and performance of the infrastructure.
- Deploy, configure, and support applications and services on AWS infrastructure.
- Collaborate with DevOps teams to optimize and automate CI/CD pipelines.
- Prepare and document Change Requests and Methods of Procedure (MOPs) for infrastructure changes.
- Troubleshoot and resolve incidents affecting cloud services in accordance with predefined Service Level Agreements (SLAs).
- Escalate issues to internal teams or third-party vendors as required.
- Participate in incident response processes and post-incident reporting (PIR).
- Identify and deploy fixes to prevent recurrence of issues.
- Work with IT Security to ensure cloud infrastructure aligns with security best practices and compliance requirements.
- Work with internal architecture, Solution Delivery (PMO), governance, and security teams to ensure that all security, governance, and business continuity requirements and best practices are integrated and implemented.
- Support Cloud Governance: Cost Optimization, Data Management, Asset Management, Performance Management, and Deployment Acceleration.
- Implement encryption, backup, and disaster recovery solutions.
- Actively engage in the company’s Safety Management System (SMS) by reporting hazards and incidents encountered during daily operations.
- Coordinate with internal stakeholders, DevOps, and third-party vendors for deployments and fixes.
- Work within a cross-functional team of System Ops Engineers, App Admins, Data Engineers, DevOps, and DBAs to specify, design, develop, test, and implement AWS cloud services and solutions.
- Provide guidance and knowledge to other team members, and promote efficiency, productivity, innovations, and knowledge-sharing across multi-functional teams.
- Work closely with internal business partners to gather requirements, design and implement solutions, manage technical operations, and triage and resolve operational issues.
- Maintain detailed and accurate system, application, and infrastructure documentation.
- Participate in an after-hours on-call rotation for critical incident resolution.
- Perform additional related duties as assigned by management.
- Strong experience with AWS services such as EC2, S3, RDS, Lambda, CloudFormation, and VPC.
- 5+ years of experience in IT Infrastructure Operations with a minimum of 3 years of experience in AWS.
- Proficiency in automation tools (e.g., AWS CLI, Terraform, or CloudFormation).
- Familiarity with monitoring tools such as CloudWatch and Dynatrace.
- Experience with scripting languages such as Python, PowerShell, or Bash.
- Experience implementing, supporting, and monitoring servers and applications in both Windows and Linux environments.
- Experience integrating applications and systems.
- Understanding of ITIL practices and change management processes.
- Strong problem-solving skills and ability to perform under pressure.
- Excellent communication and documentation skills.
- Experience with DevOps, CI/CD pipelines, and Configuration Management is an asset.
- Willingness to work flexible hours, including after-hours and on-call rotations.