Role: SRE Infra Ops Engineer for AWS
Location: Remote
Duration-Full Time
Who are we looking for?
The DevOps Engineer will be a core, pivotal, and transformational member of the engineering team. You will work as an integral part of our agile teams in analyzing, designing, building, and testing high quality cloud deployment methodologies and systems. Working on a team under the leadership of architects, you will build, migrate and operate on cloud-based platforms. This position will be fully responsible for our platform and application pipelines, and the flow of code updates through the engineering group, as well as creating and monitoring highly available cloud infrastructure and platforms to host that code. You will have the opportunity to work on multiple technologies, especially cloud platform services.
Technical Skills:
4+ years overall experience with 2+ years in SRE role handling IaaS, PaaS and Microservices on AWS Cloud Platform
SRE Engineer with strong experience in monitoring, troubleshooting and support
Support rapid development and engineering productivity via release engineering, CI/CD automation, and build tools.
Perform health checks Apps/Infra to identify and proactively pre-empt issues from occurring (verification, alerts, etc).
Work closely with engineering or DevOps teams to debug and fix issues as they arise.
Work on development tasks and tools for infrastructure, deployment, monitoring, etc.
Hand on Experience working on AWS / OCI Platforms
Integration with Code Deploy / GitHub Actions.
Expertise in monitoring tools like CloudWatch and Datadog
Experience in IaaS tools like CFT, Terraform
Strong expertise in Cloud concepts like Infrastructure as Code, Cloud Computing, Cloud Networking, Containerization, and SRE.
Experience in migrating and implementing multiple applications from on-premise to cloud using AWS services like EC2, S3, EC2, VPC, RDS, CloudTrail, and EKS Function.
DevOps, AWS CloudFormation, Kubernetes, Amazon Web Services (AWS) and Shell Scripting
Process Skills:
Having sound knowledge of ITIL practices like Change Management, Incident Management etc.
Exceptional communication skills
Self-starter, ambitious, willing to take on difficult problems
Collaborative, team player attitude
Practical exposure & knowledge in existing / emerging cloud Database technologies.
Has worked in Metrix role with an ability to work independently with multiple managers with dotted line hierarchies.
Keeping abreast of industry trends, technology innovation, and changing customer requirements to help with the continual service improvement process.
Participate in on-call rotations and be responsible for infrastructure and platform level escalations.
Work with the DevOps team on planning and implementation of infrastructure capacity planning, upgrades, and monitoring.
Participate in Daily (Standup) Production Reviews
Contribute to the design and improvement of deployment architecture of new and existing applications based on the principles of reliability, high availability, efficiency, and observability.
Research, learn, adapt, customize, and create tools to improve the observability, resilience, and usability of applications in scope
Create and maintain SRE-related documentation (solution repository, Root Cause Analysis Reports etc)
Job Type: Full-time
Pay: $75,000.00 - $80,000.00 per year
Schedule:
8 hour shift
Experience:
SRE: 4 years (Required)
AWS: 3 years (Required)
DevOps: 3 years (Required)
Terraform: 4 years (Required)
Work Location: Remote