Who We Are

Join a team that puts its People First! Since 1889, First American (NYSE: FAF) has held an unwavering belief in its people. They are passionate about what they do, and we are equally passionate about fostering an environment where all feel welcome, supported, and empowered to be innovative and reach their full potential. Our inclusive, people-first culture has earned our company numerous accolades, including being named to the Fortune 100 Best Companies to Work For® list for nine consecutive years. We have also earned awards as a best place to work for women, diversity and LGBTQ+ employees, and have been included on more than 50 regional best places to work lists. First American will always strive to be a great place to work, for all. For more information, please visit www.careers.firstam.com.

What We Do

We are looking for a Senior Observability & Monitoring Engineer to develop and manage enterprise observability infrastructure and monitoring tools such as Elasticsearch Observability, Terraform, Azure DevOps, and AWS Native tools and constructs. This candidate will also provide guidance on how to best use these tools. Must have strong communication and technical skills.

What You’ll Do:

  • Build solutions to provide monitoring patterns for various in-house and off-the-shelf applications across the company.
  • Measure and monitor all production systems with an eye toward availability, latency, and overall system health.
  • Engage with application teams to improve and evolve systems by lobbying for changes that enhance reliability, resilience, and observability.
  • Contribute to continuous improvement initiatives for the team and customers, with a goal of providing automation and enhancing client service, efficiency, and profitability.
  • Fine-tune existing tools, or research, develop, and implement new tools, to deliver additional monitoring capabilities.
  • Work on complex problems where analysis of situations or data requires an in-depth evaluation of multiple factors.

What You’ll Bring:

  • Proactive approach, designing telemetry strategies, implementing comprehensive monitoring systems, and leveraging advanced tools to gain real-time insights and identify potential issues before they escalate.
  • Possess in-depth knowledge and expertise in telemetry data collection, analysis, and implementation, fully understanding the intricacies of and how to derive meaningful insights from different telemetry sources such as: Metrics, Events, Logs, Traces.
  • Expertise in identifying patterns, detecting anomalies, and building a holistic understanding of system behavior beyond traditional monitoring approaches' current limitations.
  • Experience in software engineering, software development, and/or system operations.
  • Experience with APM and Observability using tools such as ELK Stack, AWS CloudWatch, Azure Monitor, New Relic, Splunk, Prometheus, Grafana, Sentry, etc.
  • Extensive understanding of the complexities native to modern distributed systems
  • Well-versed in the challenges posed by microservices architectures, cloud-native environments, and hybrid infrastructure setups.
  • Proven ability to lead complex initiatives/projects from inception to completion.
  • Ability to perform analysis on metrics & logs, using problem-solving techniques to provide guidance on monitoring, alerting, dashboarding and visualization.
  • Ability to work with a high level of autonomy and with a globally distributed team.
  • Excellent communication skills, both verbal and written; able to explain complex technical topics to both internal and external stakeholders with ease and in remote/distributed environments.

Preferred Qualifications:

  • Hands-on experience with Elasticsearch, including deployment and management of the Elastic Stack, Beats and/or Fleet Agents, APM, Dashboarding, and Reporting.
  • Hands-on experience with DevOps practices, including using GIT & Developing CI/CD Pipelines.
  • Hands-on experience with Infrastructure as Code (Terraform preferred)
  • Hands-on experience with Monitoring & Log Aggregation technologies
  • Hands-on experience with cloud infrastructure such as AWS, Azure, or Oracle Cloud Infrastructure.
  • Opinions about dashboards, metrics, and SLO’s
  • Strong knowledge of cloud design patterns for observability monitoring, resiliency, etc.
  • Ability to understand and write code to perform various tasks related to automation & monitoring.

Pay Range: $156,000 - $176,000 Annually

This hiring range is a reasonable estimate of the base pay range for this position at the time of posting. Pay is based on a number of factors which may include job-related knowledge, skills, experience, business requirements and geographic location.

What We Offer

By choice, we don’t simply accept individuality – we embrace it, we support it, and we thrive on it! Our People First Culture celebrates diversity, equity and inclusion not simply because it’s the right thing to do, but also because it’s the key to our success. We are proud to foster an authentic and inclusive workplace For All. You are free and encouraged to bring your entire, unique self to work. First American is an equal opportunity employer in every sense of the term.

Based on eligibility, First American offers a comprehensive benefits package including medical, dental, vision, 401k, PTO/paid sick leave and other great benefits like an employee stock purchase plan.

Salary

Competitive

Project Basis based

Remote Job

Worldwide

Job Overview
Job Posted:
1 year ago
Job Type
Contractual
Job Role
Any
Education
Any
Experience
Any
Total Vacancies
-

Share This Job:

Location

United States