CheckSammy is a sustainability startup that provides purpose-built technology and an extensive network of service partners to deliver data-driven bulk waste removal, sustainability, and exterior maintenance services across North America. We're a fast-growing team of passionate individuals building an innovative and collaborative environment where you can make a real impact. We are looking to expand our operation by adding a new Site Reliability Engineer who can contribute to the establishment of best practices for infrastructure and application reliability
As a Site Reliability Engineer (SRE) on the Platform Engineering Team, you will be responsible for the stability of production tools and features, including system scalability and robustness. You will design, implement, and maintain the cloud infrastructure supporting our platform. You will also monitor the platform and provide deployment services that enable the rest of engineering to develop, deliver, and maintain our platform services.
The ideal candidate will have hands-on experience with cloud environments, containerized workloads, automation tools, and monitoring systems, as well as a proactive mindset for enhancing system availability and performance. Also, the ideal candidate will work closely with Engineering teams to identify and resolve performance bottlenecks.
You should apply if you possess:
· Bachelor's degree in Computer Science, Information Technology, or related fields
· 3+ years of progressive experience in a similar SRE, DevOps, or Infrastructure Engineer role.
· Strong technical experience with AWS cloud, Infrastructure as Code (IaC) tools like AWS Cloud Development Kit (CDK) or AWS CloudFormation, CI/CD tools (e.g. Jenkins, GitHub Actions) and containerization (e.g., Docker)
· Strong understanding of load balancers, REST APIs, networking (IP management, subnetting), and HA architecture.
· Working knowledge of serverless cloud computing
· Proficient with cloud monitoring and observability tools such as AWS CloudWatch, EFK Stack, OpenTelemetry, Datadog, Grafana, New Relic etc.
· Ability to define and track golden metrics and establish meaningful alerting thresholds.
· Strong analytical skills and proven track record in root cause analysis and incident management.
· Excellent communication and collaboration skills to work across teams.
· Cloud-related certifications such as AWS Certified DevOps Engineer, or Certified Kubernetes Administrator (CKA) is a plus
· Experience with Agile methodology or willingness to learn is preferred
· Preferred to be located in DFW area to be able to come on site as needed, if not located in other areas within TX, FL, CA, NC, MO, VA, PA, NJ
Who we are
At CheckSammy we believe landfill waste is a massive problem, and we knew we could do something about it. With our technology and commitment to sustainability, we’re redefining what it means to be a junk removal solutions and sustainability provider. We offer on-demand and subscription-based pricing and complete customization for all our services.
But what sets us apart is our proprietary technology, patented techniques, and exclusive partnerships with respected sustainability vendors across the board, allowing us to move efficiently and tackle complex waste and recycling situations. We take pride in providing junk removal solutions with a conscience in a data-driven world.
What You’ll Love About CheckSammy
CheckSammy’s greatest assets are the employees. The employees make the fast-paced and energetic culture a place you’ll love and want to be. A place where we are creating and innovating ways to help keep revolutionizing the future of waste, recycling and sustainability.
Job Type: Full-time
Pay: $80,000.00 - $120,000.00 per year
Benefits:
Dental insurance
Health insurance
Schedule:
8 hour shift
Location:
Dallas-Fort Worth, TX (Required)
Work Location: Remote