Shire Jobs

Mobile Shire Logo

Job Information

Aspen Dental Senior Site Reliability Engineer in Chicago, Illinois

The Aspen Group, Inc. (TAG) is one of the largest and most trusted retail healthcare business support organizations in the U.S., supporting 18,000 healthcare professionals and team members at more than 1,100 health and wellness offices across 46 states in three distinct categories: Dental care, urgent care, and medical aesthetics. Working in partnership with independent practice owners and clinicians, the team is united by a single purpose: to prove that healthcare can be better and smarter for everyone. TAG provides a comprehensive suite of centralized business support services that power the impact of four consumer-facing businesses: Aspen Dental, ClearChoice Dental Implant Centers, WellNow Urgent Care, Chapter Aesthetic Studio and AZPetVet. Each brand has access to a deep community of experts, tools and resources to grow their practices, and an unwavering commitment to delivering high-quality consumer healthcare experiences at scale.

Our continued growth has created an opportunity to join our Information Technology team as a Senior Site Reliability Engineer.

We are seeking a highly skilled and experienced Senior Site Reliability Engineer to join our team. As a Senior Site Reliability Engineer, you will be responsible for ensuring the reliability, availability, and performance of our multi-region production systems in GCP/AWS/Kubernetes.

Responsibilities:

  • Design and implement systems for monitoring, alerting, and logging of production systems using modern cloud native tools

  • Develop and maintain automation scripts and tools to streamline deployment, testing, and monitoring of production systems.

  • Collaborate with development teams to identify and resolve performance and reliability issues in production systems.

  • Develop and maintain disaster recovery plans and conduct regular disaster recovery tests.

  • Continuously improve the scalability and performance of our systems by identifying and addressing bottlenecks and inefficiencies.

  • Participate in creating shift left on-call strategies to ensure availability of production systems.

Requirements:

  • Bachelor's degree in computer science or a related field.

  • At least 3 years of experience in Senior Site Reliability Engineering or a similar role.

  • Proficiency in at least one programming language such as Python, C#, or Go.

  • Experience with containerization technologies such as Docker and Kubernetes.

  • Understanding of networking, distributed systems, and cloud infrastructure; preferably GCP.

  • Familiarity with monitoring and logging tools such as Prometheus, Grafana, Loki, and/or ELK Stack

  • Problem-solving skills and the ability to work independently and in a team environment.

  • Experience with incident management and root cause analysis.

Salary: $100,000 - 135,000/year

DirectEmployers