Staff Reliability Engineer
Job Details
- Location:
- Hartford, CT
- Category:
- Information Technology
- Employment Type:
- Full time
- Job Ref:
- R2623745
We’re determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to achieve your goals – and to help others accomplish theirs, too. Join our team as we help shape the future.
Position Overview:
The Staff Reliability Engineer plays a critical role in maintaining the stability, performance, and scalability of our systems and services. This senior-level position is responsible for implementing best practices in reliability engineering, driving continuous improvement, and mentoring team members. The ideal candidate possesses deep technical expertise, strong problem-solving skills, and a passion for building resilient infrastructure.
Key Responsibilities
Lead the design, implementation, and optimization of reliable systems and infrastructure.
Collaborate with software engineering, operations, and product teams to ensure uptime and availability targets are met.
Develop and maintain monitoring, alerting, and incident response strategies to detect and resolve issues quickly.
Conduct root cause analysis of system failures and drive corrective actions to prevent recurrence.
Advocate for reliability best practices and foster a culture of proactive risk mitigation across the organization.
Mentor and provide technical guidance to other reliability engineers and cross-functional team members.
Develop automation tools to enhance efficiency in deployment, monitoring, and recovery processes.
Participate in capacity planning, performance testing, and disaster recovery exercises.
Stay current with industry trends, emerging technologies, and best practices in reliability engineering.
Qualifications
5+ years of experience in reliability engineering, site reliability engineering (SRE), or related roles.
Expertise in cloud platforms (e.g., AWS, Azure, Google Cloud) and container orchestration (e.g., Kubernetes).
Strong programming skills in one or more languages (e.g., Python, Java).
Proven experience with logging and monitoring tools (e.g., Splunk, Dynatrace, Datadog) and incident management frameworks (e.g. ServiceNow).
Excellent analytical, troubleshooting, and communication skills.
Ability to lead complex projects and influence stakeholders at all levels.
Preferred Skills
Experience with infrastructure as code (e.g., Terraform, CloudFormation).
Knowledge of security best practices and compliance requirements.
Background in high-availability architectures and distributed systems.
Certifications in cloud or reliability engineering domains are a plus.
Work Environment
This position may require participation in an on-call rotation and occasional after-hours support for critical incidents. We offer a dynamic, collaborative environment where innovation and reliability are valued.
This role will have a Hybrid work schedule, with the expectation of working in an office (Columbus, OH, Chicago, IL, Hartford, CT or Charlotte, NC) 3 days a week (Tuesday through Thursday).
Candidates must be authorized to work in the US without company sponsorship. The company will not support the STEM OPT I-983 Training Plan endorsement for this position.
Compensation
The listed annualized base pay range is primarily based on analysis of similar positions in the external market. Actual base pay could vary and may be above or below the listed range based on factors including but not limited to performance, proficiency and demonstration of competencies required for the role. The base pay is just one component of The Hartford’s total compensation package for employees. Other rewards may include short-term or annual bonuses, long-term incentives, and on-the-spot recognition. The annualized base pay range for this role is:
$127,600 - $191,400Equal Opportunity Employer/Sex/Race/Color/Veterans/Disability/Sexual Orientation/Gender Identity or Expression/Religion/Age
About Us
We believe every day is a day to do right.
Featured Career Opportunities
-
Customer Care Representative
- Location
- San Antonio, TX
- Employment Type:
- Full time
- Job Ref:
- R2623665
-
Customer Care Representative
- Location
- Lake Mary, FL
- Employment Type:
- Full time
- Job Ref:
- R2623665
-
Associate Staff Attorney, Workers Compensation
- Location
- Los Angeles, CA
- Employment Type:
- Full time
- Job Ref:
- R2623751