Summary: The main function of a site reliability engineer is to participate in the full development lifecycle of creating automated solutions by designing and implementing performance tests, identify bottlenecks and opportunities for optimization and capacity demands, and present solutions for continuous improvements.
Job Responsibilities:
- Troubleshoot production processing and execute problem resolution through post-issue evaluations, root-cause analysis and remediation.
- Participate in the Agile software development process as a member of scrum teams.
- Analyze and address failure patterns and incidents in a team setting
- Analyze, performance test, document and identify optimization opportunities within software stack.
- Design automated software and product upgrades, change management and release management solutions
- Leverage background, knowledge and experience to coach or manage teams as applicable
Education/Experience:-Bachelor's degree in a technical field such as computer science, computer engineering or related field required. - 8-10 years of experience required